Exploiting Wikipedia for Directional Inferential Text Similarity

From Wikipedia Quality
Revision as of 09:36, 1 October 2019 by RaeLynn (talk | contribs) (+ wikilinks)

Exploiting Wikipedia for Directional Inferential Text Similarity is a scientific work related to Wikipedia quality, published in 2008 and written by Leong Chee Wee and Samer Hassan.

Overview

In natural languages, variability of semantic expression refers to the situation where the same meaning can be inferred from different words or texts. Many natural language processing tasks (e.g. question answering, information retrieval, document summarization) model this variability by requiring a specific target meaning to be inferred from different text variants, so it is helpful to capture text similarity in a directional manner to serve such inference needs. In this paper, the authors show how Wikipedia can be used as a semantic resource to build a directional inferential similarity metric between words and, subsequently, between texts. Through experiments on a standard evaluation dataset, the authors show that the Wikipedia-based metric performs significantly better than a random metric baseline, reducing the error rate by 16.1%.
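To illustrate what "directional" means here, the following is a minimal sketch and not the authors' metric, whose details are not given on this page. It assumes each word is represented by a small hand-made vector of Wikipedia concept weights (the `CONCEPTS` table, its article titles, and its weights are all hypothetical), uses plain cosine similarity between words, and obtains directionality only from how word scores are aggregated: each word of the hypothesis is matched against its best counterpart in the text, so swapping the two arguments can change the score.

```python
from math import sqrt

# Hypothetical toy data: word -> {Wikipedia article title: weight}.
# A real system would derive these weights from Wikipedia itself.
CONCEPTS = {
    "cat": {"Cat": 1.0, "Pet": 0.5},
    "kitten": {"Cat": 0.8, "Pet": 0.4},
    "dog": {"Dog": 1.0, "Pet": 0.5},
}


def word_sim(a, b):
    """Cosine similarity of two sparse concept vectors (0.0 for unknown words)."""
    va, vb = CONCEPTS.get(a, {}), CONCEPTS.get(b, {})
    dot = sum(w * vb.get(c, 0.0) for c, w in va.items())
    norm_a = sqrt(sum(w * w for w in va.values()))
    norm_b = sqrt(sum(w * w for w in vb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def directional_sim(text, hypothesis):
    """Score how well `hypothesis` is covered by `text` (assumed aggregation).

    For every hypothesis word, take its best match among the text words,
    then average over the hypothesis. The result is asymmetric in general.
    """
    t_words = text.lower().split()
    h_words = hypothesis.lower().split()
    if not t_words or not h_words:
        return 0.0
    best = [max(word_sim(h, t) for t in t_words) for h in h_words]
    return sum(best) / len(h_words)


if __name__ == "__main__":
    # The longer text covers the short hypothesis fully, but not vice versa.
    print(directional_sim("cat dog", "cat"))
    print(directional_sim("cat", "cat dog"))
```

With this toy data, the score from "cat dog" to "cat" is higher than the score in the opposite direction, because the single word "cat" only partially accounts for "dog". The word-level measure in this sketch is symmetric, whereas the paper describes a directional metric at the word level as well, so the sketch should be read only as an illustration of the asymmetry idea.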