Concept-Based Information Retrieval Using Explicit Semantic Analysis

From Wikipedia Quality
Jump to: navigation, search
Concept-Based Information Retrieval Using Explicit Semantic Analysis
Authors
Ofer Egozi
Shaul Markovitch
Evgeniy Gabrilovich
Publication date
2011
ISSN
10468188
DOI
10.1145/1961209.1961211
Links

Concept-Based Information Retrieval Using Explicit Semantic Analysis - scientific work about Wikipedia quality published in 2011, written by Ofer Egozi, Shaul Markovitch and Evgeniy Gabrilovich.

Overview

Information retrieval systems traditionally rely on textual keywords to index and retrieve documents. Keyword-based retrieval may return inaccurate and incomplete results when different keywords are used to describe the same concept in the documents and in the queries. Furthermore, the relationship between these related keywords may be semantic rather than syntactic, and capturing it thus requires access to comprehensive human world knowledge. Concept-based retrieval methods have attempted to tackle these difficulties by using manually built thesauri, by relying on term cooccurrence data, or by extracting latent word relationships and concepts from a corpus. In this article authors introduce a new concept-based retrieval approach based on Explicit Semantic Analysis (ESA), a recently proposed method that augments keywordbased text representation with concept-based features, automatically extracted from massive human knowledge repositories such as Wikipedia. Their approach generates new text features automatically, and authors have found that high-quality feature selection becomes crucial in this setting to make the retrieval more focused. However, due to the lack of labeled data, traditional feature selection methods cannot be used, hence authors propose new methods that use self-generated labeled training data. The resulting system is evaluated on several TREC datasets, showing superior performance over previous state-of-the-art results.

Embed

Wikipedia Quality

Egozi, Ofer; Markovitch, Shaul; Gabrilovich, Evgeniy. (2011). "[[Concept-Based Information Retrieval Using Explicit Semantic Analysis]]". ACM Transactions on Information Systems Volume 29, Issue 2, April 2011, Article number 8. ISSN: 10468188. DOI: 10.1145/1961209.1961211.

English Wikipedia

{{cite journal |last1=Egozi |first1=Ofer |last2=Markovitch |first2=Shaul |last3=Gabrilovich |first3=Evgeniy |title=Concept-Based Information Retrieval Using Explicit Semantic Analysis |date=2011 |issn=10468188 |doi=10.1145/1961209.1961211 |url=https://wikipediaquality.com/wiki/Concept-Based_Information_Retrieval_Using_Explicit_Semantic_Analysis |journal=ACM Transactions on Information Systems Volume 29, Issue 2, April 2011, Article number 8}}

HTML

Egozi, Ofer; Markovitch, Shaul; Gabrilovich, Evgeniy. (2011). &quot;<a href="https://wikipediaquality.com/wiki/Concept-Based_Information_Retrieval_Using_Explicit_Semantic_Analysis">Concept-Based Information Retrieval Using Explicit Semantic Analysis</a>&quot;. ACM Transactions on Information Systems Volume 29, Issue 2, April 2011, Article number 8. ISSN: 10468188. DOI: 10.1145/1961209.1961211.