Wikipedia-Based Semantic Smoothing for the Language Modeling Approach to Information Retrieval

From Wikipedia Quality
Jump to: navigation, search


Wikipedia-Based Semantic Smoothing for the Language Modeling Approach to Information Retrieval
Authors
Xinhui Tu
Tingting He
Long Chen
Jing Luo
Maoyuan Zhang
Publication date
2010
DOI
10.1007/978-3-642-12275-0_33
Links
Original

Wikipedia-Based Semantic Smoothing for the Language Modeling Approach to Information Retrieval - scientific work related to Wikipedia quality published in 2010, written by Xinhui Tu, Tingting He, Long Chen, Jing Luo and Maoyuan Zhang.

Overview

Semantic smoothing for the language modeling approach to information retrieval is significant and effective to improve retrieval performance. In previous methods such as the translation model, individual terms or phrases are used to do semantic mapping. These models are not very efficient when faced with ambiguous words and phrases because they are unable to incorporate contextual information. To overcome this limitation, authors propose a novel Wikipedia-based semantic smoothing method that decomposes a document into a set of weighted Wikipedia concepts and then maps those unambiguous Wikipedia concepts into query terms. The mapping probabilities from each Wikipedia concept to individual terms are estimated through the EM algorithm. Document models based on Wikipedia concept mapping are then derived. The new smoothing method is evaluated on the TREC Ad Hoc Track (Disks 1, 2, and 3) collections. Experiments show significant improvements over the two-stage language model, as well as the language model with translation-based semantic smoothing.

Embed

Wikipedia Quality

Tu, Xinhui; He, Tingting; Chen, Long; Luo, Jing; Zhang, Maoyuan. (2010). "[[Wikipedia-Based Semantic Smoothing for the Language Modeling Approach to Information Retrieval]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-12275-0_33.

English Wikipedia

{{cite journal |last1=Tu |first1=Xinhui |last2=He |first2=Tingting |last3=Chen |first3=Long |last4=Luo |first4=Jing |last5=Zhang |first5=Maoyuan |title=Wikipedia-Based Semantic Smoothing for the Language Modeling Approach to Information Retrieval |date=2010 |doi=10.1007/978-3-642-12275-0_33 |url=https://wikipediaquality.com/wiki/Wikipedia-Based_Semantic_Smoothing_for_the_Language_Modeling_Approach_to_Information_Retrieval |journal=Springer, Berlin, Heidelberg}}

HTML

Tu, Xinhui; He, Tingting; Chen, Long; Luo, Jing; Zhang, Maoyuan. (2010). &quot;<a href="https://wikipediaquality.com/wiki/Wikipedia-Based_Semantic_Smoothing_for_the_Language_Modeling_Approach_to_Information_Retrieval">Wikipedia-Based Semantic Smoothing for the Language Modeling Approach to Information Retrieval</a>&quot;. Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-12275-0_33.