Experiments with Result Diversity and Entity Ranking: Text, Anchors, Links, and Wikipedia

From Wikipedia Quality
Jump to: navigation, search


Experiments with Result Diversity and Entity Ranking: Text, Anchors, Links, and Wikipedia
Authors
Rianne Kaptein
Marijn Koolen
Jaap Kamps
Publication date
2009
Links
Original Preprint

Experiments with Result Diversity and Entity Ranking: Text, Anchors, Links, and Wikipedia - scientific work related to Wikipedia quality published in 2009, written by Rianne Kaptein, Marijn Koolen and Jaap Kamps.

Overview

In this paper, authors document efforts in participating to the TREC 2009 Entity Ranking and Web Tracks. Authors had multiple aims: For the Web Track’s Adhoc task authors experiment with document text and anchor text representation, and the use of the link structure. For the Web Track’s Diversity task authors experiment with using a top down sliding window that, given the top ranked documents, chooses as the next ranked document the one that has the most unique terms or links. Authors test sliding window method on a standard document text index and an index of propagated anchor texts. Authors also experiment with extreme query expansions by taking the top n results of the initial ranking as multi-faceted aspects of the topic to construct n relevance models to obtain n sets of results. A final diverse set of results is obtained by merging the n results lists. For the Entity Ranking Track, authors also explore the effectiveness of the anchor text representation, look at the co-citation graph, and experiment with using Wikipedia as a pivot. Authors main findings can be summarized as follows: Anchor text is very effective for diversity. It gives high early precision and the results cover more relevant sub-topics than the document text index. Authors baseline runs have low diversity, which limits the possible impact of the sliding window approach. New link information seems more effective for diversifying text-based search results than the amount of unique terms added by a document. Anchor text is also very effective for entity ranking. Using Wikipedia as a pivot results in a gain of precision, but at the cost of a loss of recall.

Embed

Wikipedia Quality

Kaptein, Rianne; Koolen, Marijn; Kamps, Jaap. (2009). "[[Experiments with Result Diversity and Entity Ranking: Text, Anchors, Links, and Wikipedia]]". National Institute for Standards and Technology (NIST).

English Wikipedia

{{cite journal |last1=Kaptein |first1=Rianne |last2=Koolen |first2=Marijn |last3=Kamps |first3=Jaap |title=Experiments with Result Diversity and Entity Ranking: Text, Anchors, Links, and Wikipedia |date=2009 |url=https://wikipediaquality.com/wiki/Experiments_with_Result_Diversity_and_Entity_Ranking:_Text,_Anchors,_Links,_and_Wikipedia |journal=National Institute for Standards and Technology (NIST)}}

HTML

Kaptein, Rianne; Koolen, Marijn; Kamps, Jaap. (2009). &quot;<a href="https://wikipediaquality.com/wiki/Experiments_with_Result_Diversity_and_Entity_Ranking:_Text,_Anchors,_Links,_and_Wikipedia">Experiments with Result Diversity and Entity Ranking: Text, Anchors, Links, and Wikipedia</a>&quot;. National Institute for Standards and Technology (NIST).