Entity Ranking Using Wikipedia as a Pivot

From Wikipedia Quality
Revision as of 10:28, 11 July 2019 by Adalynn (talk | contribs) (+ Infobox work)
Jump to: navigation, search


Entity Ranking Using Wikipedia as a Pivot
Authors
Rianne Kaptein
Pavel Serdyukov
Arjen P. de Vries
Jaap Kamps
Publication date
2010
DOI
10.1145/1871437.1871451
Links
Original

Entity Ranking Using Wikipedia as a Pivot - scientific work related to Wikipedia quality published in 2010, written by Rianne Kaptein, Pavel Serdyukov, Arjen P. de Vries and Jaap Kamps.

Overview

In this paper authors investigate the task of Entity Ranking on the Web. Searchers looking for entities are arguably better served by presenting a ranked list of entities directly, rather than a list of web pages with relevant but also potentially redundant information about these entities. Since entities are represented by their web homepages, a naive approach to entity ranking is to use standard text retrieval. Authors experimental results clearly demonstrate that text retrieval is effective at finding relevant pages, but performs poorly at finding entities. Authors proposal is to use Wikipedia as a pivot for finding entities on the Web, allowing us to reduce the hard web entity ranking problem to easier problem of Wikipedia entity ranking. Wikipedia allows us to properly identify entities and some of their characteristics, and Wikipedia's elaborate category structure allows us to get a handle on the entity's type. Authors main findings are the following. Authors first finding is that, in principle, the problem of web entity ranking can be reduced to Wikipedia entity ranking. Authors found that the majority of entity ranking topics in test collections can be answered using Wikipedia, and that with high precision relevant web entities corresponding to the Wikipedia entities can be found using Wikipedia's 'external links'. Authors second finding is that authors can exploit the structure of Wikipedia to improve entity ranking effectiveness. Entity types are valuable retrieval cues in Wikipedia. Automatically assigned entity types are effective, and almost as good as manually assigned types. Authors third finding is that web entity retrieval can be significantly improved by using Wikipedia as a pivot. Both Wikipedia's external links and the enriched Wikipedia entities with additional links to homepages are significantly better at finding primary web homepages than anchor text retrieval, which in turn significantly improved over standard text retrieval.