Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia

From Wikipedia Quality
Revision as of 08:14, 18 July 2019 by Zoe (talk | contribs) (infobox)
Jump to: navigation, search


Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia
Authors
Alessandro Raganato
Claudio Delli Bovi
Roberto Navigli
Publication date
2016
Links
Original

Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia - scientific work related to Wikipedia quality published in 2016, written by Alessandro Raganato, Claudio Delli Bovi and Roberto Navigli.

Overview

The hyperlink structure of Wikipedia constitutes a key resource for many Natural Language Processing tasks and applications, as it provides several million semantic annotations of entities in context. Yet only a small fraction of mentions across the entire Wikipedia corpus is linked. In this paper authors present the automatic construction and evaluation of a Semantically Enriched Wikipedia (SEW) in which the overall number of linked mentions has been more than tripled solely by exploiting the structure of Wikipedia itself and the wide-coverage sense inventory of BabelNet. As a result authors obtain a sense-annotated corpus with more than 200 million annotations of over 4 million different concepts and named entities. Authors then show that corpus leads to competitive results on multiple tasks, such as Entity Linking and Word Similarity.