Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia

From Wikipedia Quality
Revision as of 00:59, 28 May 2019 by Elizabeth (talk | contribs) (Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia - basic info)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia - scientific work related to Wikipedia quality published in 2016, written by Alessandro Raganato, Claudio Delli Bovi and Roberto Navigli.

Overview

The hyperlink structure of Wikipedia constitutes a key resource for many Natural Language Processing tasks and applications, as it provides several million semantic annotations of entities in context. Yet only a small fraction of mentions across the entire Wikipedia corpus is linked. In this paper authors present the automatic construction and evaluation of a Semantically Enriched Wikipedia (SEW) in which the overall number of linked mentions has been more than tripled solely by exploiting the structure of Wikipedia itself and the wide-coverage sense inventory of BabelNet. As a result authors obtain a sense-annotated corpus with more than 200 million annotations of over 4 million different concepts and named entities. Authors then show that corpus leads to competitive results on multiple tasks, such as Entity Linking and Word Similarity.