Difference between revisions of "Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia"
(Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia - basic info) |
(Int.links) |
||
Line 1: | Line 1: | ||
− | '''Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia''' - scientific work related to Wikipedia quality published in 2016, written by Alessandro Raganato, Claudio Delli Bovi and Roberto Navigli. | + | '''Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2016, written by [[Alessandro Raganato]], [[Claudio Delli Bovi]] and [[Roberto Navigli]]. |
== Overview == | == Overview == | ||
− | The hyperlink structure of Wikipedia constitutes a key resource for many Natural Language Processing tasks and applications, as it provides several million semantic annotations of entities in context. Yet only a small fraction of mentions across the entire Wikipedia corpus is linked. In this paper authors present the automatic construction and evaluation of a Semantically Enriched Wikipedia (SEW) in which the overall number of linked mentions has been more than tripled solely by exploiting the structure of Wikipedia itself and the wide-coverage sense inventory of BabelNet. As a result authors obtain a sense-annotated corpus with more than 200 million annotations of over 4 million different concepts and named entities. Authors then show that corpus leads to competitive results on multiple tasks, such as Entity Linking and Word Similarity. | + | The hyperlink structure of [[Wikipedia]] constitutes a key resource for many [[Natural Language Processing]] tasks and applications, as it provides several million semantic annotations of entities in context. Yet only a small fraction of mentions across the entire Wikipedia corpus is linked. In this paper authors present the automatic construction and evaluation of a Semantically Enriched Wikipedia (SEW) in which the overall number of linked mentions has been more than tripled solely by exploiting the structure of Wikipedia itself and the wide-coverage sense inventory of BabelNet. As a result authors obtain a sense-annotated corpus with more than 200 million annotations of over 4 million different concepts and [[named entities]]. Authors then show that corpus leads to competitive results on multiple tasks, such as Entity Linking and Word Similarity. |
Revision as of 22:33, 14 July 2019
Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia - scientific work related to Wikipedia quality published in 2016, written by Alessandro Raganato, Claudio Delli Bovi and Roberto Navigli.
Overview
The hyperlink structure of Wikipedia constitutes a key resource for many Natural Language Processing tasks and applications, as it provides several million semantic annotations of entities in context. Yet only a small fraction of mentions across the entire Wikipedia corpus is linked. In this paper authors present the automatic construction and evaluation of a Semantically Enriched Wikipedia (SEW) in which the overall number of linked mentions has been more than tripled solely by exploiting the structure of Wikipedia itself and the wide-coverage sense inventory of BabelNet. As a result authors obtain a sense-annotated corpus with more than 200 million annotations of over 4 million different concepts and named entities. Authors then show that corpus leads to competitive results on multiple tasks, such as Entity Linking and Word Similarity.