Difference between revisions of "Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia"

From Wikipedia Quality
(Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia - basic info)
 
(Int.links)
Line 1: Line 1:
'''Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia''' - scientific work related to Wikipedia quality published in 2016, written by Alessandro Raganato, Claudio Delli Bovi and Roberto Navigli.
+
'''Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2016, written by [[Alessandro Raganato]], [[Claudio Delli Bovi]] and [[Roberto Navigli]].
  
 
== Overview ==
 
== Overview ==
The hyperlink structure of Wikipedia constitutes a key resource for many Natural Language Processing tasks and applications, as it provides several million semantic annotations of entities in context. Yet only a small fraction of mentions across the entire Wikipedia corpus is linked. In this paper authors present the automatic construction and evaluation of a Semantically Enriched Wikipedia (SEW) in which the overall number of linked mentions has been more than tripled solely by exploiting the structure of Wikipedia itself and the wide-coverage sense inventory of BabelNet. As a result authors obtain a sense-annotated corpus with more than 200 million annotations of over 4 million different concepts and named entities. Authors then show that corpus leads to competitive results on multiple tasks, such as Entity Linking and Word Similarity.
+
The hyperlink structure of [[Wikipedia]] constitutes a key resource for many [[Natural Language Processing]] tasks and applications, as it provides several million semantic annotations of entities in context. Yet only a small fraction of mentions across the entire Wikipedia corpus is linked. In this paper authors present the automatic construction and evaluation of a Semantically Enriched Wikipedia (SEW) in which the overall number of linked mentions has been more than tripled solely by exploiting the structure of Wikipedia itself and the wide-coverage sense inventory of BabelNet. As a result authors obtain a sense-annotated corpus with more than 200 million annotations of over 4 million different concepts and [[named entities]]. Authors then show that corpus leads to competitive results on multiple tasks, such as Entity Linking and Word Similarity.

Revision as of 22:33, 14 July 2019

Automatic Construction and Evaluation of a Large Semantically Enriched Wikipedia is a scientific work related to Wikipedia quality, published in 2016 and written by Alessandro Raganato, Claudio Delli Bovi and Roberto Navigli.

Overview

The hyperlink structure of Wikipedia constitutes a key resource for many Natural Language Processing tasks and applications, as it provides several million semantic annotations of entities in context. Yet only a small fraction of mentions across the entire Wikipedia corpus is linked. In this paper, the authors present the automatic construction and evaluation of a Semantically Enriched Wikipedia (SEW), in which the overall number of linked mentions has been more than tripled solely by exploiting the structure of Wikipedia itself and the wide-coverage sense inventory of BabelNet. As a result, the authors obtain a sense-annotated corpus with more than 200 million annotations of over 4 million different concepts and named entities. The authors then show that the corpus leads to competitive results on multiple tasks, such as Entity Linking and Word Similarity.