Difference between revisions of "Automatic Identification and Disambiguation of Concepts and Named Entities in the Multilingual Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(Overview: Automatic Identification and Disambiguation of Concepts and Named Entities in the Multilingual Wikipedia)
 
(Wikilinks)
 
Line 1: Line 1:
'''Automatic Identification and Disambiguation of Concepts and Named Entities in the Multilingual Wikipedia''' - scientific work related to Wikipedia quality published in 2015, written by Federico Scozzafava, Alessandro Raganato, Andrea Moro and Roberto Navigli.
+
'''Automatic Identification and Disambiguation of Concepts and Named Entities in the Multilingual Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2015, written by [[Federico Scozzafava]], [[Alessandro Raganato]], [[Andrea Moro]] and [[Roberto Navigli]].
  
 
== Overview ==
 
== Overview ==
In this paper authors present an automatic multilingual annotation of the Wikipedia dumps in two languages, with both word senses (i.e. concepts) and named entities. Authors use Babelfy 1.0, a state-of-the-art multilingual Word Sense Disambiguation and Entity Linking system. As its reference inventory, Babelfy draws upon BabelNet 3.0, a very large multilingual encyclopedic dictionary and semantic network which connects concepts and named entities in 271 languages from different inventories, such as WordNet, Open Multilingual WordNet, Wikipedia, OmegaWiki, Wiktionary and Wikidata. In addition, authors perform both an automatic evaluation of the dataset and a language-specific statistical analysis. In detail, authors investigate the word sense distributions by part-of-speech and language, together with the similarity of the annotated entities and concepts for a random sample of interlinked Wikipedia pages in different languages. The annotated corpora are available at http://lcl.uniroma1.it/babelfied-wikipedia/.
+
In this paper authors present an automatic [[multilingual]] annotation of the [[Wikipedia]] dumps in two languages, with both word senses (i.e. concepts) and [[named entities]]. Authors use Babelfy 1.0, a state-of-the-art multilingual Word Sense Disambiguation and Entity Linking system. As its reference inventory, Babelfy draws upon BabelNet 3.0, a very large multilingual encyclopedic dictionary and semantic network which connects concepts and named entities in 271 languages from different inventories, such as [[WordNet]], Open Multilingual WordNet, Wikipedia, OmegaWiki, Wiktionary and [[Wikidata]]. In addition, authors perform both an automatic evaluation of the dataset and a language-specific statistical analysis. In detail, authors investigate the word sense distributions by part-of-speech and language, together with the similarity of the annotated entities and concepts for a random sample of interlinked Wikipedia pages in [[different language]]s. The annotated corpora are available at http://lcl.uniroma1.it/babelfied-wikipedia/.

Latest revision as of 23:01, 12 August 2019

Automatic Identification and Disambiguation of Concepts and Named Entities in the Multilingual Wikipedia - scientific work related to Wikipedia quality published in 2015, written by Federico Scozzafava, Alessandro Raganato, Andrea Moro and Roberto Navigli.

Overview

In this paper authors present an automatic multilingual annotation of the Wikipedia dumps in two languages, with both word senses (i.e. concepts) and named entities. Authors use Babelfy 1.0, a state-of-the-art multilingual Word Sense Disambiguation and Entity Linking system. As its reference inventory, Babelfy draws upon BabelNet 3.0, a very large multilingual encyclopedic dictionary and semantic network which connects concepts and named entities in 271 languages from different inventories, such as WordNet, Open Multilingual WordNet, Wikipedia, OmegaWiki, Wiktionary and Wikidata. In addition, authors perform both an automatic evaluation of the dataset and a language-specific statistical analysis. In detail, authors investigate the word sense distributions by part-of-speech and language, together with the similarity of the annotated entities and concepts for a random sample of interlinked Wikipedia pages in different languages. The annotated corpora are available at http://lcl.uniroma1.it/babelfied-wikipedia/.