Using Wikipedia as a Reference for Extracting Semantic Information from a Text

From Wikipedia Quality
Jump to: navigation, search


Using Wikipedia as a Reference for Extracting Semantic Information from a Text
Authors
Andrea Prato
Marco Ronchetti
Publication date
2009
DOI
10.1109/SEMAPRO.2009.24
Links
Original

Using Wikipedia as a Reference for Extracting Semantic Information from a Text - scientific work related to Wikipedia quality published in 2009, written by Andrea Prato and Marco Ronchetti.

Overview

In this paper authors present an algorithm that, using Wikipedia as a reference, extracts semantic information from an arbitrary text. Authors algorithm refines a procedure proposed by others, which mines all the text contained in the whole Wikipedia. Authors refinement, based on a clustering approach, exploits the semantic information contained in certain types of Wikipedia hyperlinks, and also introduces an analysis based on multi-words. Authors algorithm outperforms current methods in that the output contains many less false positives. Authors were also able to understand which (structural) part of the texts provides most of the semantic information extracted by the algorithm.

Embed

Wikipedia Quality

Prato, Andrea; Ronchetti, Marco. (2009). "[[Using Wikipedia as a Reference for Extracting Semantic Information from a Text]]".DOI: 10.1109/SEMAPRO.2009.24.

English Wikipedia

{{cite journal |last1=Prato |first1=Andrea |last2=Ronchetti |first2=Marco |title=Using Wikipedia as a Reference for Extracting Semantic Information from a Text |date=2009 |doi=10.1109/SEMAPRO.2009.24 |url=https://wikipediaquality.com/wiki/Using_Wikipedia_as_a_Reference_for_Extracting_Semantic_Information_from_a_Text}}

HTML

Prato, Andrea; Ronchetti, Marco. (2009). &quot;<a href="https://wikipediaquality.com/wiki/Using_Wikipedia_as_a_Reference_for_Extracting_Semantic_Information_from_a_Text">Using Wikipedia as a Reference for Extracting Semantic Information from a Text</a>&quot;.DOI: 10.1109/SEMAPRO.2009.24.