Using Linked Data to Mine Rdf from Wikipedia's Tables

From Wikipedia Quality
Jump to: navigation, search


Using Linked Data to Mine Rdf from Wikipedia's Tables
Authors
Emir Muñoz
Aidan Hogan
Alessandra Mileo
Publication date
2014
DOI
10.1145/2556195.2556266
Links
Original Preprint

Using Linked Data to Mine Rdf from Wikipedia's Tables - scientific work related to Wikipedia quality published in 2014, written by Emir Muñoz, Aidan Hogan and Alessandra Mileo.

Overview

The tables embedded in Wikipedia articles contain rich, semi-structured encyclopaedic content. However, the cumulative content of these tables cannot be queried against. Authors thus propose methods to recover the semantics of Wikipedia tables and, in particular, to extract facts from them in the form of RDF triples. Authors core method uses an existing Linked Data knowledge-base to find pre-existing relations between entities in Wikipedia tables, suggesting the same relations as holding for other entities in analogous columns on different rows. Authors find that such an approach extracts RDF triples from Wikipedia's tables at a raw precision of 40%. To improve the raw precision, authors define a set of features for extracted triples that are tracked during the extraction phase. Using a manually labelled gold standard, authors then test a variety of machine learning methods for classifying correct/incorrect triples. One such method extracts 7.9 million unique and novel RDF triples from over one million Wikipedia tables at an estimated precision of 81.5%.

Embed

Wikipedia Quality

Muñoz, Emir; Hogan, Aidan; Mileo, Alessandra. (2014). "[[Using Linked Data to Mine Rdf from Wikipedia's Tables]]".DOI: 10.1145/2556195.2556266.

English Wikipedia

{{cite journal |last1=Muñoz |first1=Emir |last2=Hogan |first2=Aidan |last3=Mileo |first3=Alessandra |title=Using Linked Data to Mine Rdf from Wikipedia's Tables |date=2014 |doi=10.1145/2556195.2556266 |url=https://wikipediaquality.com/wiki/Using_Linked_Data_to_Mine_Rdf_from_Wikipedia's_Tables}}

HTML

Muñoz, Emir; Hogan, Aidan; Mileo, Alessandra. (2014). &quot;<a href="https://wikipediaquality.com/wiki/Using_Linked_Data_to_Mine_Rdf_from_Wikipedia's_Tables">Using Linked Data to Mine Rdf from Wikipedia's Tables</a>&quot;.DOI: 10.1145/2556195.2556266.