Latest revision as of 11:55, 8 May 2020


Extracting Structured Information from Wikipedia Articles to Populate Infoboxes
Authors: Dustin Lange, Christoph Böhm, Felix Naumann
Publication date: 2010
DOI: 10.1145/1871437.1871698
Links: Original, Preprint

Extracting Structured Information from Wikipedia Articles to Populate Infoboxes is a scientific work related to Wikipedia quality, published in 2010 and written by Dustin Lange, Christoph Böhm and Felix Naumann.

Overview

Roughly every third Wikipedia article contains an infobox, a table that displays important facts about the subject in attribute-value form. The schema of an infobox, i.e., the attributes that can be expressed for a concept, is defined by an infobox template. Wikipedia authors often do not specify all template attributes, which results in incomplete infoboxes. With iPopulator, the authors of the paper introduce a system that automatically populates infoboxes of Wikipedia articles by extracting attribute values from the article's text. In contrast to prior work, iPopulator detects and exploits the structure of attribute values to extract value parts independently. The authors tested iPopulator on the entire set of infobox templates and provide a detailed analysis of its effectiveness. For instance, they achieve an average extraction precision of 91% for 1,727 distinct infobox template attributes.
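The ideas in this overview (incomplete infoboxes, the structure of attribute values, and extraction precision) can be illustrated with a small Python sketch. This is not iPopulator's implementation, which the summary above does not describe in detail. The function names, the token classes NUM and WORD, and the sample values are all hypothetical and chosen only for illustration.

```python
import re
from collections import Counter

# Hypothetical tokenizer: a number, a run of letters, or a single punctuation mark.
TOKEN_RE = re.compile(r"\d[\d,.]*|[^\W\d_]+|[^\w\s]|_")


def missing_attributes(template_attrs, infobox):
    """Return the template attributes that the infobox leaves unset or empty."""
    return [a for a in template_attrs if not infobox.get(a, "").strip()]


def value_structure(value):
    """Reduce a value to a coarse structure, keeping punctuation as-is."""
    parts = []
    for tok in TOKEN_RE.findall(value):
        if tok[0].isdigit():
            parts.append("NUM")
        elif tok[0].isalpha():
            parts.append("WORD")
        else:
            parts.append(tok)
    return tuple(parts)


def dominant_structure(values):
    """Return the most frequent structure among known values of one attribute."""
    counts = Counter(value_structure(v) for v in values)
    return counts.most_common(1)[0][0]


def precision(extracted, gold):
    """Share of extracted attribute values that equal the gold values."""
    if not extracted:
        return 0.0
    correct = sum(1 for attr, val in extracted.items() if gold.get(attr) == val)
    return correct / len(extracted)


# Illustrative data only; none of it comes from the paper.
template = ["name", "founded", "num_employees"]
infobox = {"name": "Example Corp", "founded": ""}
print(missing_attributes(template, infobox))
print(dominant_structure(["12,500 (2008)", "300 (2010)", "about 40"]))
print(precision({"founded": "1999", "num_employees": "300"},
                {"founded": "1999", "num_employees": "350"}))
```

Once a dominant structure such as a number followed by a parenthesised number is known, each part (here the count and the year) could be looked up in the article text separately, which is the intuition behind extracting value parts independently.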

Embed

Wikipedia Quality

Lange, Dustin; Böhm, Christoph; Naumann, Felix. (2010). "[[Extracting Structured Information from Wikipedia Articles to Populate Infoboxes]]". DOI: 10.1145/1871437.1871698.

English Wikipedia

{{cite journal |last1=Lange |first1=Dustin |last2=Böhm |first2=Christoph |last3=Naumann |first3=Felix |title=Extracting Structured Information from Wikipedia Articles to Populate Infoboxes |date=2010 |doi=10.1145/1871437.1871698 |url=https://wikipediaquality.com/wiki/Extracting_Structured_Information_from_Wikipedia_Articles_to_Populate_Infoboxes}}

HTML

Lange, Dustin; Böhm, Christoph; Naumann, Felix. (2010). &quot;<a href="https://wikipediaquality.com/wiki/Extracting_Structured_Information_from_Wikipedia_Articles_to_Populate_Infoboxes">Extracting Structured Information from Wikipedia Articles to Populate Infoboxes</a>&quot;. DOI: 10.1145/1871437.1871698.