Difference between revisions of "Automatic Detection of Outdated Information in Wikipedia Infoboxes"

From Wikipedia Quality
Jump to: navigation, search
(+ infobox)
(+ embed code)
Line 9: Line 9:
 
== Overview ==
 
== Overview ==
 
An infobox of a [[Wikipedia]] article generally contains key facts in the article and is organized as attribute-value pairs. Infoboxes not only allow read- ers to rapidly gather the most important information about some aspects of the articles in which they appear, but also provide a source for many knowledge ba- ses derived from Wikipedia. However, not all the values of infobox attributes are updated frequently and accurately. In this paper, authors propose a method to au- tomatically detect outdated attribute values in Wikipedia [[infoboxes]] by using facts extracted from the general Web. Authors method uses the pattern-based fact extraction approach. The patterns for fact extraction are automatically learned using a number of available seeds in related Wikipedia infoboxes. Authors have tested and evaluated system on a set of 100 well-established com- panies in the NASDAQ-100 index on their employee numbers, presented by the num_employees attribute value in their Wikipedia article infoboxes. The achieved accuracy is 77% and test result also reveals that 82% of the companies do not have their latest numbers of employees in their Wikipedia article infoboxes.
 
An infobox of a [[Wikipedia]] article generally contains key facts in the article and is organized as attribute-value pairs. Infoboxes not only allow read- ers to rapidly gather the most important information about some aspects of the articles in which they appear, but also provide a source for many knowledge ba- ses derived from Wikipedia. However, not all the values of infobox attributes are updated frequently and accurately. In this paper, authors propose a method to au- tomatically detect outdated attribute values in Wikipedia [[infoboxes]] by using facts extracted from the general Web. Authors method uses the pattern-based fact extraction approach. The patterns for fact extraction are automatically learned using a number of available seeds in related Wikipedia infoboxes. Authors have tested and evaluated system on a set of 100 well-established com- panies in the NASDAQ-100 index on their employee numbers, presented by the num_employees attribute value in their Wikipedia article infoboxes. The achieved accuracy is 77% and test result also reveals that 82% of the companies do not have their latest numbers of employees in their Wikipedia article infoboxes.
 +
 +
== Embed ==
 +
=== Wikipedia Quality ===
 +
<code>
 +
<nowiki>
 +
Tran, Thong; Cao, Tru H.. (2013). "[[Automatic Detection of Outdated Information in Wikipedia Infoboxes]]".
 +
</nowiki>
 +
</code>
 +
 +
=== English Wikipedia ===
 +
<code>
 +
<nowiki>
 +
{{cite journal |last1=Tran |first1=Thong |last2=Cao |first2=Tru H. |title=Automatic Detection of Outdated Information in Wikipedia Infoboxes |date=2013 |url=https://wikipediaquality.com/wiki/Automatic_Detection_of_Outdated_Information_in_Wikipedia_Infoboxes}}
 +
</nowiki>
 +
</code>
 +
 +
=== HTML ===
 +
<code>
 +
<nowiki>
 +
Tran, Thong; Cao, Tru H.. (2013). &amp;quot;<a href="https://wikipediaquality.com/wiki/Automatic_Detection_of_Outdated_Information_in_Wikipedia_Infoboxes">Automatic Detection of Outdated Information in Wikipedia Infoboxes</a>&amp;quot;.
 +
</nowiki>
 +
</code>

Revision as of 05:43, 24 October 2020


Automatic Detection of Outdated Information in Wikipedia Infoboxes
Authors
Thong Tran
Tru H. Cao
Publication date
2013
Links
Original

Automatic Detection of Outdated Information in Wikipedia Infoboxes - scientific work related to Wikipedia quality published in 2013, written by Thong Tran and Tru H. Cao.

Overview

An infobox of a Wikipedia article generally contains key facts in the article and is organized as attribute-value pairs. Infoboxes not only allow read- ers to rapidly gather the most important information about some aspects of the articles in which they appear, but also provide a source for many knowledge ba- ses derived from Wikipedia. However, not all the values of infobox attributes are updated frequently and accurately. In this paper, authors propose a method to au- tomatically detect outdated attribute values in Wikipedia infoboxes by using facts extracted from the general Web. Authors method uses the pattern-based fact extraction approach. The patterns for fact extraction are automatically learned using a number of available seeds in related Wikipedia infoboxes. Authors have tested and evaluated system on a set of 100 well-established com- panies in the NASDAQ-100 index on their employee numbers, presented by the num_employees attribute value in their Wikipedia article infoboxes. The achieved accuracy is 77% and test result also reveals that 82% of the companies do not have their latest numbers of employees in their Wikipedia article infoboxes.

Embed

Wikipedia Quality

Tran, Thong; Cao, Tru H.. (2013). "[[Automatic Detection of Outdated Information in Wikipedia Infoboxes]]".

English Wikipedia

{{cite journal |last1=Tran |first1=Thong |last2=Cao |first2=Tru H. |title=Automatic Detection of Outdated Information in Wikipedia Infoboxes |date=2013 |url=https://wikipediaquality.com/wiki/Automatic_Detection_of_Outdated_Information_in_Wikipedia_Infoboxes}}

HTML

Tran, Thong; Cao, Tru H.. (2013). &quot;<a href="https://wikipediaquality.com/wiki/Automatic_Detection_of_Outdated_Information_in_Wikipedia_Infoboxes">Automatic Detection of Outdated Information in Wikipedia Infoboxes</a>&quot;.