Difference between revisions of "Cross-Lingual Entity Matching and Infobox Alignment in Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(infobox)
(Embed for English Wikipedia, HTML)
Line 10: Line 10:
 
== Overview ==
 
== Overview ==
 
Wikipedia has grown to a huge, multi-lingual source of encyclopedic knowledge. Apart from textual content, a large and ever-increasing number of articles feature so-called [[infoboxes]], which provide factual information about the articles' subjects. As the different [[language versions]] evolve independently, they provide different information on the same topics. Correspondences between infobox attributes in [[different language]] editions can be leveraged for several use cases, such as automatic detection and resolution of inconsistencies in infobox data across language versions, or the automatic augmentation of infoboxes in one language with data from other language versions. Authors present an instance-based schema matching technique that exploits information overlap in infoboxes across different language editions. As a prerequisite authors present a graph-based approach to identify articles in different languages representing the same real-world entity using (and correcting) the interlanguage links in [[Wikipedia]]. To account for the untyped nature of infobox schemas, authors present a robust similarity measure that can reliably quantify the similarity of strings with mixed types of data. The qualitative evaluation on the basis of manually labeled attribute correspondences between infoboxes in four of the largest Wikipedia editions demonstrates the effectiveness of the proposed approach.
 
Wikipedia has grown to a huge, multi-lingual source of encyclopedic knowledge. Apart from textual content, a large and ever-increasing number of articles feature so-called [[infoboxes]], which provide factual information about the articles' subjects. As the different [[language versions]] evolve independently, they provide different information on the same topics. Correspondences between infobox attributes in [[different language]] editions can be leveraged for several use cases, such as automatic detection and resolution of inconsistencies in infobox data across language versions, or the automatic augmentation of infoboxes in one language with data from other language versions. Authors present an instance-based schema matching technique that exploits information overlap in infoboxes across different language editions. As a prerequisite authors present a graph-based approach to identify articles in different languages representing the same real-world entity using (and correcting) the interlanguage links in [[Wikipedia]]. To account for the untyped nature of infobox schemas, authors present a robust similarity measure that can reliably quantify the similarity of strings with mixed types of data. The qualitative evaluation on the basis of manually labeled attribute correspondences between infoboxes in four of the largest Wikipedia editions demonstrates the effectiveness of the proposed approach.
 +
 +
== Embed ==
 +
=== Wikipedia Quality ===
 +
<code>
 +
<nowiki>
 +
Rinser, Daniel; Lange, Dustin; Naumann, Felix. (2013). "[[Cross-Lingual Entity Matching and Infobox Alignment in Wikipedia]]". Elsevier Science Ltd.. DOI: 10.1016/j.is.2012.10.003.
 +
</nowiki>
 +
</code>
 +
 +
=== English Wikipedia ===
 +
<code>
 +
<nowiki>
 +
{{cite journal |last1=Rinser |first1=Daniel |last2=Lange |first2=Dustin |last3=Naumann |first3=Felix |title=Cross-Lingual Entity Matching and Infobox Alignment in Wikipedia |date=2013 |doi=10.1016/j.is.2012.10.003 |url=https://wikipediaquality.com/wiki/Cross-Lingual_Entity_Matching_and_Infobox_Alignment_in_Wikipedia |journal=Elsevier Science Ltd.}}
 +
</nowiki>
 +
</code>
 +
 +
=== HTML ===
 +
<code>
 +
<nowiki>
 +
Rinser, Daniel; Lange, Dustin; Naumann, Felix. (2013). &amp;quot;<a href="https://wikipediaquality.com/wiki/Cross-Lingual_Entity_Matching_and_Infobox_Alignment_in_Wikipedia">Cross-Lingual Entity Matching and Infobox Alignment in Wikipedia</a>&amp;quot;. Elsevier Science Ltd.. DOI: 10.1016/j.is.2012.10.003.
 +
</nowiki>
 +
</code>

Revision as of 23:52, 4 January 2021


Cross-Lingual Entity Matching and Infobox Alignment in Wikipedia
Authors
Daniel Rinser
Dustin Lange
Felix Naumann
Publication date
2013
DOI
10.1016/j.is.2012.10.003
Links
Original

Cross-Lingual Entity Matching and Infobox Alignment in Wikipedia - scientific work related to Wikipedia quality published in 2013, written by Daniel Rinser, Dustin Lange and Felix Naumann.

Overview

Wikipedia has grown to a huge, multi-lingual source of encyclopedic knowledge. Apart from textual content, a large and ever-increasing number of articles feature so-called infoboxes, which provide factual information about the articles' subjects. As the different language versions evolve independently, they provide different information on the same topics. Correspondences between infobox attributes in different language editions can be leveraged for several use cases, such as automatic detection and resolution of inconsistencies in infobox data across language versions, or the automatic augmentation of infoboxes in one language with data from other language versions. Authors present an instance-based schema matching technique that exploits information overlap in infoboxes across different language editions. As a prerequisite authors present a graph-based approach to identify articles in different languages representing the same real-world entity using (and correcting) the interlanguage links in Wikipedia. To account for the untyped nature of infobox schemas, authors present a robust similarity measure that can reliably quantify the similarity of strings with mixed types of data. The qualitative evaluation on the basis of manually labeled attribute correspondences between infoboxes in four of the largest Wikipedia editions demonstrates the effectiveness of the proposed approach.

Embed

Wikipedia Quality

Rinser, Daniel; Lange, Dustin; Naumann, Felix. (2013). "[[Cross-Lingual Entity Matching and Infobox Alignment in Wikipedia]]". Elsevier Science Ltd.. DOI: 10.1016/j.is.2012.10.003.

English Wikipedia

{{cite journal |last1=Rinser |first1=Daniel |last2=Lange |first2=Dustin |last3=Naumann |first3=Felix |title=Cross-Lingual Entity Matching and Infobox Alignment in Wikipedia |date=2013 |doi=10.1016/j.is.2012.10.003 |url=https://wikipediaquality.com/wiki/Cross-Lingual_Entity_Matching_and_Infobox_Alignment_in_Wikipedia |journal=Elsevier Science Ltd.}}

HTML

Rinser, Daniel; Lange, Dustin; Naumann, Felix. (2013). &quot;<a href="https://wikipediaquality.com/wiki/Cross-Lingual_Entity_Matching_and_Infobox_Alignment_in_Wikipedia">Cross-Lingual Entity Matching and Infobox Alignment in Wikipedia</a>&quot;. Elsevier Science Ltd.. DOI: 10.1016/j.is.2012.10.003.