Difference between revisions of "Enriching Multilingual Language Resources by Discovering Missing Cross-Language Links in Wikipedia"

From Wikipedia Quality
(Infobox)
(Embed for English Wikipedia, HTML)
Line 10:
 
== Overview ==
 
The authors present a novel method for discovering missing cross-language links between English and Japanese [[Wikipedia]] articles. They first collect candidates for missing cross-language links, that is, pairs of English and Japanese Wikipedia articles that could be connected by a cross-language link. They then select the correct links among the candidates using a classifier trained on various types of [[features]]. The method has three desirable characteristics. First, it discovers cross-language links with high accuracy (92% precision at 78% recall). Second, the features used by the classifier are language-independent. Third, the features are generated from resources obtained automatically from Wikipedia, without relying on any external knowledge. In this work, the authors discover approximately $10^5$ missing cross-language links, almost two-thirds as many as the cross-language links that already exist in Wikipedia.
+
+ == Embed ==
+ === Wikipedia Quality ===
+ <code>
+ <nowiki>
+ Oh, Jong-Hoon; Kawahara, Daisuke; Uchimoto, Kiyotaka; Kazama, Jun’ichi; Torisawa, Kentaro. (2008). "[[Enriching Multilingual Language Resources by Discovering Missing Cross-Language Links in Wikipedia]]". DOI: 10.1109/WIIAT.2008.317.
+ </nowiki>
+ </code>
+
+ === English Wikipedia ===
+ <code>
+ <nowiki>
+ {{cite journal |last1=Oh |first1=Jong-Hoon |last2=Kawahara |first2=Daisuke |last3=Uchimoto |first3=Kiyotaka |last4=Kazama |first4=Jun’ichi |last5=Torisawa |first5=Kentaro |title=Enriching Multilingual Language Resources by Discovering Missing Cross-Language Links in Wikipedia |date=2008 |doi=10.1109/WIIAT.2008.317 |url=https://wikipediaquality.com/wiki/Enriching_Multilingual_Language_Resources_by_Discovering_Missing_Cross-Language_Links_in_Wikipedia}}
+ </nowiki>
+ </code>
+
+ === HTML ===
+ <code>
+ <nowiki>
+ Oh, Jong-Hoon; Kawahara, Daisuke; Uchimoto, Kiyotaka; Kazama, Jun’ichi; Torisawa, Kentaro. (2008). &amp;quot;<a href="https://wikipediaquality.com/wiki/Enriching_Multilingual_Language_Resources_by_Discovering_Missing_Cross-Language_Links_in_Wikipedia">Enriching Multilingual Language Resources by Discovering Missing Cross-Language Links in Wikipedia</a>&amp;quot;. DOI: 10.1109/WIIAT.2008.317.
+ </nowiki>
+ </code>

Revision as of 15:33, 13 February 2021


Enriching Multilingual Language Resources by Discovering Missing Cross-Language Links in Wikipedia
Authors: Jong-Hoon Oh, Daisuke Kawahara, Kiyotaka Uchimoto, Jun’ichi Kazama, Kentaro Torisawa
Publication date: 2008
DOI: 10.1109/WIIAT.2008.317
Links: Original

Enriching Multilingual Language Resources by Discovering Missing Cross-Language Links in Wikipedia is a scientific work related to Wikipedia quality, published in 2008 and written by Jong-Hoon Oh, Daisuke Kawahara, Kiyotaka Uchimoto, Jun’ichi Kazama and Kentaro Torisawa.

Overview

The authors present a novel method for discovering missing cross-language links between English and Japanese Wikipedia articles. They first collect candidates for missing cross-language links, that is, pairs of English and Japanese Wikipedia articles that could be connected by a cross-language link. They then select the correct links among the candidates using a classifier trained on various types of features. The method has three desirable characteristics. First, it discovers cross-language links with high accuracy (92% precision at 78% recall). Second, the features used by the classifier are language-independent. Third, the features are generated from resources obtained automatically from Wikipedia, without relying on any external knowledge. In this work, the authors discover approximately $10^5$ missing cross-language links, almost two-thirds as many as the cross-language links that already exist in Wikipedia.
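
To make the reported precision and recall figures concrete, the following is a minimal sketch of how these two measures could be computed for a set of predicted cross-language links against a gold-standard set. The `precision_recall` helper and the article pairs are hypothetical illustrations; they are not code or data from the paper, and the paper's own evaluation setup may differ.

```python
from typing import Set, Tuple

# A cross-language link is modeled as an (English title, Japanese title) pair.
Link = Tuple[str, str]


def precision_recall(predicted: Set[Link], gold: Set[Link]) -> Tuple[float, float]:
    """Return (precision, recall) of predicted links against gold links."""
    true_positives = len(predicted & gold)
    # Precision: share of predicted links that are correct.
    precision = true_positives / len(predicted) if predicted else 0.0
    # Recall: share of correct links that were predicted.
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical toy data, not taken from the paper.
gold = {("Tokyo", "東京"), ("Cat", "ネコ"), ("Rice", "米"), ("Sea", "海")}
predicted = {("Tokyo", "東京"), ("Cat", "ネコ"), ("Rice", "稲")}

p, r = precision_recall(predicted, gold)
print(f"precision={p:.2f} recall={r:.2f}")  # prints "precision=0.67 recall=0.50"
```

In this toy case two of the three predicted links are correct (precision 2/3) and two of the four gold links are found (recall 1/2). The 92% and 78% figures reported above are the analogous ratios on the authors' evaluation data.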

Embed

Wikipedia Quality

Oh, Jong-Hoon; Kawahara, Daisuke; Uchimoto, Kiyotaka; Kazama, Jun’ichi; Torisawa, Kentaro. (2008). "[[Enriching Multilingual Language Resources by Discovering Missing Cross-Language Links in Wikipedia]]". DOI: 10.1109/WIIAT.2008.317.

English Wikipedia

{{cite journal |last1=Oh |first1=Jong-Hoon |last2=Kawahara |first2=Daisuke |last3=Uchimoto |first3=Kiyotaka |last4=Kazama |first4=Jun’ichi |last5=Torisawa |first5=Kentaro |title=Enriching Multilingual Language Resources by Discovering Missing Cross-Language Links in Wikipedia |date=2008 |doi=10.1109/WIIAT.2008.317 |url=https://wikipediaquality.com/wiki/Enriching_Multilingual_Language_Resources_by_Discovering_Missing_Cross-Language_Links_in_Wikipedia}}

HTML

Oh, Jong-Hoon; Kawahara, Daisuke; Uchimoto, Kiyotaka; Kazama, Jun’ichi; Torisawa, Kentaro. (2008). &quot;<a href="https://wikipediaquality.com/wiki/Enriching_Multilingual_Language_Resources_by_Discovering_Missing_Cross-Language_Links_in_Wikipedia">Enriching Multilingual Language Resources by Discovering Missing Cross-Language Links in Wikipedia</a>&quot;. DOI: 10.1109/WIIAT.2008.317.