Difference between revisions of "A Multilingual Approach to Discover Cross-Language Links in Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(Infobox work)
(Embed for English Wikipedia, HTML)
Line 10: Line 10:
 
== Overview ==
 
== Overview ==
 
Wikipedia is a well-known public and collaborative encyclopaedia consisting of millions of articles. Initially in English, the popular website has grown to include versions in over 288 languages. These versions and their articles are interconnected via cross-language links, which not only facilitate navigation and understanding of concepts in [[multiple languages]], but have been used in [[natural language processing]] applications, developments in linked open data, and expansion of minor [[Wikipedia]] [[language versions]]. These applications are the motivation for an automatic, robust, and accurate technique to identify cross-language links. In this paper, authors present a [[multilingual]] approach called EurekaCL to automatically identify missing cross-language links in Wikipedia. More precisely, given a Wikipedia article the source EurekaCL uses the multilingual and semantic [[features]] of BabelNet 2.0 in order to efficiently identify a set of candidate articles in a target language that are likely to cover the same topic as the source. The Wikipedia graph structure is then exploited both to prune and to rank the candidates. Authors evaluation carried out on 42,000 pairs of articles in eight language versions of Wikipedia shows that candidate selection and pruning procedures allow an effective selection of candidates which significantly helps the determination of the correct article in the target language version.
 
Wikipedia is a well-known public and collaborative encyclopaedia consisting of millions of articles. Initially in English, the popular website has grown to include versions in over 288 languages. These versions and their articles are interconnected via cross-language links, which not only facilitate navigation and understanding of concepts in [[multiple languages]], but have been used in [[natural language processing]] applications, developments in linked open data, and expansion of minor [[Wikipedia]] [[language versions]]. These applications are the motivation for an automatic, robust, and accurate technique to identify cross-language links. In this paper, authors present a [[multilingual]] approach called EurekaCL to automatically identify missing cross-language links in Wikipedia. More precisely, given a Wikipedia article the source EurekaCL uses the multilingual and semantic [[features]] of BabelNet 2.0 in order to efficiently identify a set of candidate articles in a target language that are likely to cover the same topic as the source. The Wikipedia graph structure is then exploited both to prune and to rank the candidates. Authors evaluation carried out on 42,000 pairs of articles in eight language versions of Wikipedia shows that candidate selection and pruning procedures allow an effective selection of candidates which significantly helps the determination of the correct article in the target language version.
 +
 +
== Embed ==
 +
=== Wikipedia Quality ===
 +
<code>
 +
<nowiki>
 +
Bennacer, Nacéra; Vioulès, Mia Johnson; López, Maximiliano Ariel; Quercini, Gianluca. (2015). "[[A Multilingual Approach to Discover Cross-Language Links in Wikipedia]]". Springer, Cham. DOI: 10.1007/978-3-319-26190-4_36.
 +
</nowiki>
 +
</code>
 +
 +
=== English Wikipedia ===
 +
<code>
 +
<nowiki>
 +
{{cite journal |last1=Bennacer |first1=Nacéra |last2=Vioulès |first2=Mia Johnson |last3=López |first3=Maximiliano Ariel |last4=Quercini |first4=Gianluca |title=A Multilingual Approach to Discover Cross-Language Links in Wikipedia |date=2015 |doi=10.1007/978-3-319-26190-4_36 |url=https://wikipediaquality.com/wiki/A_Multilingual_Approach_to_Discover_Cross-Language_Links_in_Wikipedia |journal=Springer, Cham}}
 +
</nowiki>
 +
</code>
 +
 +
=== HTML ===
 +
<code>
 +
<nowiki>
 +
Bennacer, Nacéra; Vioulès, Mia Johnson; López, Maximiliano Ariel; Quercini, Gianluca. (2015). &amp;quot;<a href="https://wikipediaquality.com/wiki/A_Multilingual_Approach_to_Discover_Cross-Language_Links_in_Wikipedia">A Multilingual Approach to Discover Cross-Language Links in Wikipedia</a>&amp;quot;. Springer, Cham. DOI: 10.1007/978-3-319-26190-4_36.
 +
</nowiki>
 +
</code>

Revision as of 23:30, 29 January 2021


A Multilingual Approach to Discover Cross-Language Links in Wikipedia
Authors
Nacéra Bennacer
Mia Johnson Vioulès
Maximiliano Ariel López
Gianluca Quercini
Publication date
2015
DOI
10.1007/978-3-319-26190-4_36
Links
Original

A Multilingual Approach to Discover Cross-Language Links in Wikipedia - scientific work related to Wikipedia quality published in 2015, written by Nacéra Bennacer, Mia Johnson Vioulès, Maximiliano Ariel López and Gianluca Quercini.

Overview

Wikipedia is a well-known public and collaborative encyclopaedia consisting of millions of articles. Initially in English, the popular website has grown to include versions in over 288 languages. These versions and their articles are interconnected via cross-language links, which not only facilitate navigation and understanding of concepts in multiple languages, but have been used in natural language processing applications, developments in linked open data, and expansion of minor Wikipedia language versions. These applications are the motivation for an automatic, robust, and accurate technique to identify cross-language links. In this paper, authors present a multilingual approach called EurekaCL to automatically identify missing cross-language links in Wikipedia. More precisely, given a Wikipedia article the source EurekaCL uses the multilingual and semantic features of BabelNet 2.0 in order to efficiently identify a set of candidate articles in a target language that are likely to cover the same topic as the source. The Wikipedia graph structure is then exploited both to prune and to rank the candidates. Authors evaluation carried out on 42,000 pairs of articles in eight language versions of Wikipedia shows that candidate selection and pruning procedures allow an effective selection of candidates which significantly helps the determination of the correct article in the target language version.

Embed

Wikipedia Quality

Bennacer, Nacéra; Vioulès, Mia Johnson; López, Maximiliano Ariel; Quercini, Gianluca. (2015). "[[A Multilingual Approach to Discover Cross-Language Links in Wikipedia]]". Springer, Cham. DOI: 10.1007/978-3-319-26190-4_36.

English Wikipedia

{{cite journal |last1=Bennacer |first1=Nacéra |last2=Vioulès |first2=Mia Johnson |last3=López |first3=Maximiliano Ariel |last4=Quercini |first4=Gianluca |title=A Multilingual Approach to Discover Cross-Language Links in Wikipedia |date=2015 |doi=10.1007/978-3-319-26190-4_36 |url=https://wikipediaquality.com/wiki/A_Multilingual_Approach_to_Discover_Cross-Language_Links_in_Wikipedia |journal=Springer, Cham}}

HTML

Bennacer, Nacéra; Vioulès, Mia Johnson; López, Maximiliano Ariel; Quercini, Gianluca. (2015). &quot;<a href="https://wikipediaquality.com/wiki/A_Multilingual_Approach_to_Discover_Cross-Language_Links_in_Wikipedia">A Multilingual Approach to Discover Cross-Language Links in Wikipedia</a>&quot;. Springer, Cham. DOI: 10.1007/978-3-319-26190-4_36.