Difference between revisions of "Enriching the Crosslingual Link Structure of Wikipedia - a Classification-Based Approach"

From Wikipedia Quality
Jump to: navigation, search
(Links)
(infobox)
Line 1: Line 1:
 +
{{Infobox work
 +
| title = Enriching the Crosslingual Link Structure of Wikipedia - a Classification-Based Approach
 +
| date = 2008
 +
| authors = [[Philipp Sorg]]<br />[[Philipp Cimiano]]
 +
| link = https://www.aaai.org/Papers/Workshops/2008/WS-08-15/WS08-15-009.pdf
 +
}}
 
'''Enriching the Crosslingual Link Structure of Wikipedia - a Classification-Based Approach''' - scientific work related to [[Wikipedia quality]] published in 2008, written by [[Philipp Sorg]] and [[Philipp Cimiano]].
 
'''Enriching the Crosslingual Link Structure of Wikipedia - a Classification-Based Approach''' - scientific work related to [[Wikipedia quality]] published in 2008, written by [[Philipp Sorg]] and [[Philipp Cimiano]].
  
 
== Overview ==
 
== Overview ==
 
The crosslingual link structure of [[Wikipedia]] represents a valuable resource which can be exploited for crosslingual [[natural language processing]] applications. However, this requires that it has a reasonable coverage and is furthermore accurate. For the specific language pair German/English that authors consider in experiments, authors show that roughly 50% of the articles are linked from German to English and only 14% from English to German. These figures clearly corroborate the need for an approach to automatically induce new cross-language links, especially in the light of such a dynamically growing resource such as Wikipedia. In this paper authors present a classification-based approach with the goal of inferring new cross-language links. Authors experiments show that this approach has a recall of 70% with a precision of 94% for the task of learning cross-language links on a test dataset.
 
The crosslingual link structure of [[Wikipedia]] represents a valuable resource which can be exploited for crosslingual [[natural language processing]] applications. However, this requires that it has a reasonable coverage and is furthermore accurate. For the specific language pair German/English that authors consider in experiments, authors show that roughly 50% of the articles are linked from German to English and only 14% from English to German. These figures clearly corroborate the need for an approach to automatically induce new cross-language links, especially in the light of such a dynamically growing resource such as Wikipedia. In this paper authors present a classification-based approach with the goal of inferring new cross-language links. Authors experiments show that this approach has a recall of 70% with a precision of 94% for the task of learning cross-language links on a test dataset.

Revision as of 13:04, 27 November 2020


Enriching the Crosslingual Link Structure of Wikipedia - a Classification-Based Approach
Authors
Philipp Sorg
Philipp Cimiano
Publication date
2008
Links
Original

Enriching the Crosslingual Link Structure of Wikipedia - a Classification-Based Approach - scientific work related to Wikipedia quality published in 2008, written by Philipp Sorg and Philipp Cimiano.

Overview

The crosslingual link structure of Wikipedia represents a valuable resource which can be exploited for crosslingual natural language processing applications. However, this requires that it has a reasonable coverage and is furthermore accurate. For the specific language pair German/English that authors consider in experiments, authors show that roughly 50% of the articles are linked from German to English and only 14% from English to German. These figures clearly corroborate the need for an approach to automatically induce new cross-language links, especially in the light of such a dynamically growing resource such as Wikipedia. In this paper authors present a classification-based approach with the goal of inferring new cross-language links. Authors experiments show that this approach has a recall of 70% with a precision of 94% for the task of learning cross-language links on a test dataset.