Towards Building a Multilingual Semantic Network: Identifying Interlingual Links in Wikipedia

From Wikipedia Quality
Revision as of 09:01, 14 August 2020 by Aaliyah (talk | contribs) (New study: Towards Building a Multilingual Semantic Network: Identifying Interlingual Links in Wikipedia)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Towards Building a Multilingual Semantic Network: Identifying Interlingual Links in Wikipedia - scientific work related to Wikipedia quality published in 2012, written by Bharath Dandala, Rada Mihalcea and Razvan C. Bunescu.

Overview

Wikipedia is a Web based, freely available multilingual encyclopedia, constructed in a collaborative effort by thousands of contributors. Wikipedia articles on the same topic in different languages are connected via interlingual (or translational) links. These links serve as an excellent resource for obtaining lexical translations, or building multilingual dictionaries and semantic networks. As these links are manually built, many links are missing or simply wrong. This paper describes a supervised learning method for generating new links and detecting existing incorrect links. Since there is no dataset available to evaluate the resulting interlingual links, authors create own gold standard by sampling translational links from four language pairs using distance heuristics. Authors manually annotate the sampled translation links and used them to evaluate the output of method for automatic link detection and correction.