Translating the Swedish Wikipedia into Danish

Translating the Swedish Wikipedia into Danish is a scientific work related to Wikipedia quality, published in 2014 and written by Eckhard Bick.

Overview

Abstract. This paper presents a Swedish-Danish automatic translation system for Wikipedia articles (WikiTrans). Translated articles are indexed for both title and content, and integrated with original Danish articles where they exist. Changed or added articles in the Swedish Wikipedia are monitored and added on a daily basis. The translation approach uses a grammar-based machine translation system with a deep source-language structural analysis. Disambiguation and lexical transfer rules exploit Constraint Grammar tags and dependency links to access contextual information, such as syntactic argument function, semantic type and quantifiers. Out-of-vocabulary words are handled by derivational and compound analysis, with a combined coverage of 99.3%, and by systematic morpho-phonemic transliteration for the remaining cases. The system achieved BLEU scores of 0.65-0.8, depending on the references used, and outperformed both statistical (STMT) and rule-based (RBMT) machine translation competitors by a large margin.
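
The abstract does not say how the compound analysis is implemented. As a rough illustration of the general idea only, the sketch below splits an out-of-vocabulary word into known lexicon entries and translates it part by part. The lexicon, the function names and the longest-match strategy are all assumptions made for this example and are not taken from WikiTrans.

```python
from typing import Dict, List, Optional

# Toy Swedish -> Danish lexicon. Hypothetical and tiny; a real system
# would use a full bilingual lexicon plus morphological rules.
LEXICON: Dict[str, str] = {
    "bok": "bog",
    "handel": "handel",
    "hus": "hus",
}


def split_compound(word: str, lexicon: Dict[str, str]) -> Optional[List[str]]:
    """Segment `word` into lexicon entries, or return None if impossible."""
    if word in lexicon:
        return [word]
    # Try the longest possible first part before shorter ones.
    for i in range(len(word) - 1, 0, -1):
        head, tail = word[:i], word[i:]
        if head in lexicon:
            rest = split_compound(tail, lexicon)
            if rest is not None:
                return [head] + rest
    return None


def translate_oov(word: str, lexicon: Dict[str, str]) -> Optional[str]:
    """Translate an unknown compound by translating each known part."""
    parts = split_compound(word, lexicon)
    if parts is None:
        # This is where a fallback such as transliteration would take over.
        return None
    return "".join(lexicon[part] for part in parts)


if __name__ == "__main__":
    print(split_compound("bokhandel", LEXICON))
    print(translate_oov("bokhandel", LEXICON))
```

Here the Swedish compound "bokhandel" is not in the toy lexicon as a whole, but it can be segmented into "bok" and "handel", both of which are. Words that cannot be segmented return None, which corresponds to the remaining cases the abstract assigns to transliteration.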