Difference between revisions of "Translating the Swedish Wikipedia into Danish"
+ {{Infobox work
+ | title = Translating the Swedish Wikipedia into Danish
+ | date = 2014
+ | authors = [[Eckhard Bick]]
+ | link = http://www2.lingfil.uu.se/SLTC2014/abstracts/sltc2014_submission_2.pdf
+ }}
Revision as of 08:50, 20 December 2019
Authors | Eckhard Bick |
---|---|
Publication date | 2014 |
Links | Original |
Translating the Swedish Wikipedia into Danish is a scientific work related to Wikipedia quality, published in 2014 and written by Eckhard Bick.
Overview
Abstract. This paper presents a Swedish-Danish automatic translation system for Wikipedia articles (WikiTrans). Translated articles are indexed for both title and content, and integrated with original Danish articles where they exist. Changed or added articles in the Swedish Wikipedia are monitored and added on a daily basis. The translation approach uses a grammar-based machine translation system with a deep source-language structural analysis. Disambiguation and lexical transfer rules exploit Constraint Grammar tags and dependency links to access contextual information, such as syntactic argument function, semantic type and quantifiers. Out-of-vocabulary words are handled by derivational and compound analysis with a combined coverage of 99.3%, as well as systematic morpho-phonemic transliterations for the remaining cases. The system achieved BLEU scores of 0.65-0.8 depending on references and outperformed both statistical (STMT) and rule-based (RBMT) machine translation competitors by a large margin.
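The compound analysis mentioned in the abstract can be illustrated with a small sketch. This is not the paper's implementation, which works on Constraint Grammar analyses; it is a simplified lexicon-based splitter, and the `LEXICON` entries and the function names `split_compound` and `translate_word` are assumptions made up for this example. It only shows the general idea of translating an unknown compound from its known parts, and it ignores linking elements and derivation.

```python
from typing import List, Optional

# Toy Swedish-to-Danish lexicon (illustrative entries, not taken from the paper).
LEXICON = {
    "hus": "hus",
    "båt": "båd",
    "bok": "bog",
    "handel": "handel",
}


def split_compound(word: str) -> Optional[List[str]]:
    """Split `word` into parts that are all in LEXICON, or return None."""
    if word in LEXICON:
        return [word]
    # Try every split point, preferring the longest known first part.
    for i in range(len(word) - 1, 0, -1):
        head, tail = word[:i], word[i:]
        if head in LEXICON:
            rest = split_compound(tail)
            if rest is not None:
                return [head] + rest
    return None


def translate_word(word: str) -> Optional[str]:
    """Translate a word part by part; None means no analysis was found."""
    parts = split_compound(word)
    if parts is None:
        # A real system would fall back to transliteration at this point.
        return None
    return "".join(LEXICON[part] for part in parts)


if __name__ == "__main__":
    for swedish in ["bokhandel", "husbåt", "xyz"]:
        print(swedish, "->", translate_word(swedish))
```

Words that cannot be split into known parts return `None` here; according to the abstract, the actual system handles such remaining cases with systematic morpho-phonemic transliteration.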