Using Wikipedia to Translate Domain-Specific Terms in Smt

From Wikipedia Quality
Jump to: navigation, search


Using Wikipedia to Translate Domain-Specific Terms in Smt
Authors
Jan Niehues
Alex Waibel
Publication date
2011
Links
Original

Using Wikipedia to Translate Domain-Specific Terms in Smt - scientific work related to Wikipedia quality published in 2011, written by Jan Niehues and Alex Waibel.

Overview

When building a university lecture translation system, one important step is to adapt it to the target domain. One problem in this adaptation task is to acquire translations for domain specific terms. In this approach authors tried to get these translations from Wikipedia, which provides articles on very specific topics in many different languages. To extract translations for the domain specific terms, authors used the interlanguage links of Wikipedia . Authors analyzed different methods to integrate this corpus into system and explored methods to disambiguate between different translations by using the text of the articles. In addition, authors developed methods to handle different morphological forms of the specific terms in morphologically rich input languages like German. The results show that the number of out-of-vocabulary (OOV) words could be reduced by 50% on computer science lectures and the translation quality could be improved by more than 1 BLEU point.

Embed

Wikipedia Quality

Niehues, Jan; Waibel, Alex. (2011). "[[Using Wikipedia to Translate Domain-Specific Terms in Smt]]".

English Wikipedia

{{cite journal |last1=Niehues |first1=Jan |last2=Waibel |first2=Alex |title=Using Wikipedia to Translate Domain-Specific Terms in Smt |date=2011 |url=https://wikipediaquality.com/wiki/Using_Wikipedia_to_Translate_Domain-Specific_Terms_in_Smt}}

HTML

Niehues, Jan; Waibel, Alex. (2011). &quot;<a href="https://wikipediaquality.com/wiki/Using_Wikipedia_to_Translate_Domain-Specific_Terms_in_Smt">Using Wikipedia to Translate Domain-Specific Terms in Smt</a>&quot;.