Named Entities from Wikipedia for Machine Translation
Authors | Ondrej Hálek Rudolf Rosa Aleš Tamchyna Ondřej Bojar |
---|---|
Publication date | 2011 |
ISSN | 16130073 |
Links |
Named Entities from Wikipedia for Machine Translation - scientific work about Wikipedia quality published in 2011, written by Ondrej Hálek, Rudolf Rosa, Aleš Tamchyna and Ondřej Bojar.
Overview
In this paper authors present their attempt to improve machine translation of named entities by using Wikipedia. Authors recognize named entities based on categories of English Wikipedia articles, extract their potential translations from corresponding Czech articles and incorporate them into a statistical machine translation system as translation options. Their results show a decrease of translation quality in terms of automatic metrics but positive results from human annotators. Authors conclude that this approach can lead to many errors in translation and therefore should always be combined with the standard statistical translation model and weighted appropriately.
Embed
Wikipedia Quality
Hálek, Ondrej; Rosa, Rudolf; Tamchyna, Aleš; Bojar, Ondřej. (2011). "[[Named Entities from Wikipedia for Machine Translation]]". CEUR Workshop Proceedings Volume 788, 2011, pp. 23-30. ISSN: 16130073.
English Wikipedia
{{cite journal |last1=Hálek |first1=Ondrej |last2=Rosa |first2=Rudolf |last3=Tamchyna |first3=Aleš |last4=Bojar |first4=Ondřej |title=Named Entities from Wikipedia for Machine Translation |date=2011 |issn=16130073 |url=https://wikipediaquality.com/wiki/Named_Entities_from_Wikipedia_for_Machine_Translation |journal=CEUR Workshop Proceedings Volume 788, 2011, pp. 23-30}}
HTML
Hálek, Ondrej; Rosa, Rudolf; Tamchyna, Aleš; Bojar, Ondřej. (2011). "<a href="https://wikipediaquality.com/wiki/Named_Entities_from_Wikipedia_for_Machine_Translation">Named Entities from Wikipedia for Machine Translation</a>". CEUR Workshop Proceedings Volume 788, 2011, pp. 23-30. ISSN: 16130073.