Named Entities from Wikipedia for Machine Translation

From Wikipedia Quality
Revision as of 00:41, 4 July 2018 by Librarian (talk | contribs) (New scientific work)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search


Named Entities from Wikipedia for Machine Translation
Authors
Ondrej Hálek
Rudolf Rosa
Aleš Tamchyna
Ondřej Bojar
Publication date
2011
ISSN
16130073
Links

Named Entities from Wikipedia for Machine Translation - scientific work about Wikipedia quality published in 2011, written by Ondrej Hálek, Rudolf Rosa, Aleš Tamchyna and Ondřej Bojar.

Overview

In this paper authors present their attempt to improve machine translation of named entities by using Wikipedia. Authors recognize named entities based on categories of English Wikipedia articles, extract their potential translations from corresponding Czech articles and incorporate them into a statistical machine translation system as translation options. Their results show a decrease of translation quality in terms of automatic metrics but positive results from human annotators. Authors conclude that this approach can lead to many errors in translation and therefore should always be combined with the standard statistical translation model and weighted appropriately.

Embed

Wikipedia Quality

Hálek, Ondrej; Rosa, Rudolf; Tamchyna, Aleš; Bojar, Ondřej. (2011). "[[Named Entities from Wikipedia for Machine Translation]]". CEUR Workshop Proceedings Volume 788, 2011, pp. 23-30. ISSN: 16130073.

English Wikipedia

{{cite journal |last1=Hálek |first1=Ondrej |last2=Rosa |first2=Rudolf |last3=Tamchyna |first3=Aleš |last4=Bojar |first4=Ondřej |title=Named Entities from Wikipedia for Machine Translation |date=2011 |issn=16130073 |url=https://wikipediaquality.com/wiki/Named_Entities_from_Wikipedia_for_Machine_Translation |journal=CEUR Workshop Proceedings Volume 788, 2011, pp. 23-30}}

HTML

Hálek, Ondrej; Rosa, Rudolf; Tamchyna, Aleš; Bojar, Ondřej. (2011). &quot;<a href="https://wikipediaquality.com/wiki/Named_Entities_from_Wikipedia_for_Machine_Translation">Named Entities from Wikipedia for Machine Translation</a>&quot;. CEUR Workshop Proceedings Volume 788, 2011, pp. 23-30. ISSN: 16130073.