A Wikipedia-Based Corpus for Contextualized Machine Translation
Authors | Jennifer Drexler, Pushpendre Rastogi, Jacqueline Aguilar, Benjamin Van Durme, Matt Post |
---|---|
Publication date | 2014 |
ISBN | 978-295174088-4 |
A Wikipedia-Based Corpus for Contextualized Machine Translation is a scientific work about Wikipedia quality, published in 2014 and written by Jennifer Drexler, Pushpendre Rastogi, Jacqueline Aguilar, Benjamin Van Durme and Matt Post.
Overview
The authors describe a corpus for, and experiments in, target-contextualized machine translation (MT), in which they incorporate language models built from target-language documents that are comparable in nature to the source documents. The corpus comprises (i) a set of curated English Wikipedia articles describing news events, along with (ii) their comparable Spanish counterparts, (iii) a number of the Spanish source articles cited within them, and (iv) English reference translations of all the Spanish data. In the experiments, the authors evaluate the effect on translation quality of including language models built over these English documents and interpolated with other, separately derived, more general language model sources. They find that even this simplistic baseline approach yields significant improvements as measured by BLEU score.
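The interpolation idea mentioned above can be illustrated with a minimal sketch. This is not the authors' implementation: the summary does not state which language-modeling toolkit, model order, or interpolation weights were used, so the add-alpha unigram models, the toy sentences, and the weight `lam=0.5` below are all assumptions chosen purely for illustration.

```python
from collections import Counter
import math


def train_unigram(tokens, vocab, alpha=1.0):
    """Add-alpha smoothed unigram probabilities over a fixed vocabulary (illustrative stand-in for a real LM)."""
    counts = Counter(tokens)
    total = len(tokens) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}


def interpolate(p_context, p_general, lam):
    """Linear interpolation: lam * p_context(w) + (1 - lam) * p_general(w)."""
    return {w: lam * p_context[w] + (1.0 - lam) * p_general[w] for w in p_context}


def perplexity(model, tokens):
    """Per-token perplexity of a token sequence under a unigram model."""
    log_sum = sum(math.log(model[w]) for w in tokens)
    return math.exp(-log_sum / len(tokens))


# Toy data (hypothetical): a "general" corpus and a comparable, event-specific "context" document.
general = "the cat sat on the mat the dog sat on the rug".split()
context = "the earthquake struck the city the earthquake damaged the city".split()
test = "the earthquake struck the city".split()
vocab = sorted(set(general) | set(context) | set(test))

p_general = train_unigram(general, vocab)
p_context = train_unigram(context, vocab)
p_mixed = interpolate(p_context, p_general, lam=0.5)  # lam is an assumed weight

print("general perplexity:", perplexity(p_general, test))
print("mixed perplexity:  ", perplexity(p_mixed, test))
```

On this toy data the interpolated model assigns higher probability to the event-specific words than the general model alone, which is the intuition behind adding comparable target-language documents. In practice the interpolation weight would typically be tuned on held-out data rather than fixed.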
Embed
Wikipedia Quality
Drexler, Jennifer; Rastogi, Pushpendre; Aguilar, Jacqueline; Van Durme, Benjamin; Post, Matt. (2014). "[[A Wikipedia-Based Corpus for Contextualized Machine Translation]]". ICIC Express Letters Volume 8, Issue 7, July 2014, pp. 1877-1882. ISBN: 978-295174088-4.
English Wikipedia
{{cite journal |last1=Drexler |first1=Jennifer |last2=Rastogi |first2=Pushpendre |last3=Aguilar |first3=Jacqueline |last4=Van Durme |first4=Benjamin |last5=Post |first5=Matt |title=A Wikipedia-Based Corpus for Contextualized Machine Translation |date=2014 |isbn=978-295174088-4 |url=https://wikipediaquality.com/wiki/A_Wikipedia-Based_Corpus_for_Contextualized_Machine_Translation |journal=ICIC Express Letters Volume 8, Issue 7, July 2014, pp. 1877-1882}}
HTML
Drexler, Jennifer; Rastogi, Pushpendre; Aguilar, Jacqueline; Van Durme, Benjamin; Post, Matt. (2014). "<a href="https://wikipediaquality.com/wiki/A_Wikipedia-Based_Corpus_for_Contextualized_Machine_Translation">A Wikipedia-Based Corpus for Contextualized Machine Translation</a>". ICIC Express Letters Volume 8, Issue 7, July 2014, pp. 1877-1882. ISBN: 978-295174088-4.