A Wikipedia-Based Corpus for Contextualized Machine Translation

From Wikipedia Quality
Jump to: navigation, search
A Wikipedia-Based Corpus for Contextualized Machine Translation
Authors
Jennifer Drexler
Pushpendre Rastogi
Jacqueline Aguilar
Benjamin Van Durme
Matt Post
Publication date
2014
ISBN
978-295174088-4
Links

A Wikipedia-Based Corpus for Contextualized Machine Translation - scientific work about Wikipedia quality published in 2014, written by Jennifer Drexler, Pushpendre Rastogi, Jacqueline Aguilar, Benjamin Van Durme and Matt Post.

Overview

Autors describe a corpus for and experiments in target-contextualized machine translation (MT), in which authors incorporate language models from target-language documents that are comparable in nature to the source documents. This corpus comprises (i) a set of curated English Wikipedia articles describing news events along with (ii) their comparable Spanish counterparts, (iii) a number of the Spanish source articles cited within them, and (iv) English reference translations of all the Spanish data. In experiments, authors evaluate the effect on translation quality when including language models built over these English documents and interpolated with other, separately-derived, more general language model sources. Authors find that even under this simplistic baseline approach, authors achieve significant improvements as measured by BLEU score.

Embed

Wikipedia Quality

Drexler, Jennifer; Rastogi, Pushpendre; Aguilar, Jacqueline; Van Durme, Benjamin; Post, Matt. (2014). "[[A Wikipedia-Based Corpus for Contextualized Machine Translation]]". ICIC Express Letters Volume 8, Issue 7, July 2014, pp. 1877-1882. ISBN: 978-295174088-4.

English Wikipedia

{{cite journal |last1=Drexler |first1=Jennifer |last2=Rastogi |first2=Pushpendre |last3=Aguilar |first3=Jacqueline |last4=Van Durme |first4=Benjamin |last5=Post |first5=Matt |title=A Wikipedia-Based Corpus for Contextualized Machine Translation |date=2014 |isbn=978-295174088-4 |url=https://wikipediaquality.com/wiki/A_Wikipedia-Based_Corpus_for_Contextualized_Machine_Translation |journal=ICIC Express Letters Volume 8, Issue 7, July 2014, pp. 1877-1882}}

HTML

Drexler, Jennifer; Rastogi, Pushpendre; Aguilar, Jacqueline; Van Durme, Benjamin; Post, Matt. (2014). &quot;<a href="https://wikipediaquality.com/wiki/A_Wikipedia-Based_Corpus_for_Contextualized_Machine_Translation">A Wikipedia-Based Corpus for Contextualized Machine Translation</a>&quot;. ICIC Express Letters Volume 8, Issue 7, July 2014, pp. 1877-1882. ISBN: 978-295174088-4.