Reciprocal Enrichment Between Basque Wikipedia and Machine Translation

From Wikipedia Quality
Revision as of 00:00, 18 May 2019 by Sofia (talk | contribs) (Created page with "{{Infobox work | title = Reciprocal Enrichment Between Basque Wikipedia and Machine Translation | date = 2013 | authors = Iñaki Alegria<br />Unai Cabezon<br />Unai...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search


Reciprocal Enrichment Between Basque Wikipedia and Machine Translation
Authors
Iñaki Alegria
Unai Cabezon
Unai Fernandez de Betoño
Gorka Labaka
Aingeru Mayor
Kepa Sarasola
Arkaitz Zubiaga
Publication date
2013
DOI
10.1007/978-3-642-35085-6_4
Links
Original Preprint

Reciprocal Enrichment Between Basque Wikipedia and Machine Translation - scientific work related to Wikipedia quality published in 2013, written by Iñaki Alegria, Unai Cabezon, Unai Fernandez de Betoño, Gorka Labaka, Aingeru Mayor, Kepa Sarasola and Arkaitz Zubiaga.

Overview

In this chapter, authors define a collaboration framework that enables Wikipedia editors to generate new articles while they help development of Machine Translation (MT) systems by providing post-edition logs. This collaboration framework was tested with editors of Basque Wikipedia. Their post-editing of Computer Science articles has been used to improve the output of a Spanish to Basque MT system called Matxin. For the collaboration between editors and researchers, authors selected a set of 100 articles from the Spanish Wikipedia. These articles would then be used as the source texts to be translated into Basque using the MT engine. A group of volunteers from Basque Wikipedia reviewed and corrected the raw MT translations. This collaboration ultimately produced two main benefits: (i) the change logs that would potentially help improve the MT engine by using an automated statistical post-editing system, and (ii) the growth of Basque Wikipedia. The results show that this process can improve the accuracy of an Rule Based MT (RBMT) system in nearly 10 % benefiting from the post-edition of 50,000 words in the Computer Science domain. Authors believe that conclusions can be extended to MT engines involving other less-resourced languages lacking large parallel corpora or frequently updated lexical knowledge, as well as to other domains.

Embed

Wikipedia Quality

Iñaki, Alegria; Unai, Cabezon; Unai Fernandez de, Betoño; Gorka, Labaka; Aingeru, Mayor; Kepa, Sarasola; Arkaitz, Zubiaga. (2013). "[[Reciprocal Enrichment Between Basque Wikipedia and Machine Translation]]". Springer Berlin Heidelberg. DOI: 10.1007/978-3-642-35085-6_4.

English Wikipedia

{{cite journal |last1=Iñaki |first1=Alegria |last2=Unai |first2=Cabezon |last3=Unai Fernandez de |first3=Betoño |last4=Gorka |first4=Labaka |last5=Aingeru |first5=Mayor |last6=Kepa |first6=Sarasola |last7=Arkaitz |first7=Zubiaga |title=Reciprocal Enrichment Between Basque Wikipedia and Machine Translation |date=2013 |doi=10.1007/978-3-642-35085-6_4 |url=https://wikipediaquality.com/wiki/Reciprocal_Enrichment_Between_Basque_Wikipedia_and_Machine_Translation |journal=Springer Berlin Heidelberg}}

HTML

Iñaki, Alegria; Unai, Cabezon; Unai Fernandez de, Betoño; Gorka, Labaka; Aingeru, Mayor; Kepa, Sarasola; Arkaitz, Zubiaga. (2013). &quot;<a href="https://wikipediaquality.com/wiki/Reciprocal_Enrichment_Between_Basque_Wikipedia_and_Machine_Translation">Reciprocal Enrichment Between Basque Wikipedia and Machine Translation</a>&quot;. Springer Berlin Heidelberg. DOI: 10.1007/978-3-642-35085-6_4.