Reciprocal Enrichment Between Basque Wikipedia and Machine Translation
Authors | Iñaki Alegria Unai Cabezon Unai Fernandez de Betoño Gorka Labaka Aingeru Mayor Kepa Sarasola Arkaitz Zubiaga |
---|---|
Publication date | 2013 |
DOI | 10.1007/978-3-642-35085-6_4 |
Links | Original Preprint |
Reciprocal Enrichment Between Basque Wikipedia and Machine Translation - scientific work related to Wikipedia quality published in 2013, written by Iñaki Alegria, Unai Cabezon, Unai Fernandez de Betoño, Gorka Labaka, Aingeru Mayor, Kepa Sarasola and Arkaitz Zubiaga.
Overview
In this chapter, authors define a collaboration framework that enables Wikipedia editors to generate new articles while they help development of Machine Translation (MT) systems by providing post-edition logs. This collaboration framework was tested with editors of Basque Wikipedia. Their post-editing of Computer Science articles has been used to improve the output of a Spanish to Basque MT system called Matxin. For the collaboration between editors and researchers, authors selected a set of 100 articles from the Spanish Wikipedia. These articles would then be used as the source texts to be translated into Basque using the MT engine. A group of volunteers from Basque Wikipedia reviewed and corrected the raw MT translations. This collaboration ultimately produced two main benefits: (i) the change logs that would potentially help improve the MT engine by using an automated statistical post-editing system, and (ii) the growth of Basque Wikipedia. The results show that this process can improve the accuracy of an Rule Based MT (RBMT) system in nearly 10 % benefiting from the post-edition of 50,000 words in the Computer Science domain. Authors believe that conclusions can be extended to MT engines involving other less-resourced languages lacking large parallel corpora or frequently updated lexical knowledge, as well as to other domains.
Embed
Wikipedia Quality
Iñaki, Alegria; Unai, Cabezon; Unai Fernandez de, Betoño; Gorka, Labaka; Aingeru, Mayor; Kepa, Sarasola; Arkaitz, Zubiaga. (2013). "[[Reciprocal Enrichment Between Basque Wikipedia and Machine Translation]]". Springer Berlin Heidelberg. DOI: 10.1007/978-3-642-35085-6_4.
English Wikipedia
{{cite journal |last1=Iñaki |first1=Alegria |last2=Unai |first2=Cabezon |last3=Unai Fernandez de |first3=Betoño |last4=Gorka |first4=Labaka |last5=Aingeru |first5=Mayor |last6=Kepa |first6=Sarasola |last7=Arkaitz |first7=Zubiaga |title=Reciprocal Enrichment Between Basque Wikipedia and Machine Translation |date=2013 |doi=10.1007/978-3-642-35085-6_4 |url=https://wikipediaquality.com/wiki/Reciprocal_Enrichment_Between_Basque_Wikipedia_and_Machine_Translation |journal=Springer Berlin Heidelberg}}
HTML
Iñaki, Alegria; Unai, Cabezon; Unai Fernandez de, Betoño; Gorka, Labaka; Aingeru, Mayor; Kepa, Sarasola; Arkaitz, Zubiaga. (2013). "<a href="https://wikipediaquality.com/wiki/Reciprocal_Enrichment_Between_Basque_Wikipedia_and_Machine_Translation">Reciprocal Enrichment Between Basque Wikipedia and Machine Translation</a>". Springer Berlin Heidelberg. DOI: 10.1007/978-3-642-35085-6_4.