An English-Translated Parallel Corpus for the CJK Wikipedia Collections



Authors: Ling-Xiang Tang, Shlomo Geva, Andrew Trotman
Publication date: 2012
DOI: 10.1145/2407085.2407099

An English-Translated Parallel Corpus for the CJK Wikipedia Collections is a scientific work related to Wikipedia quality, published in 2012 and written by Ling-Xiang Tang, Shlomo Geva and Andrew Trotman.

Overview

In this paper, the authors describe a machine-translated parallel English corpus for the NTCIR Chinese, Japanese and Korean (CJK) Wikipedia collections. The document collection is named the CJK2E Wikipedia XML corpus. The corpus could serve both the information retrieval research community and knowledge sharing in Wikipedia; for example, it could be used for experiments in cross-lingual information retrieval, cross-lingual link discovery, or omni-lingual information retrieval. Furthermore, the translated CJK articles could be used to expand the current coverage of the English Wikipedia.

Embed

Wikipedia Quality

Tang, Ling-Xiang; Geva, Shlomo; Trotman, Andrew. (2012). "[[An English-Translated Parallel Corpus for the Cjk Wikipedia Collections]]". DOI: 10.1145/2407085.2407099.

English Wikipedia

{{cite journal |last1=Tang |first1=Ling-Xiang |last2=Geva |first2=Shlomo |last3=Trotman |first3=Andrew |title=An English-Translated Parallel Corpus for the Cjk Wikipedia Collections |date=2012 |doi=10.1145/2407085.2407099 |url=https://wikipediaquality.com/wiki/An_English-Translated_Parallel_Corpus_for_the_Cjk_Wikipedia_Collections}}

HTML

Tang, Ling-Xiang; Geva, Shlomo; Trotman, Andrew. (2012). &quot;<a href="https://wikipediaquality.com/wiki/An_English-Translated_Parallel_Corpus_for_the_Cjk_Wikipedia_Collections">An English-Translated Parallel Corpus for the Cjk Wikipedia Collections</a>&quot;. DOI: 10.1145/2407085.2407099.