Unsupervised Synthesis of Multilingual Wikipedia Articles

From Wikipedia Quality
Jump to: navigation, search


Unsupervised Synthesis of Multilingual Wikipedia Articles
Authors
Chen Yuncong
Pascale Fung
Publication date
2010
Links

Unsupervised Synthesis of Multilingual Wikipedia Articles - scientific work about Wikipedia quality published in 2010, written by Chen Yuncong and Pascale Fung.

Overview

In this paper, authors propose an unsupervised approach to automatically synthesize Wikipedia articles in multiple languages. Taking an existing high-quality version of any entry as content guideline, authors extract keywords from it and use the translated keywords to query the monolingual web of the target language. Candidate excerpts or sentences are selected based on an iterative ranking function and eventually synthesized into a complete article that resembles the reference version closely. 16 English and Chinese articles across 5 domains are evaluated to show that their algorithm is domain in dependent. Both subjective evaluations by native Chinese readers and ROUGE-L scores computed with respect to standard reference articles demonstrate that synthesized articles outperform existing Chinese versions or MT texts in both content richness and readability. In practice their method can generate prototype texts for Wikipedia that facilitate later human authoring.

Embed

Wikipedia Quality

Yuncong, Chen; Fung, Pascale. (2010). "[[Unsupervised Synthesis of Multilingual Wikipedia Articles]]". International Conference on Information and Knowledge Management, Proceedings 2010, pp. 1289-1292.

English Wikipedia

{{cite journal |last1=Yuncong |first1=Chen |last2=Fung |first2=Pascale |title=Unsupervised Synthesis of Multilingual Wikipedia Articles |date=2010 |url=https://wikipediaquality.com/wiki/Unsupervised_Synthesis_of_Multilingual_Wikipedia_Articles |journal=International Conference on Information and Knowledge Management, Proceedings 2010, pp. 1289-1292}}

HTML

Yuncong, Chen; Fung, Pascale. (2010). &quot;<a href="https://wikipediaquality.com/wiki/Unsupervised_Synthesis_of_Multilingual_Wikipedia_Articles">Unsupervised Synthesis of Multilingual Wikipedia Articles</a>&quot;. International Conference on Information and Knowledge Management, Proceedings 2010, pp. 1289-1292.