Finding Similar Sentences across Multiple Languages in Wikipedia

From Wikipedia Quality
Jump to: navigation, search


Finding Similar Sentences across Multiple Languages in Wikipedia
Authors
Sisay Fissaha Adafre
Maarten De Rijke
Publication date
2006
Links
Original

Finding Similar Sentences across Multiple Languages in Wikipedia - scientific work related to Wikipedia quality published in 2006, written by Sisay Fissaha Adafre and Maarten De Rijke.

Overview

Authors investigate whether the Wikipedia corpus is amenable to multilingual analysis that aims at generating parallel corpora. Authors present the results of the application of two simple heuristics for the identification of similar text across multiple languages in Wikipedia. Despite the simplicity of the methods, evaluation carried out on a sample of Wikipedia pages shows encouraging results.

Embed

Wikipedia Quality

Adafre, S. Fissaha; De Rijke, M. (2006). "[[Finding Similar Sentences across Multiple Languages in Wikipedia]]".

English Wikipedia

{{cite journal |last1=Adafre |first1=S. Fissaha |last2=De Rijke |first2=M. |title=Finding Similar Sentences across Multiple Languages in Wikipedia |date=2006 |url=https://wikipediaquality.com/wiki/Finding_Similar_Sentences_across_Multiple_Languages_in_Wikipedia}}

HTML

Adafre, S. Fissaha; De Rijke, M. (2006). &quot;<a href="https://wikipediaquality.com/wiki/Finding_Similar_Sentences_across_Multiple_Languages_in_Wikipedia">Finding Similar Sentences across Multiple Languages in Wikipedia</a>&quot;.