Finding Similar Sentences across Multiple Languages in Wikipedia
Authors | Sisay Fissaha Adafre Maarten De Rijke |
---|---|
Publication date | 2006 |
Links | Original |
Finding Similar Sentences across Multiple Languages in Wikipedia - scientific work related to Wikipedia quality published in 2006, written by Sisay Fissaha Adafre and Maarten De Rijke.
Overview
Authors investigate whether the Wikipedia corpus is amenable to multilingual analysis that aims at generating parallel corpora. Authors present the results of the application of two simple heuristics for the identification of similar text across multiple languages in Wikipedia. Despite the simplicity of the methods, evaluation carried out on a sample of Wikipedia pages shows encouraging results.
Embed
Wikipedia Quality
Adafre, S. Fissaha; De Rijke, M. (2006). "[[Finding Similar Sentences across Multiple Languages in Wikipedia]]".
English Wikipedia
{{cite journal |last1=Adafre |first1=S. Fissaha |last2=De Rijke |first2=M. |title=Finding Similar Sentences across Multiple Languages in Wikipedia |date=2006 |url=https://wikipediaquality.com/wiki/Finding_Similar_Sentences_across_Multiple_Languages_in_Wikipedia}}
HTML
Adafre, S. Fissaha; De Rijke, M. (2006). "<a href="https://wikipediaquality.com/wiki/Finding_Similar_Sentences_across_Multiple_Languages_in_Wikipedia">Finding Similar Sentences across Multiple Languages in Wikipedia</a>".