A Wikipedia-Based Multilingual Retrieval Model
Authors | Martin Potthast, Benno Stein, Maik Anderka |
---|---|
Publication date | 2008 |
DOI | 10.1007/978-3-540-78646-7_51 |
A Wikipedia-Based Multilingual Retrieval Model is a scientific work related to Wikipedia quality, published in 2008 and written by Martin Potthast, Benno Stein and Maik Anderka.
Overview
This paper introduces CL-ESA, a new multilingual retrieval model for the analysis of cross-language similarity. The retrieval model exploits the multilingual alignment of Wikipedia: given a document d written in language L, the authors construct a concept vector **d** for d, where each dimension i of **d** quantifies the similarity of d to a document d_i* chosen from the "L-subset" of Wikipedia. Likewise, for a second document d′ written in a language L′ ≠ L, the authors construct a concept vector **d′**, using the topic-aligned counterparts d′_i* of the previously chosen documents, taken from the L′-subset of Wikipedia.
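The construction described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the tiny "index" texts stand in for topic-aligned Wikipedia articles, raw term frequencies and cosine similarity are assumed as the weighting and similarity measure, and all function and variable names (`tf_vector`, `concept_vector`, `cl_esa_similarity`, `index_en`, `index_de`) are hypothetical.

```python
import math
from collections import Counter


def tf_vector(text):
    """Bag-of-words term-frequency vector (lowercased whitespace tokens)."""
    return Counter(text.lower().split())


def cosine(u, v):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def concept_vector(doc, index_docs):
    """Dimension i holds the similarity of doc to the i-th index document."""
    d = tf_vector(doc)
    return [cosine(d, tf_vector(article)) for article in index_docs]


def cl_esa_similarity(doc, index_docs, doc_prime, index_docs_prime):
    """Compare two documents in different languages via their concept vectors.

    index_docs[i] and index_docs_prime[i] must describe the same topic.
    Comparing the concept vectors by cosine is an assumption of this sketch.
    """
    u = concept_vector(doc, index_docs)
    v = concept_vector(doc_prime, index_docs_prime)
    return cosine(dict(enumerate(u)), dict(enumerate(v)))


# Toy stand-ins for topic-aligned articles (assumed data, not real Wikipedia text).
index_en = ["dog animal pet bark", "car engine road wheel", "music song guitar band"]
index_de = ["hund tier haustier bellen", "auto motor strasse rad", "musik lied gitarre band"]

doc_en = "the dog is a loyal pet"
doc_de_dog = "der hund ist ein treues haustier"
doc_de_car = "das auto hat einen starken motor"

sim_dog = cl_esa_similarity(doc_en, index_en, doc_de_dog, index_de)
sim_car = cl_esa_similarity(doc_en, index_en, doc_de_car, index_de)
print(sim_dog, sim_car)
```

In this toy setup the English and German documents share no vocabulary, yet the pair about the same topic scores higher than the unrelated pair, because each document is compared only with index texts in its own language and the resulting concept vectors share aligned dimensions.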
Embed
Wikipedia Quality
Potthast, Martin; Stein, Benno; Anderka, Maik. (2008). "[[A Wikipedia-Based Multilingual Retrieval Model]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-540-78646-7_51.
English Wikipedia
{{cite journal |last1=Potthast |first1=Martin |last2=Stein |first2=Benno |last3=Anderka |first3=Maik |title=A Wikipedia-Based Multilingual Retrieval Model |date=2008 |doi=10.1007/978-3-540-78646-7_51 |url=https://wikipediaquality.com/wiki/A_Wikipedia-Based_Multilingual_Retrieval_Model |journal=Springer, Berlin, Heidelberg}}
HTML
Potthast, Martin; Stein, Benno; Anderka, Maik. (2008). "<a href="https://wikipediaquality.com/wiki/A_Wikipedia-Based_Multilingual_Retrieval_Model">A Wikipedia-Based Multilingual Retrieval Model</a>". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-540-78646-7_51.