A Wikipedia-Based Multilingual Retrieval Model

From Wikipedia Quality
Revision as of 06:04, 25 February 2021 by Phoebe (talk | contribs) (Cats.)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search


A Wikipedia-Based Multilingual Retrieval Model
Authors
Martin Potthast
Benno Stein
Maik Anderka
Publication date
2008
DOI
10.1007/978-3-540-78646-7_51
Links
Original

A Wikipedia-Based Multilingual Retrieval Model - scientific work related to Wikipedia quality published in 2008, written by Martin Potthast, Benno Stein and Maik Anderka.

Overview

This paper introduces CL-ESA, a new multilingual retrieval model for the analysis of cross-language similarity. The retrieval model exploits the multilingual alignment of Wikipedia: given a document d written in language L authors construct a concept vector d for d, where each dimension i in d quantifies the similarity of d with respect to a document di* chosen from the "L-subset" of Wikipedia. Likewise, for a second document d′ written in language L′, L ≠ L′, authors construct a concept vector d′, using from the L′-subset of the Wikipedia the topic-aligned counterparts d′i* of previously chosen documents.

Embed

Wikipedia Quality

Potthast, Martin; Stein, Benno; Anderka, Maik. (2008). "[[A Wikipedia-Based Multilingual Retrieval Model]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-540-78646-7_51.

English Wikipedia

{{cite journal |last1=Potthast |first1=Martin |last2=Stein |first2=Benno |last3=Anderka |first3=Maik |title=A Wikipedia-Based Multilingual Retrieval Model |date=2008 |doi=10.1007/978-3-540-78646-7_51 |url=https://wikipediaquality.com/wiki/A_Wikipedia-Based_Multilingual_Retrieval_Model |journal=Springer, Berlin, Heidelberg}}

HTML

Potthast, Martin; Stein, Benno; Anderka, Maik. (2008). &quot;<a href="https://wikipediaquality.com/wiki/A_Wikipedia-Based_Multilingual_Retrieval_Model">A Wikipedia-Based Multilingual Retrieval Model</a>&quot;. Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-540-78646-7_51.