Combining Wikipedia-Based Concept Models for Cross-Language Retrieval
Authors | Benjamin Roth Dietrich Klakow |
---|---|
Publication date | 2010 |
DOI | 10.1007/978-3-642-13084-7_5 |
Links | Original |
Combining Wikipedia-Based Concept Models for Cross-Language Retrieval - scientific work related to Wikipedia quality published in 2010, written by Benjamin Roth and Dietrich Klakow.
Overview
As a low-cost ressource that is up-to-date, Wikipedia recently gains attention as a means to provide cross-language brigding for information retrieval. Contradictory to a previous study, authors show that standard Latent Dirichlet Allocation (LDA) can extract cross-language information that is valuable for IR by simply normalizing the training data. Furthermore, authors show that LDA and Explicit Semantic Analysis (ESA) complement each other, yielding significant improvements when combined. Such a combination can significantly contribute to retrieval based on machine translation, especially when query translations contain errors. The experiments were perfomed on the Multext JOC corpus und a CLEF dataset.
Embed
Wikipedia Quality
Roth, Benjamin; Klakow, Dietrich. (2010). "[[Combining Wikipedia-Based Concept Models for Cross-Language Retrieval]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-13084-7_5.
English Wikipedia
{{cite journal |last1=Roth |first1=Benjamin |last2=Klakow |first2=Dietrich |title=Combining Wikipedia-Based Concept Models for Cross-Language Retrieval |date=2010 |doi=10.1007/978-3-642-13084-7_5 |url=https://wikipediaquality.com/wiki/Combining_Wikipedia-Based_Concept_Models_for_Cross-Language_Retrieval |journal=Springer, Berlin, Heidelberg}}
HTML
Roth, Benjamin; Klakow, Dietrich. (2010). "<a href="https://wikipediaquality.com/wiki/Combining_Wikipedia-Based_Concept_Models_for_Cross-Language_Retrieval">Combining Wikipedia-Based Concept Models for Cross-Language Retrieval</a>". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-13084-7_5.