Wikipedia-Based Unsupervised Query Classification
Authors | Milen Kouylekov, Luca Dini, Alessio Bosca, Marco Trevisan |
---|---|
Publication date | 2013 |
Wikipedia-Based Unsupervised Query Classification is a scientific work related to Wikipedia quality, published in 2013 and written by Milen Kouylekov, Luca Dini, Alessio Bosca and Marco Trevisan.
Overview
In this paper, the authors present an unsupervised approach to query classification. The approach uses the Wikipedia encyclopedia as a corpus and exploits the statistical distribution of terms, from both the category labels and the query, to select an appropriate category. The authors built a classifier that works with 55 categories extracted from the search section of the Bridgeman Art Library website. They also evaluated the approach on the labeled data of the KDD-Cup 2005 Knowledge Discovery and Data Mining competition (800,000 real user queries to be classified into 67 target categories) and obtained promising results.
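The general idea of comparing how query terms and category-label terms are distributed over a corpus can be illustrated with a small sketch. This is not the authors' method: the summary above does not specify their scoring function, so the example assumes a simple cosine similarity between per-document occurrence counts. The toy `CORPUS` stands in for Wikipedia articles, and the category names and helper functions (`build_index`, `term_vector`, `cosine`, `classify`) are hypothetical.

```python
import math
from collections import defaultdict

# Toy stand-in for a Wikipedia corpus (assumption): one string per "article".
CORPUS = [
    "impressionism painting monet oil canvas landscape",
    "portrait painting oil canvas renaissance",
    "sculpture marble bronze statue renaissance",
    "photography camera portrait black white",
    "landscape photography camera nature",
]

# Hypothetical category labels, not the 55 or 67 categories from the paper.
CATEGORIES = ["painting", "sculpture", "photography"]


def build_index(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in enumerate(docs):
        for term in text.lower().split():
            index[term].add(doc_id)
    return dict(index)


def term_vector(terms, index):
    """For each document, count how many of the given terms occur in it."""
    vec = {}
    for term in terms:
        for doc_id in index.get(term, ()):
            vec[doc_id] = vec.get(doc_id, 0) + 1
    return vec


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(value * b[key] for key, value in a.items() if key in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify(query, categories, index):
    """Return (best_category, scores); best_category is None if nothing matches."""
    query_vec = term_vector(query.lower().split(), index)
    scores = {
        cat: cosine(query_vec, term_vector(cat.lower().split(), index))
        for cat in categories
    }
    best = max(scores, key=scores.get)
    return (best if scores[best] > 0 else None), scores


if __name__ == "__main__":
    index = build_index(CORPUS)
    for query in ["monet landscape", "bronze statue"]:
        label, scores = classify(query, CATEGORIES, index)
        print(query, "->", label, scores)
```

No labeled training queries are needed here, which is what makes the setup unsupervised: the only inputs are the corpus, the category labels, and the query itself. A realistic version would need a much larger corpus, term weighting, and handling of multi-word labels.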