Query Classification Using Wikipedia's Category Graph

From Wikipedia Quality
Revision as of 08:13, 3 July 2019 by Madison (talk | contribs) (Starting a page: Query Classification Using Wikipedia's Category Graph)

Query Classification Using Wikipedia's Category Graph is a scientific work related to Wikipedia quality, published in 2012 and written by Milad Alemzadeh, Richard Khoury and Fakhri Karray.

Overview

Wikipedia's category graph is a network of 300,000 interconnected category labels, and can be a powerful resource for many classification tasks. However, its size and lack of order can make it difficult to navigate. In this paper, the authors present a new algorithm to efficiently exploit this graph and accurately rank classification labels given user-specified keywords. They highlight multiple possible variations of this algorithm, and study the impact of these variations on the classification results in order to determine the optimal way to exploit the category graph. The authors implement the algorithm as the core of a query classification system and demonstrate its reliability using the KDD CUP 2005 and TREC 2007 competitions as benchmarks.
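
The general idea of ranking category labels from keywords can be illustrated with a minimal sketch. This is not the authors' algorithm, which this page does not specify; it only assumes one plausible scheme in which each keyword maps to some starting categories and its weight is propagated upward to parent categories with a decay factor. The toy graph, the keyword mapping, the function name rank_labels and the parameters max_depth and decay are all hypothetical.

```python
from collections import defaultdict

# Hypothetical toy category graph (child -> parents).
# The real Wikipedia graph has roughly 300,000 category labels.
PARENTS = {
    "Jazz pianists": ["Jazz musicians", "Pianists"],
    "Jazz musicians": ["Musicians"],
    "Pianists": ["Musicians"],
    "Guitars": ["String instruments"],
    "String instruments": ["Musical instruments"],
    "Musicians": ["Music"],
    "Musical instruments": ["Music"],
}

# Hypothetical mapping from keywords to their starting categories.
KEYWORD_CATEGORIES = {
    "piano": ["Pianists"],
    "jazz": ["Jazz musicians", "Jazz pianists"],
    "guitar": ["Guitars"],
}


def rank_labels(keywords, max_depth=2, decay=0.5):
    """Rank category labels for a list of keywords (illustrative only).

    Each starting category receives weight 1.0 per keyword. The weight is
    multiplied by `decay` at every step up the graph, for at most
    `max_depth` steps. Scores are summed over all keywords.
    """
    scores = defaultdict(float)
    for keyword in keywords:
        starts = KEYWORD_CATEGORIES.get(keyword.lower(), [])
        frontier = {category: 1.0 for category in starts}
        seen = set()  # each category is scored once per keyword
        for _ in range(max_depth + 1):
            next_frontier = defaultdict(float)
            for category, weight in frontier.items():
                if category in seen:
                    continue
                seen.add(category)
                scores[category] += weight
                for parent in PARENTS.get(category, []):
                    next_frontier[parent] += weight * decay
            frontier = next_frontier
    # Highest score first; ties are broken alphabetically.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


if __name__ == "__main__":
    for label, score in rank_labels(["jazz", "piano"]):
        print(f"{label}: {score:.2f}")
```

Choices such as how far to climb the graph or how quickly the weight decays are examples of the kind of variations whose impact on classification results would need to be studied; the values used here are arbitrary.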