Navigating the Topical Structure of Academic Search Results via the Wikipedia Category Network

From Wikipedia Quality
Revision as of 23:00, 3 September 2019 by Zoe (talk | contribs) (Navigating the Topical Structure of Academic Search Results via the Wikipedia Category Network - basic info)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Navigating the Topical Structure of Academic Search Results via the Wikipedia Category Network - scientific work related to Wikipedia quality published in 2013, written by Daniil Mirylenka and Andrea Passerini.

Overview

Searching for scientific publications on the Web is a tedious task, especially when exploring an unfamiliar domain. Typical scholarly search engines produce lengthy unstructured result lists that are difficult to comprehend, interpret and browse. Authors propose a novel method of organizing the search results into concise and informative topic hierarchies. The method consists of two steps: extracting interrelated topics from the result set, and summarizing the topic graph. In the first step authors map the search results to articles and categories of Wikipedia, constructing a graph of relevant topics with hierarchical relations. In the second step authors sequentially build nested summaries of the produced topic graph using a structured output prediction approach. Trained on a small number of examples, method learns to construct informative summaries for unseen topic graphs, and outperforms unsupervised state-of-the-art Wikipedia-based clustering.