Wikipedia Pages as Entry Points for Book Search

From Wikipedia Quality
Revision as of 19:32, 19 October 2019 by Sadie (talk | contribs) (+ infobox)
Jump to: navigation, search


Wikipedia Pages as Entry Points for Book Search
Authors
Marijn Koolen
Gabriella Kazai
Nick Craswell
Publication date
2009
DOI
10.1145/1498759.1498807
Links
Original Preprint

Wikipedia Pages as Entry Points for Book Search - scientific work related to Wikipedia quality published in 2009, written by Marijn Koolen, Gabriella Kazai and Nick Craswell.

Overview

A lot of the world's knowledge is stored in books, which, as a result of recent mass-digitisation efforts, are increasingly available online. Search engines, such as Google Books, provide mechanisms for searchers to enter this vast knowledge space using queries as entry points. In this paper, authors view Wikipedia as a summary of this world knowledge and aim to use this resource to guide users to relevant books. Thus, authors investigate possible ways of using Wikipedia as an intermediary between the user's query and a collection of books being searched. Authors experiment with traditional query expansion techniques, exploiting Wikipedia articles as rich sources of information that can augment the user's query. Authors then propose a novel approach based on link distance in an extended Wikipedia graph: authors associate books with Wikipedia pages that cite these books and use the link distance between these nodes and the pages that match the user query as an estimation of a book's relevance to the query. Authors results show that a) classical query expansion using terms extracted from query pages leads to increased precision, and b) link distance between query and book pages in Wikipedia provides a good indicator of relevance that can boost the retrieval score of relevant books in the result ranking of a book search engine.