Dynamic Element Retrieval in the Wikipedia Collection

From Wikipedia Quality
Revision as of 10:05, 3 August 2019 by Sophie

Dynamic Element Retrieval in the Wikipedia Collection is a scientific work related to Wikipedia quality, published in 2008 and written by Carolyn J. Crouch, Donald B. Crouch, Nachiket Kamat, Vikram Malik and Aditya Mone.

Overview

This paper describes the successful adaptation of a methodology for the dynamic retrieval of XML elements to a semi-structured environment. Working with text that contains both tagged and untagged elements presents particular challenges in this context. The authors' system is based on the Vector Space Model; basic functions are performed using the Smart experimental retrieval system. Dynamic element retrieval requires only a single indexing of the document collection, at the level of the basic indexing node (i.e., the paragraph). Vectors for larger elements are built at retrieval time, and the method returns a rank-ordered list of elements identical to the one produced by running the same query against an all-element index of the collection. Experimental results are reported for both the 2006 and 2007 Ad-hoc tasks.
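
The idea of indexing only paragraphs and assembling larger elements on demand can be illustrated with a minimal sketch. It is not the authors' implementation and does not use the Smart system: the document tree, the whitespace tokenizer, the raw term-frequency weights and all function names below are assumptions made for illustration. Under those assumptions, the vector of a section or article built from its paragraph vectors matches the vector obtained by indexing the element's full text directly, so both approaches yield the same ranking.

```python
import math
from collections import Counter


def tf_vector(text):
    """Raw term-frequency vector (toy whitespace tokenizer, an assumption)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


# Hypothetical document tree: each non-leaf element lists its children.
DOC = {
    "article": ["sec1", "sec2"],
    "sec1": ["p1", "p2"],
    "sec2": ["p3"],
}

# Hypothetical paragraph texts (the basic indexing nodes).
PARAGRAPHS = {
    "p1": "xml element retrieval",
    "p2": "vector space model",
    "p3": "wikipedia collection retrieval",
}

# The only index that is ever built: one vector per paragraph.
PARA_INDEX = {pid: tf_vector(text) for pid, text in PARAGRAPHS.items()}


def leaves(elem):
    """Return the paragraph ids under an element, in document order."""
    if elem in PARAGRAPHS:
        return [elem]
    out = []
    for child in DOC[elem]:
        out.extend(leaves(child))
    return out


def dynamic_vector(elem):
    """Build an element vector at query time from the paragraph index."""
    vec = Counter()
    for pid in leaves(elem):
        vec.update(PARA_INDEX[pid])
    return vec


def all_element_vector(elem):
    """Baseline: index the element's full text directly."""
    return tf_vector(" ".join(PARAGRAPHS[pid] for pid in leaves(elem)))


def rank(query, vector_fn):
    """Rank every element (articles, sections, paragraphs) against a query."""
    q = tf_vector(query)
    elems = list(DOC) + list(PARAGRAPHS)
    scored = [(cosine(q, vector_fn(e)), e) for e in elems]
    # Sort by descending score, breaking ties by element id.
    return sorted(scored, key=lambda s: (-s[0], s[1]))
```

Calling `rank("xml retrieval", dynamic_vector)` and `rank("xml retrieval", all_element_vector)` gives the same list in this toy setting. A real system would use a more elaborate term-weighting scheme, which this sketch deliberately leaves out.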