Sawus Siena's Automatic Wikipedia Update System

From Wikipedia Quality
Revision as of 11:28, 8 November 2019 by Agnieszka (talk | contribs) (Information about: Sawus Siena's Automatic Wikipedia Update System)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Sawus Siena's Automatic Wikipedia Update System - scientific work related to Wikipedia quality published in 2012, written by Carl Tompkins, Zachary Witter and Sharon G. Small.

Overview

Abstract : The National Institute of Standards and Technology (NIST) has been running an annual Text Retrieval Competition and Conference (TREC) since 1992. This is a premier conference that offers researchers in the field of Computational Linguistics the opportunity to showcase their work and compare their results against other leading researchers. Authors Siena research team participated in the TREC Knowledge Based Acquisition (KBA) Track which was offered for the first time in 2012. The objective of this track is to drive research into automatic acquisition of knowledge such as automatically updating Wikipedia by utilizing online news. Specifically team of researchers developed a system that filters a stream of content for information that should be included on a given Wikipedia page. It was not yet clear how traditional Information Retrieval (IR) techniques perform for this task therefore authors began with a baseline test using current state of the art IR techniques. Authors then went on to experiment with query expansion building a module that utilized Wikipedia Infoboxes to add terms to query. This module was incorporated with IR component to create SAWUS. Four submissions were sent to NIST to undergo a formal evaluation.