Automated Creation of Wikipedia Articles

From Wikipedia Quality
Revision as of 08:52, 22 May 2019 by Agnieszka (talk | contribs) (Information about: Automated Creation of Wikipedia Articles)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search


Automated Creation of Wikipedia Articles
Authors
Christina Sauper
Publication date
2009
Links
Original

Automated Creation of Wikipedia Articles - scientific work related to Wikipedia quality published in 2009, written by Christina Sauper.

Overview

This thesis describes an automatic approach for producing Wikipedia articles. The wealth of information present on the Internet is currently untapped for many topics of secondary concern. Creating articles requires a great deal of time spent collecting information and editing. This thesis presents a solution. The proposed algorithm creates a new article by querying the Internet, selecting relevant excerpts from the search results, and synthesizing the best excerpts into a coherent document. This work builds on previous work in document summarization, web question answering, and Integer Linear Programming. At the core of approach is a method for using existing human-authored Wikipedia articles to learn a content selection mechanism. Articles in the same category often present similar types of information; authors can leverage this to create content templates for new articles. Once a template has been created, authors use classification and clustering techniques to select a single best excerpt for each section. Finally, authors use Integer Linear Programming techniques to eliminate any redundancy over the complete article. Authors evaluate system for both individual sections and complete articles, using both human and automatic evaluation methods. The results indicate that articles created by system are close to human-authored Wikipedia entries in quality of content selection. Authors show that both human and automatic evaluation metrics are in agreement; therefore, automatic methods are a reasonable evaluation tool for this task. Authors also empirically demonstrate that explicit modeling of content structure is essential for improving the quality of an automatically-produced article. Thesis Supervisor: Regina Barzilay Title: Associate Professor

Embed

Wikipedia Quality

Sauper, Christina. (2009). "[[Automated Creation of Wikipedia Articles]]". Massachusetts Institute of Technology.

English Wikipedia

{{cite journal |last1=Sauper |first1=Christina |title=Automated Creation of Wikipedia Articles |date=2009 |url=https://wikipediaquality.com/wiki/Automated_Creation_of_Wikipedia_Articles |journal=Massachusetts Institute of Technology}}

HTML

Sauper, Christina. (2009). &quot;<a href="https://wikipediaquality.com/wiki/Automated_Creation_of_Wikipedia_Articles">Automated Creation of Wikipedia Articles</a>&quot;. Massachusetts Institute of Technology.