A Corpus-Based Study of Edit Categories in Featured and Non-Featured Wikipedia Articles


Authors: Johannes Daxenberger, Iryna Gurevych
Publication date: 2012

A Corpus-Based Study of Edit Categories in Featured and Non-Featured Wikipedia Articles is a scientific work about Wikipedia quality, published in 2012 and written by Johannes Daxenberger and Iryna Gurevych.

Overview

In this paper, the authors present a study of the collaborative writing process in Wikipedia. Their work is based on a corpus of 1,995 edits obtained from 891 article revisions in the English Wikipedia. They propose a 21-category classification scheme for edits based on Faigley and Witte's (1981) model. Example edit categories include spelling error corrections and vandalism. In a manual multi-label annotation study with 3 annotators, they obtain an inter-annotator agreement of α = 0.67. The authors further analyze the distribution of edit categories for distinct stages in the revision history of 10 featured and 10 non-featured articles. Their results show that the information content in featured articles tends to become more stable after their promotion, whereas this is not true for non-featured articles. The authors make the resulting corpus and the annotation guidelines freely available.
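
To make the agreement figure more concrete, the following is a minimal sketch of how an α coefficient can be computed. It assumes that α denotes Krippendorff's alpha with a nominal distance, and it simplifies the setting to one label per annotator per edit, so it does not reproduce the multi-label procedure of the study. The function name and the example labels are hypothetical and are not taken from the paper's corpus.

```python
from collections import Counter
from itertools import permutations


def krippendorff_alpha_nominal(units):
    """Krippendorff's alpha for nominal labels (illustrative sketch).

    `units` is a list of items; each item is the list of labels that the
    annotators assigned to it. Missing annotations are simply left out.
    """
    # Coincidence matrix: every ordered pair of labels from two different
    # annotators on the same item contributes 1 / (m - 1), where m is the
    # number of labels available for that item.
    coincidences = Counter()
    for labels in units:
        m = len(labels)
        if m < 2:
            continue  # an item with fewer than two labels cannot be paired
        for a, b in permutations(labels, 2):
            coincidences[(a, b)] += 1.0 / (m - 1)

    # Marginal totals per label and the overall number of pairable values.
    totals = Counter()
    for (a, _b), weight in coincidences.items():
        totals[a] += weight
    n = sum(totals.values())
    if n <= 1:
        raise ValueError("need at least two pairable labels")

    # Observed disagreement: mass of the off-diagonal coincidences.
    observed = sum(w for (a, b), w in coincidences.items() if a != b)
    # Expected disagreement under chance pairing of the same label totals.
    expected = sum(
        totals[a] * totals[b] for a in totals for b in totals if a != b
    ) / (n - 1)

    if expected == 0:
        return 1.0  # only one label was ever used, so nothing can disagree
    return 1.0 - observed / expected


if __name__ == "__main__":
    # Hypothetical toy data: 4 edits, 3 annotators, one label each.
    toy = [
        ["spelling", "spelling", "spelling"],
        ["vandalism", "vandalism", "spelling"],
        ["vandalism", "vandalism", "vandalism"],
        ["spelling", "spelling", "spelling"],
    ]
    print(round(krippendorff_alpha_nominal(toy), 3))
```

A value of 1 indicates perfect agreement and a value near 0 indicates agreement at chance level, which is the scale on which the reported α = 0.67 should be read.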

Embed

Wikipedia Quality

Daxenberger, Johannes; Gurevych, Iryna. (2012). "[[A Corpus-Based Study of Edit Categories in Featured and Non-Featured Wikipedia Articles]]". ACM International Conference Proceeding Series 2012, pp. 764-772.

English Wikipedia

{{cite journal |last1=Daxenberger |first1=Johannes |last2=Gurevych |first2=Iryna |title=A Corpus-Based Study of Edit Categories in Featured and Non-Featured Wikipedia Articles |date=2012 |url=https://wikipediaquality.com/wiki/A_Corpus-Based_Study_of_Edit_Categories_in_Featured_and_Non-Featured_Wikipedia_Articles |journal=ACM International Conference Proceeding Series 2012, pp. 764-772}}

HTML

Daxenberger, Johannes; Gurevych, Iryna. (2012). &quot;<a href="https://wikipediaquality.com/wiki/A_Corpus-Based_Study_of_Edit_Categories_in_Featured_and_Non-Featured_Wikipedia_Articles">A Corpus-Based Study of Edit Categories in Featured and Non-Featured Wikipedia Articles</a>&quot;. ACM International Conference Proceeding Series 2012, pp. 764-772.