Automatically Classifying Edit Categories in Wikipedia Revisions

From Wikipedia Quality
Revision as of 00:39, 4 July 2018 by Librarian (talk | contribs) (New scientific work)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search
Automatically Classifying Edit Categories in Wikipedia Revisions
Authors
Johannes Daxenberger
Iryna Gurevych
Publication date
2013
ISBN
978-193728497-8
Links

Automatically Classifying Edit Categories in Wikipedia Revisions - scientific work about Wikipedia quality published in 2013, written by Johannes Daxenberger and Iryna Gurevych.

Overview

In this paper, authors analyze a novel set of features for the task of automatic edit category classification. Edit category classification assigns categories such as spelling error correction, paraphrase or vandalism to edits in a document. Their features are based on differences between two versions of a document including meta data, textual and language properties and markup. In a supervised machine learning experiment, authors achieve a micro-averaged F1 score of 62 on a corpus of edits from the English Wikipedia. In this corpus, each edit has been multi-labeled according to a 21-category taxonomy. A model trained on the same data achieves state-of-the-art performance on the related task of fluency edit classification. Authors apply pattern mining to automatically labeled edits in the revision histories of different Wikipedia articles. Their results suggest that high-quality articles show a higher degree of homogeneity with respect to their collaboration patterns as compared to random articles.

Embed

Wikipedia Quality

Daxenberger, Johannes; Gurevych, Iryna. (2013). "[[Automatically Classifying Edit Categories in Wikipedia Revisions]]". IEEE International Conference on Communications 2013, Article number 6655082, pp. 3444-3449. ISBN: 978-193728497-8.

English Wikipedia

{{cite journal |last1=Daxenberger |first1=Johannes |last2=Gurevych |first2=Iryna |title=Automatically Classifying Edit Categories in Wikipedia Revisions |date=2013 |isbn=978-193728497-8 |url=https://wikipediaquality.com/wiki/Automatically_Classifying_Edit_Categories_in_Wikipedia_Revisions |journal=IEEE International Conference on Communications 2013, Article number 6655082, pp. 3444-3449}}

HTML

Daxenberger, Johannes; Gurevych, Iryna. (2013). &quot;<a href="https://wikipediaquality.com/wiki/Automatically_Classifying_Edit_Categories_in_Wikipedia_Revisions">Automatically Classifying Edit Categories in Wikipedia Revisions</a>&quot;. IEEE International Conference on Communications 2013, Article number 6655082, pp. 3444-3449. ISBN: 978-193728497-8.