Revisiting Reverts: Accurate Revert Detection in Wikipedia

From Wikipedia Quality
Revision as of 08:18, 14 August 2020 by Aaliyah (talk | contribs) (+ category)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search


Revisiting Reverts: Accurate Revert Detection in Wikipedia
Authors
Fabian Flöck
Denny Vrandecic
Elena Simperl
Publication date
2012
DOI
10.1145/2309996.2310000
Links
Original

Revisiting Reverts: Accurate Revert Detection in Wikipedia - scientific work related to Wikipedia quality published in 2012, written by Fabian Flöck, Denny Vrandecic and Elena Simperl.

Overview

Wikipedia is commonly used as a proving ground for research in collaborative systems. This is likely due to its popularity and scale, but also to the fact that large amounts of data about its formation and evolution are freely available to inform and validate theories and models of online collaboration. As part of the development of such approaches, revert detection is often performed as an important pre-processing step in tasks as diverse as the extraction of implicit networks of editors, the analysis of edit or editor features and the removal of noise when analyzing the emergence of the content of an article. The current state of the art in revert detection is based on a rather naive approach, which identifies revision duplicates based on MD5 hash values. This is an efficient, but not very precise technique that forms the basis for the majority of research based on revert relations in Wikipedia. In this paper authors prove that this method has a number of important drawbacks - it only detects a limited number of reverts, while simultaneously misclassifying too many edits as reverts, and not distinguishing between complete and partial reverts. This is very likely to hamper the accurate interpretation of the findings of revert-related research. Authors introduce an improved algorithm for the detection of reverts based on word tokens added or deleted to adresses these drawbacks. Authors report on the results of a user study and other tests demonstrating the considerable gains in accuracy and coverage by method, and argue for a positive trade-off, in certain research scenarios, between these improvements and algorithm's increased runtime.

Embed

Wikipedia Quality

Flöck, Fabian; Vrandecic, Denny; Simperl, Elena. (2012). "[[Revisiting Reverts: Accurate Revert Detection in Wikipedia]]".DOI: 10.1145/2309996.2310000.

English Wikipedia

{{cite journal |last1=Flöck |first1=Fabian |last2=Vrandecic |first2=Denny |last3=Simperl |first3=Elena |title=Revisiting Reverts: Accurate Revert Detection in Wikipedia |date=2012 |doi=10.1145/2309996.2310000 |url=https://wikipediaquality.com/wiki/Revisiting_Reverts:_Accurate_Revert_Detection_in_Wikipedia}}

HTML

Flöck, Fabian; Vrandecic, Denny; Simperl, Elena. (2012). &quot;<a href="https://wikipediaquality.com/wiki/Revisiting_Reverts:_Accurate_Revert_Detection_in_Wikipedia">Revisiting Reverts: Accurate Revert Detection in Wikipedia</a>&quot;.DOI: 10.1145/2309996.2310000.