Detecting Wikipedia Vandalism Using WikiTrust: Lab Report for PAN at CLEF 2010

From Wikipedia Quality
Revision as of 23:43, 3 July 2018 by Librarian (talk | contribs) (New scientific work)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search


Detecting Wikipedia Vandalism Using WikiTrust: Lab Report for PAN at CLEF 2010
Authors
Bo Thomas Adler
Luca De Alfaro
Ian Pye
Publication date
2010
ISSN
16130073
Links

Detecting Wikipedia Vandalism Using WikiTrust: Lab Report for PAN at CLEF 2010 - scientific work about Wikipedia quality published in 2010, written by Bo Thomas Adler, Luca De Alfaro and Ian Pye.

Overview

WikiTrust is a reputation system for Wikipedia authors and content. WikiTrust computes three main quantities: edit quality, author reputation, and content reputation. The edit quality measures how well each edit, that is, each change introduced in a revision, is preserved in subsequent revisions. Authors who perform good quality edits gain reputation, and text which is revised by several high-reputation authors gains reputation. Since vandalism on the Wikipedia is usually performed by anonymous or new users (not least because long-time vandals end up banned), and is usually reverted in a reasonably short span of time, edit quality, author reputation, and content reputation are obvious candidates as features to identify vandalism on the Wikipedia. Indeed, using the full set of features computed by WikiTrust, authors have been able to construct classifiers that identify vandalism with a recall of 83.5%, a precision of 48.5%, and a false positive rate of 8%, for an area under the ROC curve of 93.4%. If authors limit ourselves to the set of features available at the time an edit is made (when the edit quality is still unknown), the classifier achieves a recall of 77.1%, a precision of 36.9%, and a false positive rate of 12.2%, for an area under the ROC curve of 90.4%. Using these classifiers, authors have implemented a simple Web API that provides the vandalism estimate for every revision of the English Wikipedia. The API can be used both to identify vandalism that needs to be reverted, and to select highquality, non-vandalized recent revisions of any given Wikipedia article. These recent high-quality revisions can be included in static snapshots of theWikipedia, or they can be used whenever tolerance to vandalism is low (as in a school setting, or whenever the material is widely disseminated).

Embed

Wikipedia Quality

Adler, Bo Thomas; De Alfaro, Luca; Pye, Ian. (2010). "[[Detecting Wikipedia Vandalism Using WikiTrust: Lab Report for PAN at CLEF 2010]]". CEUR Workshop Proceedings Volume 1176, 2010. ISSN: 16130073.

English Wikipedia

{{cite journal |last1=Adler |first1=Bo Thomas |last2=De Alfaro |first2=Luca |last3=Pye |first3=Ian |title=Detecting Wikipedia Vandalism Using WikiTrust: Lab Report for PAN at CLEF 2010 |date=2010 |issn=16130073 |url=https://wikipediaquality.com/wiki/Detecting_Wikipedia_Vandalism_Using_WikiTrust:_Lab_Report_for_PAN_at_CLEF_2010 |journal=CEUR Workshop Proceedings Volume 1176, 2010}}

HTML

Adler, Bo Thomas; De Alfaro, Luca; Pye, Ian. (2010). &quot;<a href="https://wikipediaquality.com/wiki/Detecting_Wikipedia_Vandalism_Using_WikiTrust:_Lab_Report_for_PAN_at_CLEF_2010">Detecting Wikipedia Vandalism Using WikiTrust: Lab Report for PAN at CLEF 2010</a>&quot;. CEUR Workshop Proceedings Volume 1176, 2010. ISSN: 16130073.