Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach

From Wikipedia Quality
Revision as of 12:26, 8 May 2020 by Camila (talk | contribs) (Infobox work)
Jump to: navigation, search


Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach
Authors
Amit Belani
Publication date
2010
Links
Original Preprint

Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach - scientific work related to Wikipedia quality published in 2010, written by Amit Belani.

Overview

A bag-of-words based probabilistic classifier is trained using regularized logistic regression to detect vandalism in the English Wikipedia. Isotonic regression is used to calibrate the class membership probabilities. Learning curve, reliability, ROC, and cost analysis are performed.