Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach

From Wikipedia Quality
Revision as of 07:40, 17 June 2019 by Autumn (talk | contribs) (Information about: Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach)

Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach is a scientific work related to Wikipedia quality, published in 2010 and written by Amit Belani.

Overview

A bag-of-words-based probabilistic classifier is trained using regularized logistic regression to detect vandalism in the English Wikipedia. Isotonic regression is used to calibrate the class membership probabilities. Learning curve, reliability, ROC, and cost analyses are performed.
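
The pipeline described above (bag-of-words features, regularized logistic regression, isotonic calibration) can be illustrated with a minimal standard-library sketch. It is not the author's implementation. The toy edits, the whitespace tokenizer, the L2 penalty, the hyperparameters, and all function names are assumptions chosen for illustration, and the calibration step is fitted on the training scores only to keep the example short.

```python
import math
from collections import Counter

def bag_of_words(text):
    """Map a text to token counts (lowercased whitespace tokens; an assumed tokenizer)."""
    return Counter(text.lower().split())

def predict_score(weights, bias, features):
    """Uncalibrated logistic score in (0, 1) for one bag of words."""
    z = bias + sum(weights.get(tok, 0.0) * cnt for tok, cnt in features.items())
    z = max(-30.0, min(30.0, z))  # clamp to avoid overflow in exp
    return 1.0 / (1.0 + math.exp(-z))

def train_logistic(samples, labels, l2=0.01, lr=0.5, epochs=200):
    """Batch gradient descent on the L2-regularized logistic loss (assumed settings)."""
    weights, bias = {}, 0.0
    feats = [bag_of_words(s) for s in samples]
    n = len(feats)
    for _ in range(epochs):
        grad_w, grad_b = Counter(), 0.0
        for f, y in zip(feats, labels):
            err = predict_score(weights, bias, f) - y
            grad_b += err
            for tok, cnt in f.items():
                grad_w[tok] += err * cnt
        for tok in set(grad_w) | set(weights):
            w = weights.get(tok, 0.0)
            weights[tok] = w - lr * (grad_w[tok] / n + l2 * w)
        bias -= lr * grad_b / n  # the bias term is left unregularized
    return weights, bias

def fit_isotonic(scores, labels):
    """Pool-adjacent-violators: returns (upper_score, probability) steps in score order."""
    blocks = []  # each block holds [sum_of_labels, count, highest_score]
    for s, y in sorted(zip(scores, labels)):
        blocks.append([float(y), 1, s])
        # Merge backwards while the previous block's mean exceeds the last block's mean.
        while (len(blocks) > 1
               and blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            total, count, top = blocks.pop()
            blocks[-1][0] += total
            blocks[-1][1] += count
            blocks[-1][2] = top
    return [(top, total / count) for total, count, top in blocks]

def calibrate(steps, score):
    """Step-function lookup: probability of the first block covering the score."""
    for top, prob in steps:
        if score <= top:
            return prob
    return steps[-1][1]

# Hypothetical toy data: 1 = vandalism, 0 = regular edit.
edits = [
    "added citation for the population figure",
    "fixed typo in history section",
    "you suck lol lol",
    "this page is stupid lol",
    "updated infobox with new reference",
    "stupid stupid you suck",
]
labels = [0, 0, 1, 1, 0, 1]

weights, bias = train_logistic(edits, labels)
raw = [predict_score(weights, bias, bag_of_words(e)) for e in edits]
steps = fit_isotonic(raw, labels)
calibrated = [calibrate(steps, r) for r in raw]
```

A new edit would be scored with `calibrate(steps, predict_score(weights, bias, bag_of_words(text)))`. In practice the isotonic step would be fitted on held-out data rather than on the training scores, and tied scores would be pooled before fitting; both are omitted here for brevity.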