Difference between revisions of "Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach"

From Wikipedia Quality
(Information about: Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach)
 
(Wikilinks)
Line 1: Line 1:
'''Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach''' - scientific work related to Wikipedia quality published in 2010, written by Amit Belani.
+
'''Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach''' - scientific work related to [[Wikipedia quality]] published in 2010, written by [[Amit Belani]].
  
 
== Overview ==
 
== Overview ==
A bag-of-words based probabilistic classifier is trained using regularized logistic regression to detect vandalism in the English Wikipedia. Isotonic regression is used to calibrate the class membership probabilities. Learning curve, reliability, ROC, and cost analysis are performed.
+
A bag-of-words based probabilistic classifier is trained using regularized logistic regression to detect vandalism in the [[English Wikipedia]]. Isotonic regression is used to calibrate the class membership probabilities. Learning curve, [[reliability]], ROC, and cost analysis are performed.

Revision as of 09:44, 7 May 2020

Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach is a scientific work related to Wikipedia quality, published in 2010 and written by Amit Belani.

Overview

A probabilistic bag-of-words classifier is trained using regularized logistic regression to detect vandalism in the English Wikipedia. Isotonic regression is used to calibrate the class membership probabilities. Learning-curve, reliability, ROC, and cost analyses are performed.
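
The pipeline described above (bag-of-words features, regularized logistic regression, isotonic calibration) can be illustrated with a minimal, standard-library-only sketch. It is not the paper's implementation: the toy edits, the whitespace tokenizer, the L2 penalty, the hyperparameters, and all function names are assumptions made for illustration only.

```python
import math
from collections import Counter

def bag_of_words(text):
    """Map a text to token counts; word order is discarded."""
    return Counter(text.lower().split())

def predict(weights, bias, doc):
    """Uncalibrated probability that a bag-of-words document is vandalism."""
    z = bias + sum(weights.get(tok, 0.0) * cnt for tok, cnt in doc.items())
    z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-z))

def train_logreg(docs, labels, l2=0.01, lr=0.5, epochs=200):
    """Fit L2-regularized logistic regression by batch gradient descent.
    The penalty type and hyperparameters are assumptions, not from the paper."""
    weights, bias, n = {}, 0.0, len(docs)
    for _ in range(epochs):
        grad_w, grad_b = Counter(), 0.0
        for doc, y in zip(docs, labels):
            err = predict(weights, bias, doc) - y
            grad_b += err
            for tok, cnt in doc.items():
                grad_w[tok] += err * cnt
        for tok in set(weights) | set(grad_w):
            w = weights.get(tok, 0.0)
            weights[tok] = w - lr * (grad_w[tok] / n + l2 * w)
        bias -= lr * grad_b / n
    return weights, bias

def isotonic_fit(scores, labels):
    """Pool-adjacent-violators: fit a non-decreasing step function
    from raw scores to observed label frequencies."""
    blocks = []  # each block is [sum_of_labels, count, upper_score]
    for score, y in sorted(zip(scores, labels)):
        blocks.append([float(y), 1, score])
        # Merge backwards while the previous block's mean is not smaller.
        while (len(blocks) > 1 and
               blocks[-2][0] / blocks[-2][1] >= blocks[-1][0] / blocks[-1][1]):
            total, count, upper = blocks.pop()
            blocks[-1][0] += total
            blocks[-1][1] += count
            blocks[-1][2] = upper
    return [(upper, total / count) for total, count, upper in blocks]

def calibrate(steps, score):
    """Return the value of the first block whose upper bound covers the score."""
    for upper, value in steps:
        if score <= upper:
            return value
    return steps[-1][1]

# Hypothetical toy edits (1 = vandalism, 0 = regular); not data from the paper.
edits = [
    ("you are stupid lol", 1),
    ("stupid page lol lol", 1),
    ("added citation for the 2010 census", 0),
    ("fixed typo in history section", 0),
]
docs = [bag_of_words(text) for text, _ in edits]
labels = [label for _, label in edits]

weights, bias = train_logreg(docs, labels)
raw_scores = [predict(weights, bias, doc) for doc in docs]
# For brevity the calibrator is fit on the training edits themselves;
# a held-out set would normally be used instead.
steps = isotonic_fit(raw_scores, labels)

new_edit = bag_of_words("this page is stupid lol")
print(calibrate(steps, predict(weights, bias, new_edit)))
```

The calibration step is kept separate from training so that the raw logistic scores can be remapped to empirical vandalism frequencies without refitting the classifier.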