Difference between revisions of "Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach"
(Wikilinks) |
(Infobox work) |
||
Line 1: | Line 1: | ||
+ | {{Infobox work | ||
+ | | title = Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach | ||
+ | | date = 2010 | ||
+ | | authors = [[Amit Belani]] | ||
+ | | link = https://seer.ufmg.br/index.php/jidm/article/download/140/96 | ||
+ | | plink = https://arxiv.org/abs/1001.0700 | ||
+ | }} | ||
'''Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach''' - scientific work related to [[Wikipedia quality]] published in 2010, written by [[Amit Belani]]. | '''Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach''' - scientific work related to [[Wikipedia quality]] published in 2010, written by [[Amit Belani]]. | ||
== Overview == | == Overview == | ||
A bag-of-words based probabilistic classifier is trained using regularized logistic regression to detect vandalism in the [[English Wikipedia]]. Isotonic regression is used to calibrate the class membership probabilities. Learning curve, [[reliability]], ROC, and cost analysis are performed. | A bag-of-words based probabilistic classifier is trained using regularized logistic regression to detect vandalism in the [[English Wikipedia]]. Isotonic regression is used to calibrate the class membership probabilities. Learning curve, [[reliability]], ROC, and cost analysis are performed. |
Revision as of 13:26, 8 May 2020
Authors | Amit Belani |
---|---|
Publication date | 2010 |
Links | Original Preprint |
Vandalism Detection in Wikipedia: a Bag-Of-Words Classifier Approach - scientific work related to Wikipedia quality published in 2010, written by Amit Belani.
Overview
A bag-of-words based probabilistic classifier is trained using regularized logistic regression to detect vandalism in the English Wikipedia. Isotonic regression is used to calibrate the class membership probabilities. Learning curve, reliability, ROC, and cost analysis are performed.