Difference between revisions of "Got You!: Automatic Vandalism Detection in Wikipedia with Web-Based Shallow Syntactic-Semantic Modeling"

From Wikipedia Quality
Jump to: navigation, search
(Wikilinks)
(+ Infobox work)
Line 1: Line 1:
 +
{{Infobox work
 +
| title = Got You!: Automatic Vandalism Detection in Wikipedia with Web-Based Shallow Syntactic-Semantic Modeling
 +
| date = 2010
 +
| authors = [[William Yang Wang]]<br />[[Kathleen R. McKeown]]
 +
| doi = 10.7916/D8KP89J4
 +
| link = http://dl.acm.org/citation.cfm?id=1873781.1873910
 +
}}
 
'''Got You!: Automatic Vandalism Detection in Wikipedia with Web-Based Shallow Syntactic-Semantic Modeling''' - scientific work related to [[Wikipedia quality]] published in 2010, written by [[William Yang Wang]] and [[Kathleen R. McKeown]].
 
'''Got You!: Automatic Vandalism Detection in Wikipedia with Web-Based Shallow Syntactic-Semantic Modeling''' - scientific work related to [[Wikipedia quality]] published in 2010, written by [[William Yang Wang]] and [[Kathleen R. McKeown]].
  
 
== Overview ==
 
== Overview ==
 
Discriminating vandalism edits from non-vandalism edits in [[Wikipedia]] is a challenging task, as ill-intentioned edits can include a variety of content and be expressed in many different forms and styles. Previous studies are limited to rule-based methods and learning based on lexical [[features]], lacking in linguistic analysis. In this paper, authors propose a novel Web-based shallow syntactic-semantic modeling method, which utilizes Web search results as resource and trains topic-specific n-tag and syntactic n-gram language models to detect vandalism. By combining basic task-specific and lexical features, authors have achieved high F-[[measures]] using logistic boosting and logistic model trees classifiers, surpassing the results reported by major Wikipedia vandalism detection systems.
 
Discriminating vandalism edits from non-vandalism edits in [[Wikipedia]] is a challenging task, as ill-intentioned edits can include a variety of content and be expressed in many different forms and styles. Previous studies are limited to rule-based methods and learning based on lexical [[features]], lacking in linguistic analysis. In this paper, authors propose a novel Web-based shallow syntactic-semantic modeling method, which utilizes Web search results as resource and trains topic-specific n-tag and syntactic n-gram language models to detect vandalism. By combining basic task-specific and lexical features, authors have achieved high F-[[measures]] using logistic boosting and logistic model trees classifiers, surpassing the results reported by major Wikipedia vandalism detection systems.

Revision as of 01:07, 5 February 2021


Got You!: Automatic Vandalism Detection in Wikipedia with Web-Based Shallow Syntactic-Semantic Modeling
Authors
William Yang Wang
Kathleen R. McKeown
Publication date
2010
DOI
10.7916/D8KP89J4
Links
Original

Got You!: Automatic Vandalism Detection in Wikipedia with Web-Based Shallow Syntactic-Semantic Modeling - scientific work related to Wikipedia quality published in 2010, written by William Yang Wang and Kathleen R. McKeown.

Overview

Discriminating vandalism edits from non-vandalism edits in Wikipedia is a challenging task, as ill-intentioned edits can include a variety of content and be expressed in many different forms and styles. Previous studies are limited to rule-based methods and learning based on lexical features, lacking in linguistic analysis. In this paper, authors propose a novel Web-based shallow syntactic-semantic modeling method, which utilizes Web search results as resource and trains topic-specific n-tag and syntactic n-gram language models to detect vandalism. By combining basic task-specific and lexical features, authors have achieved high F-measures using logistic boosting and logistic model trees classifiers, surpassing the results reported by major Wikipedia vandalism detection systems.