Difference between revisions of "Machine Learning based Detection of Vandalism in Wikipedia Across Languages"

From Wikipedia Quality
Jump to: navigation, search
(Overview: Machine Learning based Detection of Vandalism in Wikipedia Across Languages)
 
(wikilinks)
Line 1: Line 1:
'''Machine Learning based Detection of Vandalism in Wikipedia Across Languages''' - scientific work related to Wikipedia quality published in 2016, written by Arsim Susuri, Mentor Hamiti and Agni Dika.
+
'''Machine Learning based Detection of Vandalism in Wikipedia Across Languages''' - scientific work related to [[Wikipedia quality]] published in 2016, written by [[Arsim Susuri]], [[Mentor Hamiti]] and [[Agni Dika]].
  
 
== Overview ==
 
== Overview ==
Applying machine learning algorithms for detecting vandalism in two languages are described in this paper. Vandalism is a major issue in Wikipedia as it accounts for about 1% of edits during 2015. The majority of vandalism is from human editors, whose vandalism can be traced through access and edit logs. In this paper, authors propose using a list of classifiers in one language, and then evaluate them across languages in two datasets: the hourly count of views of each Wikipedia article, and the used edit history of articles. For this purpose, Simple English and Albanian Wikipedia datasets will be used. The results obtained show that the characteristic features of vandalism can be learned from view and edit patterns, and models built in one language can be applied successfully to other languages.
+
Applying machine learning algorithms for detecting vandalism in two languages are described in this paper. Vandalism is a major issue in [[Wikipedia]] as it accounts for about 1% of edits during 2015. The majority of vandalism is from human editors, whose vandalism can be traced through access and edit logs. In this paper, authors propose using a list of classifiers in one language, and then evaluate them across languages in two datasets: the hourly count of views of each Wikipedia article, and the used edit history of articles. For this purpose, Simple English and Albanian Wikipedia datasets will be used. The results obtained show that the characteristic [[features]] of vandalism can be learned from view and edit patterns, and models built in one language can be applied successfully to other languages.

Revision as of 11:33, 15 February 2021

Machine Learning based Detection of Vandalism in Wikipedia Across Languages - scientific work related to Wikipedia quality published in 2016, written by Arsim Susuri, Mentor Hamiti and Agni Dika.

Overview

Applying machine learning algorithms for detecting vandalism in two languages are described in this paper. Vandalism is a major issue in Wikipedia as it accounts for about 1% of edits during 2015. The majority of vandalism is from human editors, whose vandalism can be traced through access and edit logs. In this paper, authors propose using a list of classifiers in one language, and then evaluate them across languages in two datasets: the hourly count of views of each Wikipedia article, and the used edit history of articles. For this purpose, Simple English and Albanian Wikipedia datasets will be used. The results obtained show that the characteristic features of vandalism can be learned from view and edit patterns, and models built in one language can be applied successfully to other languages.