Difference between revisions of "A New Approach to Detecting Content Anomalies in Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(Wikilinks)
(+ Infobox work)
Line 1: Line 1:
 +
{{Infobox work
 +
| title = A New Approach to Detecting Content Anomalies in Wikipedia
 +
| date = 2013
 +
| authors = [[Duygu Sinanc]]<br />[[Uraz Yavanoglu]]
 +
| doi = 10.1109/ICMLA.2013.137
 +
| link = http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=6786122
 +
}}
 
'''A New Approach to Detecting Content Anomalies in Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2013, written by [[Duygu Sinanc]] and [[Uraz Yavanoglu]].
 
'''A New Approach to Detecting Content Anomalies in Wikipedia''' - scientific work related to [[Wikipedia quality]] published in 2013, written by [[Duygu Sinanc]] and [[Uraz Yavanoglu]].
  
 
== Overview ==
 
== Overview ==
 
The rapid growth of the web has caused to availability of data effective if its content is well organized. Despite the fact that [[Wikipedia]] is the biggest encyclopedia on the web, its quality is suspect due to its Open Editing Schemas (OES). In this study, zoology and botany pages are selected in [[English Wikipedia]] and their html contents are converted to text then Artificial Neural Network (ANN) is used for classification to prevent disinformation or misinformation. After the train phase, some irrelevant words added in the content about politics or terrorism in proportion to the size of the text. By the time unsuitable content is added in a page until the moderators' intervention, the proposed system realized the error via wrong categorization. The results have shown that, when words number 2% of the content is added anomaly rate begins to cross the 50% border.
 
The rapid growth of the web has caused to availability of data effective if its content is well organized. Despite the fact that [[Wikipedia]] is the biggest encyclopedia on the web, its quality is suspect due to its Open Editing Schemas (OES). In this study, zoology and botany pages are selected in [[English Wikipedia]] and their html contents are converted to text then Artificial Neural Network (ANN) is used for classification to prevent disinformation or misinformation. After the train phase, some irrelevant words added in the content about politics or terrorism in proportion to the size of the text. By the time unsuitable content is added in a page until the moderators' intervention, the proposed system realized the error via wrong categorization. The results have shown that, when words number 2% of the content is added anomaly rate begins to cross the 50% border.

Revision as of 10:44, 11 July 2019


A New Approach to Detecting Content Anomalies in Wikipedia
Authors
Duygu Sinanc
Uraz Yavanoglu
Publication date
2013
DOI
10.1109/ICMLA.2013.137
Links
Original

A New Approach to Detecting Content Anomalies in Wikipedia - scientific work related to Wikipedia quality published in 2013, written by Duygu Sinanc and Uraz Yavanoglu.

Overview

The rapid growth of the web has caused to availability of data effective if its content is well organized. Despite the fact that Wikipedia is the biggest encyclopedia on the web, its quality is suspect due to its Open Editing Schemas (OES). In this study, zoology and botany pages are selected in English Wikipedia and their html contents are converted to text then Artificial Neural Network (ANN) is used for classification to prevent disinformation or misinformation. After the train phase, some irrelevant words added in the content about politics or terrorism in proportion to the size of the text. By the time unsuitable content is added in a page until the moderators' intervention, the proposed system realized the error via wrong categorization. The results have shown that, when words number 2% of the content is added anomaly rate begins to cross the 50% border.