Disinformation on the Web: Impact, Characteristics, and Detection of Wikipedia Hoaxes



Authors: Srijan Kumar, Robert West, Jure Leskovec
Publication date: 2016
DOI: 10.1145/2872427.2883085

Disinformation on the Web: Impact, Characteristics, and Detection of Wikipedia Hoaxes is a scientific work related to Wikipedia quality, published in 2016 and written by Srijan Kumar, Robert West, and Jure Leskovec.

Overview

Wikipedia is a major source of information for many people. However, false information on Wikipedia raises concerns about its credibility. One way in which false information may be presented on Wikipedia is in the form of hoax articles, i.e., articles containing fabricated facts about nonexistent entities or events. In this paper, the authors study false information on Wikipedia by focusing on the hoax articles that have been created throughout its history. The authors make several contributions. First, they assess the real-world impact of hoax articles by measuring how long they survive before being debunked, how many pageviews they receive, and how heavily they are referred to by documents on the Web. They find that, while most hoaxes are detected quickly and have little impact on Wikipedia, a small number of hoaxes survive for a long time and are well cited across the Web. Second, the authors characterize the nature of successful hoaxes by comparing them to legitimate articles and to failed hoaxes that were discovered shortly after being created. They find characteristic differences in terms of article structure and content, embeddedness into the rest of Wikipedia, and features of the editor who created the hoax. Third, the authors successfully apply these findings to address a series of classification tasks, most notably to determine whether a given article is a hoax. Finally, the authors describe and evaluate a task in which humans distinguish hoaxes from non-hoaxes. They find that humans are not particularly good at the task and that an automated classifier outperforms them by a wide margin.
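The first impact measure mentioned above, how long a hoax survives before being debunked, can be illustrated with a minimal Python sketch. It is not the authors' code: the records, the names `HOAXES`, `survival_time`, and `fraction_surviving`, and the one-day threshold are all hypothetical and chosen only for illustration.

```python
from datetime import datetime, timedelta

# Hypothetical records: (title, created, flagged_as_hoax).
# These are made-up values for illustration, not data from the paper.
HOAXES = [
    ("Example hoax A", datetime(2010, 1, 1, 12, 0), datetime(2010, 1, 1, 12, 30)),
    ("Example hoax B", datetime(2011, 5, 3, 9, 0), datetime(2011, 5, 4, 8, 0)),
    ("Example hoax C", datetime(2008, 2, 10, 0, 0), datetime(2009, 6, 1, 0, 0)),
]


def survival_time(created, flagged):
    """Return how long an article existed before it was flagged as a hoax."""
    if flagged < created:
        raise ValueError("flagged time precedes creation time")
    return flagged - created


def fraction_surviving(records, threshold):
    """Return the share of hoaxes whose survival time exceeds the threshold."""
    if not records:
        return 0.0
    long_lived = sum(
        1 for _, created, flagged in records
        if survival_time(created, flagged) > threshold
    )
    return long_lived / len(records)


if __name__ == "__main__":
    for title, created, flagged in HOAXES:
        print(title, survival_time(created, flagged))
    # Share of the hypothetical hoaxes that survived longer than one day.
    print(fraction_surviving(HOAXES, timedelta(days=1)))
```

In this toy data set only one of the three articles outlives the one-day threshold, mirroring the qualitative pattern described above in which most hoaxes are caught quickly and a few persist.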

Embed

Wikipedia Quality

Kumar, Srijan; West, Robert; Leskovec, Jure. (2016). "[[Disinformation on the Web: Impact, Characteristics, and Detection of Wikipedia Hoaxes]]". International World Wide Web Conferences Steering Committee. DOI: 10.1145/2872427.2883085.

English Wikipedia

{{cite journal |last1=Kumar |first1=Srijan |last2=West |first2=Robert |last3=Leskovec |first3=Jure |title=Disinformation on the Web: Impact, Characteristics, and Detection of Wikipedia Hoaxes |date=2016 |doi=10.1145/2872427.2883085 |url=https://wikipediaquality.com/wiki/Disinformation_on_the_Web:_Impact,_Characteristics,_and_Detection_of_Wikipedia_Hoaxes |journal=International World Wide Web Conferences Steering Committee}}

HTML

Kumar, Srijan; West, Robert; Leskovec, Jure. (2016). &quot;<a href="https://wikipediaquality.com/wiki/Disinformation_on_the_Web:_Impact,_Characteristics,_and_Detection_of_Wikipedia_Hoaxes">Disinformation on the Web: Impact, Characteristics, and Detection of Wikipedia Hoaxes</a>&quot;. International World Wide Web Conferences Steering Committee. DOI: 10.1145/2872427.2883085.