Towards Information Quality Assurance in Spanish: Wikipedia

From Wikipedia Quality
Revision as of 09:56, 19 July 2019 by Liliana (Adding wikilinks)

Towards Information Quality Assurance in Spanish: Wikipedia is a scientific work related to Wikipedia quality, published in 2017 and written by Edgardo Ferretti, Matías Soria, Sebastián Pérez Casseignau, Lian Pohn, Guido Urquiza, Sergio Alejandro Gómez and Marcelo Luis Errecalde.

Overview

Featured Articles (FA) are considered to be the best articles that Wikipedia has to offer, and in recent years researchers have found it interesting to analyze whether and how they can be distinguished from “ordinary” articles. Likewise, identifying which issues have to be enhanced or fixed in ordinary articles in order to improve their quality is a recent key research trend. Most of the approaches developed to address these information quality problems have been proposed for the English Wikipedia. However, few efforts have been made for the Spanish Wikipedia, despite Spanish being one of the most widely spoken languages in the world by number of native speakers. In this respect, the authors present a breakdown of the Spanish Wikipedia’s quality flaw structure. In addition, the authors carry out studies with three different corpora to automatically assess information quality in the Spanish Wikipedia, where FA identification is evaluated as a binary classification task. The authors' evaluation in a unified setting makes it possible to compare the performance their approach achieves on the Spanish version with its performance on the English version. The best results obtained show that FA identification in Spanish can be performed with an F1 score of 0.88, using a document model consisting of only twenty-six features and a Support Vector Machine as the classification algorithm.
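
Since FA identification is evaluated as a binary classification task and reported through the F1 score, the following minimal sketch shows how such a score can be computed from predicted and true labels. It is an illustration only, not the authors' evaluation code: the function name `f1_score`, the label encoding (1 for featured, 0 for ordinary) and the toy labels are assumptions made for this example and do not come from the paper.

```python
def f1_score(y_true, y_pred, positive=1):
    """Return the F1 score of the positive class for two label sequences."""
    pairs = list(zip(y_true, y_pred))
    # Count the confusion-matrix cells that F1 depends on.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    # F1 is the harmonic mean of precision and recall.
    return 2 * precision * recall / (precision + recall)


# Hypothetical toy labels (not data from the paper): 1 = featured, 0 = ordinary.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 1, 0, 0, 0]

score = f1_score(y_true, y_pred)
print(f"F1 = {score:.2f}")
```

In this toy case three featured articles are found, one is missed and one ordinary article is wrongly flagged, so precision and recall are both 0.75 and so is F1. Because F1 is a harmonic mean, it drops sharply when either precision or recall is low, which makes it a stricter summary than plain accuracy for this kind of task.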