Taxonomy and Clustering in Collaborative Systems: the Case of the On-Line Encyclopedia Wikipedia

From Wikipedia Quality
Revision as of 09:44, 30 May 2019 by Isabella (talk | contribs) (Taxonomy and Clustering in Collaborative Systems: the Case of the On-Line Encyclopedia Wikipedia -- new article)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Taxonomy and Clustering in Collaborative Systems: the Case of the On-Line Encyclopedia Wikipedia - scientific work related to Wikipedia quality published in 2008, written by Andrea Capocci, Francesco Rao and Guido Caldarelli.

Overview

In this paper authors investigate the nature and structure of the relation between imposed classifications and real clustering in a particular case of a scale-free network given by the on-line encyclopedia Wikipedia. Authors find a statistical similarity in the distributions of community sizes both by using the top-down approach of the categories division present in the archive and in the bottom-up procedure of community detection given by an algorithm based on the spectral properties of the graph. Regardless of the statistically similar behaviour, the two methods provide a rather different division of the articles, thereby signaling that the nature and presence of power laws is a general feature for these systems and cannot be used as a benchmark to evaluate the suitability of a clustering method.