Cleansing Wikipedia Categories Using Centrality


Cleansing Wikipedia Categories Using Centrality is a scientific work related to Wikipedia quality, published in 2016 and written by Paolo Boldi and Corrado Monti.


The authors propose a novel, general technique for pruning and cleansing the Wikipedia category hierarchy, with a tunable level of aggregation. Their approach is endogenous: it does not use any information from Wikipedia articles and relies solely on the user-generated (and noisy) Wikipedia category folksonomy itself. The authors show how the proposed techniques can help reduce the level of noise in the hierarchy and discuss how alternative centrality measures can affect the result differently.
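
The general idea of centrality-based pruning can be illustrated with a minimal sketch. It is not the authors' implementation: the summary above does not say which centrality measures or which pruning rule the paper uses, so this example assumes PageRank as one possible centrality and assumes that pruning means keeping the `keep` highest-scoring categories. The function names, the `keep` parameter (standing in for the tunable level of aggregation) and the toy category graph are all hypothetical. Only the category graph is used as input, in line with the endogenous approach.

```python
from collections import defaultdict

# Toy category graph (hypothetical data): each edge points from a
# subcategory to its parent category.
EDGES = [
    ("Algebra", "Mathematics"),
    ("Geometry", "Mathematics"),
    ("Topology", "Mathematics"),
    ("Mathematics", "Science"),
    ("Physics", "Science"),
    ("Optics", "Physics"),
]


def pagerank(edges, damping=0.85, iterations=50):
    """Power-iteration PageRank over a directed edge list.

    PageRank is an assumed choice of centrality for this sketch.
    """
    nodes = sorted({node for edge in edges for node in edge})
    out_links = defaultdict(list)
    for src, dst in edges:
        out_links[src].append(dst)

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Score held by nodes without outgoing links is spread uniformly.
        dangling = sum(rank[node] for node in nodes if not out_links[node])
        new_rank = {
            node: (1.0 - damping) / n + damping * dangling / n
            for node in nodes
        }
        for src in nodes:
            for dst in out_links[src]:
                new_rank[dst] += damping * rank[src] / len(out_links[src])
        rank = new_rank
    return rank


def prune_categories(edges, keep):
    """Keep the `keep` most central categories and the edges among them.

    `keep` plays the role of the aggregation level (an assumption of
    this sketch, not a parameter taken from the paper).
    """
    scores = pagerank(edges)
    # Sort by descending score; break ties by name for determinism.
    ranked = sorted(scores, key=lambda node: (-scores[node], node))
    kept = set(ranked[:keep])
    kept_edges = [(s, d) for s, d in edges if s in kept and d in kept]
    return kept, kept_edges


if __name__ == "__main__":
    kept, kept_edges = prune_categories(EDGES, keep=3)
    print(sorted(kept))
    print(kept_edges)
```

Lowering `keep` yields a coarser hierarchy, raising it a finer one. Replacing `pagerank` with a different scoring function, such as in-degree, is one way to observe how the choice of centrality measure changes which categories survive.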