Creating Categories for Wikipedia Articles Using Self-Organizing Maps

From Wikipedia Quality
Revision as of 16:36, 9 June 2019 by Evelyn

Creating Categories for Wikipedia Articles Using Self-Organizing Maps is a scientific work related to Wikipedia quality, published in 2011 and written by Julian Szymański.

Overview

The article presents the results of experiments performed on a selected subset of Wikipedia whose articles the author categorized automatically. The author analyzes two methods of text representation: one based on references and one based on word content. From these, the author derives a joint representation, which is used to build groups of similar articles with Kohonen Self-Organizing Maps (SOM). To keep data processing efficient, the dimensionality of the raw data is reduced using Principal Component Analysis (PCA) performed on the similarity matrix. Changing the granularity of the SOM network makes it possible to build hierarchical categories and to find significant relations between articles in a document repository.
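
The pipeline described above can be illustrated with a minimal sketch in Python with NumPy. It is not the paper's implementation. The function names, the weighting parameter `alpha`, the use of cosine similarity, the SOM training schedule, and the toy data are all assumptions made for illustration; the source only states that a joint representation, PCA on a similarity matrix, and SOM networks of varying granularity were used.

```python
import numpy as np

def joint_representation(reference_vectors, word_vectors, alpha=0.5):
    """Concatenate L2-normalised reference and word features.
    `alpha` is a hypothetical weight balancing the two representations."""
    def l2(m):
        norms = np.linalg.norm(m, axis=1, keepdims=True)
        return m / np.where(norms == 0, 1.0, norms)
    return np.hstack([alpha * l2(reference_vectors), (1 - alpha) * l2(word_vectors)])

def pca_on_similarity(features, n_components):
    """Project articles onto the top principal components of their
    article-to-article cosine-similarity matrix (an assumed similarity measure)."""
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.where(norms == 0, 1.0, norms)
    sim = unit @ unit.T                          # n_articles x n_articles
    centered = sim - sim.mean(axis=0)
    eigvals, eigvecs = np.linalg.eigh(np.cov(centered, rowvar=False))
    top = eigvecs[:, np.argsort(eigvals)[::-1][:n_components]]
    return centered @ top

def train_som(data, rows, cols, epochs=200, lr0=0.5, seed=0):
    """Train a small rectangular SOM; returns one weight vector per map node."""
    rng = np.random.default_rng(seed)
    # Initialise node weights from randomly chosen data points.
    weights = data[rng.integers(0, len(data), size=rows * cols)].astype(float)
    grid = np.array([(r, c) for r in range(rows) for c in range(cols)], dtype=float)
    sigma0 = max(rows, cols) / 2.0
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                  # decaying learning rate
        sigma = max(sigma0 * (1.0 - frac), 0.5)  # shrinking neighbourhood radius
        for x in data[rng.permutation(len(data))]:
            bmu = np.argmin(np.linalg.norm(weights - x, axis=1))  # best-matching unit
            dist2 = np.sum((grid - grid[bmu]) ** 2, axis=1)
            h = np.exp(-dist2 / (2.0 * sigma ** 2))
            weights += lr * h[:, None] * (x - weights)
    return weights

def assign(data, weights):
    """Map each article to the index of its best-matching SOM node."""
    return np.array([np.argmin(np.linalg.norm(weights - x, axis=1)) for x in data])

# Toy data (invented): six articles, four reference features, five word features.
reference_vectors = np.array([[1, 1, 0, 0], [1, 0, 0, 0], [1, 1, 0, 0],
                              [0, 0, 1, 1], [0, 0, 0, 1], [0, 0, 1, 1]], dtype=float)
word_vectors = np.array([[2, 1, 0, 0, 0], [1, 1, 0, 0, 0], [2, 0, 1, 0, 0],
                         [0, 0, 0, 2, 1], [0, 0, 1, 1, 1], [0, 0, 0, 1, 2]], dtype=float)

joint = joint_representation(reference_vectors, word_vectors)
reduced = pca_on_similarity(joint, n_components=2)

# Two map granularities: a coarse map for broad categories, a finer one for subcategories.
coarse = assign(reduced, train_som(reduced, rows=1, cols=2))
fine = assign(reduced, train_som(reduced, rows=2, cols=2))
```

Comparing the `coarse` and `fine` assignments shows how articles that share a node on the small map may be split across several nodes on the larger one, which is one way to read the hierarchical categories mentioned in the overview.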