Self-Organizing Map Representation for Clustering Wikipedia Search Results
Authors | Julian Szymański |
---|---|
Publication date | 2011 |
DOI | 10.1007/978-3-642-20042-7_15 |
Links | Original |
Self-Organizing Map Representation for Clustering Wikipedia Search Results is a scientific work related to Wikipedia quality, published in 2011 and written by Julian Szymański.
Overview
The article presents an approach to the automated organization of textual data. The experiments were performed on a selected subset of Wikipedia. A term-based Vector Space Model representation was used to build groups of similar articles, which were extracted from Kohonen Self-Organizing Maps (SOM) using DBSCAN clustering. To keep the data processing efficient, the authors performed linear dimensionality reduction of the raw data using Principal Component Analysis. The authors introduce a hierarchical organization of the categorized articles by changing the granularity of the SOM network. The categorization method was used to implement a system that clusters the results of keyword-based search in Polish Wikipedia.
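The core idea of mapping term vectors onto a SOM grid can be illustrated with a minimal sketch. This is not the author's implementation: the function names (`vectorize`, `train_som`, `group_by_unit`), the toy documents, and all parameter values are assumptions chosen for illustration, and the Principal Component Analysis and DBSCAN steps mentioned above are omitted to keep the example dependent on the Python standard library only. Articles are simply grouped by their best-matching SOM unit, and the grid size stands in for the granularity that the hierarchy is built from.

```python
import math
import random
from collections import Counter, defaultdict


def vectorize(docs):
    """Build L2-normalized term-frequency vectors (a basic Vector Space Model)."""
    vocab = sorted({t for d in docs for t in d.lower().split()})
    vectors = []
    for d in docs:
        counts = Counter(d.lower().split())
        v = [float(counts[t]) for t in vocab]
        norm = math.sqrt(sum(x * x for x in v)) or 1.0
        vectors.append([x / norm for x in v])
    return vocab, vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def best_matching_unit(weights, v):
    """Return the grid coordinate whose weight vector is closest to v."""
    return min(weights, key=lambda node: sq_dist(weights[node], v))


def train_som(vectors, rows, cols, epochs=50, lr0=0.5, seed=0):
    """Train a rows x cols SOM and return {(row, col): weight_vector}."""
    rng = random.Random(seed)
    dim = len(vectors[0])
    weights = {(r, c): [rng.random() for _ in range(dim)]
               for r in range(rows) for c in range(cols)}
    sigma0 = max(rows, cols) / 2.0
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                  # linearly decaying learning rate
        sigma = max(sigma0 * (1.0 - frac), 0.5)  # shrinking neighbourhood radius
        for v in rng.sample(vectors, len(vectors)):
            br, bc = best_matching_unit(weights, v)
            for (r, c), w in weights.items():
                grid_d2 = (r - br) ** 2 + (c - bc) ** 2
                h = math.exp(-grid_d2 / (2.0 * sigma * sigma))
                for i in range(dim):
                    # Pull each unit toward v, weighted by grid proximity to the BMU.
                    w[i] += lr * h * (v[i] - w[i])
    return weights


def group_by_unit(weights, vectors):
    """Group document indices by the SOM unit they map to."""
    groups = defaultdict(list)
    for idx, v in enumerate(vectors):
        groups[best_matching_unit(weights, v)].append(idx)
    return dict(groups)


# Hypothetical toy "search results"; real input would be Wikipedia article text.
docs = [
    "jaguar cat animal jungle",
    "jaguar animal predator cat",
    "jaguar car engine speed",
    "jaguar car luxury engine",
]
vocab, vectors = vectorize(docs)

# A coarser grid yields fewer, broader groups; a finer grid allows more groups.
coarse = group_by_unit(train_som(vectors, rows=1, cols=2), vectors)
fine = group_by_unit(train_som(vectors, rows=2, cols=2), vectors)
print("coarse:", coarse)
print("fine:", fine)
```

Training the map at several grid sizes and nesting the resulting groups is one simple way to approximate the hierarchical organization described above. In a fuller pipeline, the vectors would first be reduced with PCA, and a density-based clustering such as DBSCAN would be applied to the trained map rather than reading groups directly off the units.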
Embed
Wikipedia Quality
Szymański, Julian. (2011). "[[Self-Organizing Map Representation for Clustering Wikipedia Search Results]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-20042-7_15.
English Wikipedia
{{cite journal |last1=Szymański |first1=Julian |title=Self-Organizing Map Representation for Clustering Wikipedia Search Results |date=2011 |doi=10.1007/978-3-642-20042-7_15 |url=https://wikipediaquality.com/wiki/Self-Organizing_Map_Representation_for_Clustering_Wikipedia_Search_Results |journal=Springer, Berlin, Heidelberg}}
HTML
Szymański, Julian. (2011). "<a href="https://wikipediaquality.com/wiki/Self-Organizing_Map_Representation_for_Clustering_Wikipedia_Search_Results">Self-Organizing Map Representation for Clustering Wikipedia Search Results</a>". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-20042-7_15.