Spectral Clustering Wikipedia Keyword-Based Search Results

From Wikipedia Quality
Jump to: navigation, search


Spectral Clustering Wikipedia Keyword-Based Search Results
Authors
Julian Szymański
Tomasz Dziubich
Publication date
2017
DOI
10.3389/frobt.2016.00078
Links
Original

Spectral Clustering Wikipedia Keyword-Based Search Results - scientific work related to Wikipedia quality published in 2017, written by Julian Szymański and Tomasz Dziubich.

Overview

The paper presents an application of spectral clustering algorithms used for grouping Wikipedia search results. The main contribution of the paper is a representation method for Wikipedia articles that has been based on combination of words and links and it has been used to categorize search result in this repository. Authors evaluate the proposed approach with Primary Component Analysis and show, on the test data, how usage of cosine transformation to create combined representations influence data variability. On sample test datasets authors also show how combined representation improves the data separation that increases overall results of data categorization. The paper reviews the three main spectral clustering methods and authors test their usability for text categorization comparing them using external validation criteria with standard clustering quality measures. Discussion on descriptiveness of evaluation measures and performed experiments on test datasets allows us to select the one spectral clustering algorithm that has been implemented in system. Authors give a brief description of the system architecture that groups on-line Wikipedia articles retrieved with user-specified keywords. Using the system authors show how clustering increases information retrieval effectiveness for Wikipedia data repository.

Embed

Wikipedia Quality

Szymański, Julian; Dziubich, Tomasz. (2017). "[[Spectral Clustering Wikipedia Keyword-Based Search Results]]". Frontiers. DOI: 10.3389/frobt.2016.00078.

English Wikipedia

{{cite journal |last1=Szymański |first1=Julian |last2=Dziubich |first2=Tomasz |title=Spectral Clustering Wikipedia Keyword-Based Search Results |date=2017 |doi=10.3389/frobt.2016.00078 |url=https://wikipediaquality.com/wiki/Spectral_Clustering_Wikipedia_Keyword-Based_Search_Results |journal=Frontiers}}

HTML

Szymański, Julian; Dziubich, Tomasz. (2017). &quot;<a href="https://wikipediaquality.com/wiki/Spectral_Clustering_Wikipedia_Keyword-Based_Search_Results">Spectral Clustering Wikipedia Keyword-Based Search Results</a>&quot;. Frontiers. DOI: 10.3389/frobt.2016.00078.