Frequent Itemset based Hierarchical Document Clustering Using Wikipedia as External Knowledge

From Wikipedia Quality
Jump to: navigation, search


Frequent Itemset based Hierarchical Document Clustering Using Wikipedia as External Knowledge
Authors
G. V. R. Kiran
Ravi Shankar
Vikram Pudi
Publication date
2010
DOI
10.1007/978-3-642-15390-7_2
Links
Original

Frequent Itemset based Hierarchical Document Clustering Using Wikipedia as External Knowledge - scientific work related to Wikipedia quality published in 2010, written by G. V. R. Kiran, Ravi Shankar and Vikram Pudi.

Overview

High dimensionality is a major challenge in document clustering. Some of the recent algorithms address this problem by using frequent itemsets for clustering. But, most of these algorithms neglect the semantic relationship between the words. On the other hand there are algorithms that take care of the semantic relations between the words by making use of external knowledge contained in Word Net, Mesh, Wikipedia, etc but do not handle the high dimensionality. In this paper authors present an efficient solution that addresses both these problems. Authors propose a hierarchical clustering algorithm using closed frequent itemsets that use Wikipedia as an external knowledge to enhance the document representation. Authors evaluate methods based on F-Score on standard datasets and show results to be better than existing approaches.

Embed

Wikipedia Quality

Kiran, G. V. R.; Shankar, Ravi; Pudi, Vikram. (2010). "[[Frequent Itemset based Hierarchical Document Clustering Using Wikipedia as External Knowledge]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-15390-7_2.

English Wikipedia

{{cite journal |last1=Kiran |first1=G. V. R. |last2=Shankar |first2=Ravi |last3=Pudi |first3=Vikram |title=Frequent Itemset based Hierarchical Document Clustering Using Wikipedia as External Knowledge |date=2010 |doi=10.1007/978-3-642-15390-7_2 |url=https://wikipediaquality.com/wiki/Frequent_Itemset_based_Hierarchical_Document_Clustering_Using_Wikipedia_as_External_Knowledge |journal=Springer, Berlin, Heidelberg}}

HTML

Kiran, G. V. R.; Shankar, Ravi; Pudi, Vikram. (2010). &quot;<a href="https://wikipediaquality.com/wiki/Frequent_Itemset_based_Hierarchical_Document_Clustering_Using_Wikipedia_as_External_Knowledge">Frequent Itemset based Hierarchical Document Clustering Using Wikipedia as External Knowledge</a>&quot;. Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-15390-7_2.