Query Directed Web Page Clustering Using Suffix Tree and Wikipedia Links
Authors | John Park Xiaoying Gao Peter Andreae |
---|---|
Publication date | 2012 |
DOI | 10.1007/978-3-642-35527-1_8 |
Links | Original |
Query Directed Web Page Clustering Using Suffix Tree and Wikipedia Links - scientific work related to Wikipedia quality published in 2012, written by John Park, Xiaoying Gao and Peter Andreae.
Overview
Recent research on Web page clustering has shown that the user query plays a critical role in guiding the categorisation of web search results. This paper combines Query Directed Clustering algorithm (QDC) with another existing algorithm, Suffix Tree Clustering (STC), to identify common phrases shared by documents for base cluster identification. One main contribution is the utilising of a new Wikipedia link based measure to estimate the semantic relatedness between query and the base cluster labels, which has shown great promise in identifying the good base clusters. Authors experimental results show that the performance is improved by utilising suffix trees and Wikipedia links.
Embed
Wikipedia Quality
Park, John; Gao, Xiaoying; Andreae, Peter. (2012). "[[Query Directed Web Page Clustering Using Suffix Tree and Wikipedia Links]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-35527-1_8.
English Wikipedia
{{cite journal |last1=Park |first1=John |last2=Gao |first2=Xiaoying |last3=Andreae |first3=Peter |title=Query Directed Web Page Clustering Using Suffix Tree and Wikipedia Links |date=2012 |doi=10.1007/978-3-642-35527-1_8 |url=https://wikipediaquality.com/wiki/Query_Directed_Web_Page_Clustering_Using_Suffix_Tree_and_Wikipedia_Links |journal=Springer, Berlin, Heidelberg}}
HTML
Park, John; Gao, Xiaoying; Andreae, Peter. (2012). "<a href="https://wikipediaquality.com/wiki/Query_Directed_Web_Page_Clustering_Using_Suffix_Tree_and_Wikipedia_Links">Query Directed Web Page Clustering Using Suffix Tree and Wikipedia Links</a>". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-35527-1_8.