TCSST: Transfer Classification of Short & Sparse Text Using External Data

From Wikipedia Quality
Jump to: navigation, search
TCSST: Transfer Classification of Short & Sparse Text Using External Data
Authors
Guodong Long
Ling Chen
Xingquan Zhu
Chengqi Zhang
Publication date
2012
ISBN
978-145031156-4
DOI
10.1145/2396761.2396859
Links

TCSST: Transfer Classification of Short & Sparse Text Using External Data - scientific work about Wikipedia quality published in 2012, written by Guodong Long, Ling Chen, Xingquan Zhu and Chengqi Zhang.

Overview

Short & sparse text is becoming more prevalent on the web, such as search snippets, micro-blogs and product reviews. Accurately classifying short & sparse text has emerged as an important while challenging task. Existing work has considered utilizing external data (e.g. Wikipedia) to alleviate data sparseness, by appending topics detected from external data as new features. However, training a classifier on features concatenated from different spaces is not easy considering the features have different physical meanings and different significance to the classification task. Moreover, it exacerbates the "curse of dimensionality" problem. In this study, authors propose a transfer classification method, TCSST, to exploit the external data to tackle the data sparsity issue. The transfer classifier will be learned in the original feature space. Considering that the labels of the external data may not be readily available or sufficiently enough, TCSST further exploits the unlabeled external data to aid the transfer classification. Authors develop novel strategies to allow TCSST to iteratively select high quality unlabeled external data to help with the classification. Authors evaluate the performance of TCSST on both benchmark as well as real-world data sets. Their experimental results demonstrate that the proposed method is effective in classifying very short & sparse text, consistently outperforming existing and baseline methods.

Embed

Wikipedia Quality

Long, Guodong; Chen, Ling; Zhu, Xingquan; Zhang, Chengqi. (2012). "[[TCSST: Transfer Classification of Short & Sparse Text Using External Data]]". ACM International Conference Proceeding Series 2012, pp. 764-772. ISBN: 978-145031156-4. DOI: 10.1145/2396761.2396859.

English Wikipedia

{{cite journal |last1=Long |first1=Guodong |last2=Chen |first2=Ling |last3=Zhu |first3=Xingquan |last4=Zhang |first4=Chengqi |title=TCSST: Transfer Classification of Short & Sparse Text Using External Data |date=2012 |isbn=978-145031156-4 |doi=10.1145/2396761.2396859 |url=https://wikipediaquality.com/wiki/TCSST:_Transfer_Classification_of_Short_&_Sparse_Text_Using_External_Data |journal=ACM International Conference Proceeding Series 2012, pp. 764-772}}

HTML

Long, Guodong; Chen, Ling; Zhu, Xingquan; Zhang, Chengqi. (2012). &quot;<a href="https://wikipediaquality.com/wiki/TCSST:_Transfer_Classification_of_Short_&_Sparse_Text_Using_External_Data">TCSST: Transfer Classification of Short & Sparse Text Using External Data</a>&quot;. ACM International Conference Proceeding Series 2012, pp. 764-772. ISBN: 978-145031156-4. DOI: 10.1145/2396761.2396859.