Knowledge Supervised Text Classification with no Labeled Documents

From Wikipedia Quality
Jump to: navigation, search
Knowledge Supervised Text Classification with no Labeled Documents
Authors
Congle Zhang
Guirong Xue
Yong Yu
Publication date
2008
ISSN
03029743
ISBN
354089196X;978-354089196-3
DOI
10.1007/978-3-540-89197-0_47
Links

Knowledge Supervised Text Classification with no Labeled Documents - scientific work about Wikipedia quality published in 2008, written by Congle Zhang, Guirong Xue and Yong Yu.

Overview

In traditional text classification approaches, the semantic meanings of the classes are described by the labeled documents. Since labeling documents is often time consuming and expensive, it is a promising idea that asking users to provide some keywords to depict the classes, instead of labeling any documents. However, short pieces of keywords may not contain enough information and therefore may lead to unreliable classifier. Fortunately, there are large amount of public data easily available in web directories, such as ODP, Wikipedia, etc. Authors are interested in exploring the enormous crowd intelligence contained in such public data to enhance text classification. In this paper, authors propose a novel text classification framework called "Knowledge Supervised Learning"(KSL), which utilizes the knowledge in keywords and the crowd intelligence to learn the classifier without any labeled documents. Authors design a two-stage risk minimization (TSRM) approach for the KSL problem. It can optimize the expected prediction risk and build the high quality classifier. Empirical results verify their claim: their algorithm can achieve above 0.9 on Micro-F1 on average, which is much better than baselines and even comparable against SVM classifier supervised by labeled documents.

Embed

Wikipedia Quality

Zhang, Congle; Xue, Guirong; Yu, Yong. (2008). "[[Knowledge Supervised Text Classification with no Labeled Documents]]". Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Volume 5351 LNAI, 2008, pp. 509-520. ISBN: 354089196X;978-354089196-3. ISSN: 03029743. DOI: 10.1007/978-3-540-89197-0_47.

English Wikipedia

{{cite journal |last1=Zhang |first1=Congle |last2=Xue |first2=Guirong |last3=Yu |first3=Yong |title=Knowledge Supervised Text Classification with no Labeled Documents |date=2008 |isbn=354089196X;978-354089196-3 |issn=03029743 |doi=10.1007/978-3-540-89197-0_47 |url=https://wikipediaquality.com/wiki/Knowledge_Supervised_Text_Classification_with_no_Labeled_Documents |journal=Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Volume 5351 LNAI, 2008, pp. 509-520}}

HTML

Zhang, Congle; Xue, Guirong; Yu, Yong. (2008). &quot;<a href="https://wikipediaquality.com/wiki/Knowledge_Supervised_Text_Classification_with_no_Labeled_Documents">Knowledge Supervised Text Classification with no Labeled Documents</a>&quot;. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Volume 5351 LNAI, 2008, pp. 509-520. ISBN: 354089196X;978-354089196-3. ISSN: 03029743. DOI: 10.1007/978-3-540-89197-0_47.