Difference between revisions of "Document Topic Extraction based on Wikipedia Category"
(Adding infobox) |
(cat.) |
||
Line 10: | Line 10: | ||
== Overview == | == Overview == | ||
Document Topic Extraction aims at using several key phrases to describe the topics of documents. It can be applied in web document categorization and tagging, document clusters topic description and [[information retrieval]] tasks. In this paper, authors propose a [[Wikipedia]] category-based document topic extraction method. Document is mapped to a set of [[Wikipedia categories]] and is represented as graph structure in order to conserve the relationship between Wikipedia [[categories]]. Then, document topic can be extracted by clustering the related Wikipedia categories in the document collection. Experiment in real data shows Wikipedia category-based document topic extraction method achieves the better result than latent topic modeling method, such as LDA. | Document Topic Extraction aims at using several key phrases to describe the topics of documents. It can be applied in web document categorization and tagging, document clusters topic description and [[information retrieval]] tasks. In this paper, authors propose a [[Wikipedia]] category-based document topic extraction method. Document is mapped to a set of [[Wikipedia categories]] and is represented as graph structure in order to conserve the relationship between Wikipedia [[categories]]. Then, document topic can be extracted by clustering the related Wikipedia categories in the document collection. Experiment in real data shows Wikipedia category-based document topic extraction method achieves the better result than latent topic modeling method, such as LDA. | ||
+ | |||
+ | == Embed == | ||
+ | === Wikipedia Quality === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | Yun, Jiali; Jing, Liping; Yu, Jian; Huang, Houkuan; Zhang, Ying. (2011). "[[Document Topic Extraction based on Wikipedia Category]]".DOI: 10.1109/CSO.2011.119. | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | === English Wikipedia === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | {{cite journal |last1=Yun |first1=Jiali |last2=Jing |first2=Liping |last3=Yu |first3=Jian |last4=Huang |first4=Houkuan |last5=Zhang |first5=Ying |title=Document Topic Extraction based on Wikipedia Category |date=2011 |doi=10.1109/CSO.2011.119 |url=https://wikipediaquality.com/wiki/Document_Topic_Extraction_based_on_Wikipedia_Category}} | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | === HTML === | ||
+ | <code> | ||
+ | <nowiki> | ||
+ | Yun, Jiali; Jing, Liping; Yu, Jian; Huang, Houkuan; Zhang, Ying. (2011). &quot;<a href="https://wikipediaquality.com/wiki/Document_Topic_Extraction_based_on_Wikipedia_Category">Document Topic Extraction based on Wikipedia Category</a>&quot;.DOI: 10.1109/CSO.2011.119. | ||
+ | </nowiki> | ||
+ | </code> | ||
+ | |||
+ | |||
+ | |||
+ | [[Category:Scientific works]] |
Latest revision as of 07:07, 19 February 2021
Authors | Jiali Yun Liping Jing Jian Yu Houkuan Huang Ying Zhang |
---|---|
Publication date | 2011 |
DOI | 10.1109/CSO.2011.119 |
Links | Original |
Document Topic Extraction based on Wikipedia Category - scientific work related to Wikipedia quality published in 2011, written by Jiali Yun, Liping Jing, Jian Yu, Houkuan Huang and Ying Zhang.
Overview
Document Topic Extraction aims at using several key phrases to describe the topics of documents. It can be applied in web document categorization and tagging, document clusters topic description and information retrieval tasks. In this paper, authors propose a Wikipedia category-based document topic extraction method. Document is mapped to a set of Wikipedia categories and is represented as graph structure in order to conserve the relationship between Wikipedia categories. Then, document topic can be extracted by clustering the related Wikipedia categories in the document collection. Experiment in real data shows Wikipedia category-based document topic extraction method achieves the better result than latent topic modeling method, such as LDA.
Embed
Wikipedia Quality
Yun, Jiali; Jing, Liping; Yu, Jian; Huang, Houkuan; Zhang, Ying. (2011). "[[Document Topic Extraction based on Wikipedia Category]]".DOI: 10.1109/CSO.2011.119.
English Wikipedia
{{cite journal |last1=Yun |first1=Jiali |last2=Jing |first2=Liping |last3=Yu |first3=Jian |last4=Huang |first4=Houkuan |last5=Zhang |first5=Ying |title=Document Topic Extraction based on Wikipedia Category |date=2011 |doi=10.1109/CSO.2011.119 |url=https://wikipediaquality.com/wiki/Document_Topic_Extraction_based_on_Wikipedia_Category}}
HTML
Yun, Jiali; Jing, Liping; Yu, Jian; Huang, Houkuan; Zhang, Ying. (2011). "<a href="https://wikipediaquality.com/wiki/Document_Topic_Extraction_based_on_Wikipedia_Category">Document Topic Extraction based on Wikipedia Category</a>".DOI: 10.1109/CSO.2011.119.