Difference between revisions of "Building Semantic Kernels for Text Classification Using Wikipedia"

From Wikipedia Quality
Jump to: navigation, search
(Embed for English Wikipedia, HTML)
(cats.)
 
Line 33: Line 33:
 
</nowiki>
 
</nowiki>
 
</code>
 
</code>
 +
 +
 +
 +
[[Category:Scientific works]]

Latest revision as of 12:56, 19 October 2019


Building Semantic Kernels for Text Classification Using Wikipedia
Authors
Pu Wang
Carlotta Domeniconi
Publication date
2008
DOI
10.1145/1401890.1401976
Links
Original Preprint

Building Semantic Kernels for Text Classification Using Wikipedia - scientific work related to Wikipedia quality published in 2008, written by Pu Wang and Carlotta Domeniconi.

Overview

Document classification presents difficult challenges due to the sparsity and the high dimensionality of text data, and to the complex semantics of the natural language. The traditional document representation is a word-based vector (Bag of Words, or BOW), where each dimension is associated with a term of the dictionary containing all the words that appear in the corpus. Although simple and commonly used, this representation has several limitations. It is essential to embed semantic information and conceptual patterns in order to enhance the prediction capabilities of classification algorithms. In this paper, authors overcome the shortages of the BOW approach by embedding background knowledge derived from Wikipedia into a semantic kernel, which is then used to enrich the representation of documents. Authors empirical evaluation with real data sets demonstrates that approach successfully achieves improved classification accuracy with respect to the BOW technique, and to other recently developed methods.

Embed

Wikipedia Quality

Wang, Pu; Domeniconi, Carlotta. (2008). "[[Building Semantic Kernels for Text Classification Using Wikipedia]]".DOI: 10.1145/1401890.1401976.

English Wikipedia

{{cite journal |last1=Wang |first1=Pu |last2=Domeniconi |first2=Carlotta |title=Building Semantic Kernels for Text Classification Using Wikipedia |date=2008 |doi=10.1145/1401890.1401976 |url=https://wikipediaquality.com/wiki/Building_Semantic_Kernels_for_Text_Classification_Using_Wikipedia}}

HTML

Wang, Pu; Domeniconi, Carlotta. (2008). &quot;<a href="https://wikipediaquality.com/wiki/Building_Semantic_Kernels_for_Text_Classification_Using_Wikipedia">Building Semantic Kernels for Text Classification Using Wikipedia</a>&quot;.DOI: 10.1145/1401890.1401976.