Difference between revisions of "Exploiting Negative Categories and Wikipedia Structures for Document Classification"

(New study: Exploiting Negative Categories and Wikipedia Structures for Document Classification)
 
(wikilinks)
Line 1: Line 1:
'''Exploiting Negative Categories and Wikipedia Structures for Document Classification''' - scientific work related to Wikipedia quality published in 2009, written by Meenakshi Sundaram Murugeshan, K. Lakshmi and Saswati Mukherjee.
+
'''Exploiting Negative Categories and Wikipedia Structures for Document Classification''' - scientific work related to [[Wikipedia quality]] published in 2009, written by [[Meenakshi Sundaram Murugeshan]], [[K. Lakshmi]] and [[Saswati Mukherjee]].
  
 
== Overview ==
 
== Overview ==
This paper explores the effect of profile based method for classification of Wikipedia XML documents. Authors approach builds two profiles, exploiting the whole content, Initial Descriptions and links in the Wikipedia documents. For building profiles authors use the negative category information which has shown to perform well for classifying unstructured texts. The performance of Cosine and Fractional Similarity metrics is also compared. The use of two classifiers and their weighted average improves the classification performance.
+
This paper explores the effect of profile based method for classification of [[Wikipedia]] XML documents. Authors approach builds two profiles, exploiting the whole content, Initial Descriptions and links in the Wikipedia documents. For building profiles authors use the negative category information which has shown to perform well for classifying unstructured texts. The performance of Cosine and Fractional Similarity metrics is also compared. The use of two classifiers and their weighted average improves the classification performance.

Revision as of 08:32, 6 June 2019

Exploiting Negative Categories and Wikipedia Structures for Document Classification is a scientific work related to Wikipedia quality, published in 2009 and written by Meenakshi Sundaram Murugeshan, K. Lakshmi and Saswati Mukherjee.

Overview

This paper explores the effect of a profile-based method for classifying Wikipedia XML documents. The authors' approach builds two profiles that exploit the whole content, the Initial Descriptions and the links in Wikipedia documents. To build the profiles, the authors use negative category information, which has been shown to perform well for classifying unstructured text. The paper also compares the performance of the Cosine and Fractional Similarity metrics. Using two classifiers and taking a weighted average of their outputs improves classification performance.
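
The overview does not give the paper's formulas, so the following Python sketch is only an illustration of the general idea, not the authors' implementation. It assumes that a category profile is the average term count of the category's documents minus a weighted average term count of all other categories (one possible reading of "negative category information"), that one classifier scores the whole content and a second scores a shorter view such as the initial description, and that the two cosine scores are combined with a weight `alpha`. The function names, `negative_weight`, `alpha` and the toy data are all hypothetical. Fractional Similarity is not defined in the overview and is therefore not sketched.

```python
import math
from collections import Counter


def term_counts(text):
    """Count terms after lowercasing and splitting on whitespace (a simplification)."""
    return Counter(text.lower().split())


def build_profile(docs_by_category, category, negative_weight=0.5):
    """Build a term-weight profile for one category.

    Assumed weighting: average count inside the category minus
    negative_weight times the average count in all other categories.
    Terms whose weight is not positive are dropped.
    """
    positive, negative = Counter(), Counter()
    n_pos = n_neg = 0
    for cat, docs in docs_by_category.items():
        for doc in docs:
            if cat == category:
                positive.update(term_counts(doc))
                n_pos += 1
            else:
                negative.update(term_counts(doc))
                n_neg += 1
    profile = {}
    for term, count in positive.items():
        weight = count / n_pos - negative_weight * negative[term] / max(n_neg, 1)
        if weight > 0:
            profile[term] = weight
    return profile


def cosine(a, b):
    """Cosine similarity between two sparse term-weight mappings."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify(content, summary, content_profiles, summary_profiles, alpha=0.6):
    """Return the category with the highest weighted average of the two cosine scores."""
    content_vec = term_counts(content)
    summary_vec = term_counts(summary)
    scores = {
        cat: alpha * cosine(content_vec, content_profiles[cat])
        + (1 - alpha) * cosine(summary_vec, summary_profiles[cat])
        for cat in content_profiles
    }
    return max(scores, key=scores.get)


# Toy training data (hypothetical): full content and a short description per document.
content_train = {
    "sports": ["football match goal team", "team wins match"],
    "science": ["physics experiment energy", "energy particle experiment"],
}
summary_train = {
    "sports": ["football team", "match report"],
    "science": ["physics experiment", "particle energy"],
}

content_profiles = {c: build_profile(content_train, c) for c in content_train}
summary_profiles = {c: build_profile(summary_train, c) for c in summary_train}

predicted = classify(
    "the team scored a goal in the match",
    "football match",
    content_profiles,
    summary_profiles,
)
print(predicted)
```

In this sketch `alpha` controls how much the whole-content classifier counts relative to the description-based one; setting it to 1.0 or 0.0 reduces the combination to a single classifier, which is one way to check whether the weighted average helps on a given data set.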