Difference between revisions of "Using Links to Classify Wikipedia Pages"

From Wikipedia Quality
Jump to: navigation, search
(Overview: Using Links to Classify Wikipedia Pages)
 
(wikilinks)
Line 1: Line 1:
'''Using Links to Classify Wikipedia Pages''' - scientific work related to Wikipedia quality published in 2009, written by Rianne Kaptein and Jaap Kamps.
+
'''Using Links to Classify Wikipedia Pages''' - scientific work related to [[Wikipedia quality]] published in 2009, written by [[Rianne Kaptein]] and [[Jaap Kamps]].
  
 
== Overview ==
 
== Overview ==
 
This paper contains a description of experiments for the 2008 INEX XML-mining track. Authors goal for the XML-mining track is to explore whether authors can use link information to improve classification accuracy. Authors approach is to propagate category probabilities over linked pages. Authors find that using link information leads to marginal improvements over a baseline that uses a Naive Bayes model. For the initially misclassified pages, link information is either not available or contains too much noise.
 
This paper contains a description of experiments for the 2008 INEX XML-mining track. Authors goal for the XML-mining track is to explore whether authors can use link information to improve classification accuracy. Authors approach is to propagate category probabilities over linked pages. Authors find that using link information leads to marginal improvements over a baseline that uses a Naive Bayes model. For the initially misclassified pages, link information is either not available or contains too much noise.

Revision as of 23:36, 18 July 2019

Using Links to Classify Wikipedia Pages - scientific work related to Wikipedia quality published in 2009, written by Rianne Kaptein and Jaap Kamps.

Overview

This paper contains a description of experiments for the 2008 INEX XML-mining track. Authors goal for the XML-mining track is to explore whether authors can use link information to improve classification accuracy. Authors approach is to propagate category probabilities over linked pages. Authors find that using link information leads to marginal improvements over a baseline that uses a Naive Bayes model. For the initially misclassified pages, link information is either not available or contains too much noise.