Difference between revisions of "Multi-Label Wikipedia Classification with Textual and Link Features"

From Wikipedia Quality
(Wikilinks)
(+ Infobox work)
{{Infobox work
| title = Multi-Label Wikipedia Classification with Textual and Link Features
| date = 2009
| authors = [[Boris Chidlovskii]]
| doi = 10.1007/978-3-642-14556-8_38
| link = https://dl.acm.org/citation.cfm?id=1881065.1881112
}}
 
'''Multi-Label Wikipedia Classification with Textual and Link Features''' is a scientific work related to [[Wikipedia quality]], published in 2009 and written by [[Boris Chidlovskii]].
  
 
== Overview ==
 
The author addresses the problem of categorizing a large set of linked documents in which both content and structure matter, specifically the [[Wikipedia]] collection proposed at the INEX 2009 XML Mining challenge. The network of collection pages is analyzed and turned into [[features]] for classification. Content-based and link-based features of pages are then combined to train an accurate categorizer for unlabelled pages. In the multi-label setting, the author revisits a number of existing techniques and tests some that show good scalability. Evaluation results are reported for a variety of learning methods and techniques on the training set of the Wikipedia corpus.
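
The idea of combining content-based and link-based features in a multi-label setting can be illustrated with a minimal sketch. This is not the method from the paper, whose exact features and learners are not given in this summary. It assumes a bag-of-words representation for text, one indicator feature per outgoing link, and a simple perceptron per label (binary relevance) as a stand-in learner. All names (<code>featurize</code>, <code>train_binary_relevance</code>, <code>predict</code>, <code>TOY_PAGES</code>) and the toy data are hypothetical.

```python
from collections import Counter

# Hypothetical toy corpus: each page has text, outgoing links, and a label set.
TOY_PAGES = [
    {"text": "football match goal", "links": ["Sport"], "labels": {"sports"}},
    {"text": "election parliament vote", "links": ["Government"], "labels": {"politics"}},
    {"text": "olympic boycott vote", "links": ["Sport", "Government"],
     "labels": {"sports", "politics"}},
]


def featurize(page):
    """Merge content-based and link-based features into one sparse vector."""
    feats = Counter()
    for token in page["text"].lower().split():
        feats["word:" + token] += 1      # content-based feature
    for target in page["links"]:
        feats["link:" + target] += 1     # link-based feature
    return feats


def score(weights, bias, feats):
    """Linear score of a feature vector under one label's model."""
    return bias + sum(weights[f] * v for f, v in feats.items())


def train_binary_relevance(pages, epochs=10):
    """Train one independent perceptron per label (binary relevance)."""
    labels = sorted({label for page in pages for label in page["labels"]})
    weights = {label: Counter() for label in labels}
    bias = {label: 0 for label in labels}
    for _ in range(epochs):
        for page in pages:
            feats = featurize(page)
            for label in labels:
                predicted = score(weights[label], bias[label], feats) > 0
                actual = label in page["labels"]
                if predicted != actual:
                    # Standard perceptron update toward the correct side.
                    step = 1 if actual else -1
                    bias[label] += step
                    for f, v in feats.items():
                        weights[label][f] += step * v
    return weights, bias


def predict(model, page):
    """Return every label whose perceptron scores the page above zero."""
    weights, bias = model
    feats = featurize(page)
    return {label for label in weights
            if score(weights[label], bias[label], feats) > 0}


if __name__ == "__main__":
    model = train_binary_relevance(TOY_PAGES)
    unlabelled = {"text": "goal", "links": ["Sport"]}
    print(sorted(predict(model, unlabelled)))  # prints ['sports']
```

Binary relevance is used here only because each label's model is independent, which keeps training cost linear in the number of labels. A page can receive zero, one, or several labels, depending on which per-label scores are positive.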

Revision as of 11:44, 8 November 2019