Semi-Supervised Categorization of Wikipedia Collection By Label Expansion
Authors | Boris Chidlovskii |
---|---|
Publication date | 2009 |
DOI | 10.1007/978-3-642-03761-0_42 |
Links | Original Preprint |
Semi-Supervised Categorization of Wikipedia Collection By Label Expansion - scientific work related to Wikipedia quality published in 2009, written by Boris Chidlovskii.
Overview
Authors address the problem of categorizing a large set of linked documents with important content and structure aspects, for example, from Wikipedia collection proposed at the INEX XML Mining track. Authors cope with the case where there is a small number of labeled pages and a very large number of unlabeled ones. Due to the sparsity of the link based structure of Wikipedia, authors apply the spectral and graph-based techniques developed in the semi-supervised machine learning. Authors use the content and structure views of Wikipedia collection to build a transductive categorizer for the unlabeled pages. Authors report evaluation results obtained with the label propagation function which ensures a good scalability on sparse graphs.
Embed
Wikipedia Quality
Chidlovskii, Boris. (2009). "[[Semi-Supervised Categorization of Wikipedia Collection By Label Expansion]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-03761-0_42.
English Wikipedia
{{cite journal |last1=Chidlovskii |first1=Boris |title=Semi-Supervised Categorization of Wikipedia Collection By Label Expansion |date=2009 |doi=10.1007/978-3-642-03761-0_42 |url=https://wikipediaquality.com/wiki/Semi-Supervised_Categorization_of_Wikipedia_Collection_By_Label_Expansion |journal=Springer, Berlin, Heidelberg}}
HTML
Chidlovskii, Boris. (2009). "<a href="https://wikipediaquality.com/wiki/Semi-Supervised_Categorization_of_Wikipedia_Collection_By_Label_Expansion">Semi-Supervised Categorization of Wikipedia Collection By Label Expansion</a>". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-03761-0_42.