Towards Automatic Classification of Wikipedia Content


Authors: Julian Szymański
Publication date: 2010
DOI: 10.1007/978-3-642-15381-5_13

Towards Automatic Classification of Wikipedia Content is a scientific work related to Wikipedia quality, published in 2010 and written by Julian Szymański.

Overview

Wikipedia, the Free Encyclopedia, faces the problem of properly classifying new articles every day. Assigning articles to categories is done manually and is time-consuming. It requires knowledge of Wikipedia's structure that goes beyond the competence of a typical editor, which leads to human errors: articles are left without categories or assigned to the wrong ones. The article presents an application of an SVM classifier to the automatic classification of Wikipedia documents. The classifier was tested with two text representations: inter-document connections (hyperlinks) and word content. The results of experiments evaluated on hand-crafted data show that the Wikipedia classification process can be partially automated. The proposed approach can be used to build a decision support system that suggests to editors the categories that best fit new content entered into Wikipedia.
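
The idea of classifying articles from either their words or their hyperlinks, and then suggesting categories to an editor, can be illustrated with a small sketch. The code below is not the paper's implementation: the abstract does not state the kernel, features, or training setup, so this assumes a linear SVM without a bias term, trained one-vs-rest by subgradient descent on the hinge loss. The toy articles, the category names, and all function names are hypothetical.

```python
from collections import Counter

def bag_of_words(text):
    """Word-content representation: counts of lowercase whitespace tokens."""
    return Counter(text.lower().split())

def link_features(links):
    """Hyperlink representation: one binary feature per outgoing link target."""
    return {"link:" + target: 1.0 for target in links}

def dot(weights, features):
    """Sparse dot product between a weight dict and a feature dict."""
    return sum(weights.get(k, 0.0) * v for k, v in features.items())

def train_linear_svm(samples, labels, epochs=50, lam=0.01, lr=0.1):
    """Binary linear SVM (no bias term, for brevity); labels are +1 or -1.

    Each step shrinks the weights (L2 regularization) and, if the sample
    violates the margin, moves the weights toward classifying it correctly.
    """
    w = {}
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            margin = y * dot(w, x)
            for k in w:
                w[k] *= 1.0 - lr * lam
            if margin < 1:
                for k, v in x.items():
                    w[k] = w.get(k, 0.0) + lr * y * v
    return w

def train_one_vs_rest(samples, categories):
    """Train one binary SVM per category (that category vs. all others)."""
    models = {}
    for cat in sorted(set(categories)):
        labels = [1 if c == cat else -1 for c in categories]
        models[cat] = train_linear_svm(samples, labels)
    return models

def suggest_categories(models, features, top_n=2):
    """Rank categories by SVM score, as a decision support tool might."""
    scores = {cat: dot(w, features) for cat, w in models.items()}
    return sorted(scores, key=scores.get, reverse=True)[:top_n]

# Hypothetical hand-crafted data: (text, outgoing links, category).
articles = [
    ("the electron is a subatomic particle with negative charge",
     ["Atom", "Particle"], "Physics"),
    ("quantum mechanics describes energy of a particle",
     ["Particle", "Energy"], "Physics"),
    ("the cell is the basic unit of living organisms",
     ["Organism", "Protein"], "Biology"),
    ("a protein is built by the cell from amino acids",
     ["Cell", "Organism"], "Biology"),
]
categories = [cat for _, _, cat in articles]

# Train separate classifiers for the two representations.
word_models = train_one_vs_rest([bag_of_words(t) for t, _, _ in articles], categories)
link_models = train_one_vs_rest([link_features(l) for _, l, _ in articles], categories)

# Suggest a category for new content under each representation.
print(suggest_categories(word_models, bag_of_words("electron energy and negative charge"), top_n=1))
print(suggest_categories(link_models, link_features(["Organism"]), top_n=1))
```

Keeping the two representations in separate models mirrors the comparison described in the abstract; returning a ranked list rather than a single label fits the decision support use case, where the editor makes the final choice.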

Embed

Wikipedia Quality

Szymański, Julian. (2010). "[[Towards Automatic Classification of Wikipedia Content]]". Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-15381-5_13.

English Wikipedia

{{cite journal |last1=Szymański |first1=Julian |title=Towards Automatic Classification of Wikipedia Content |date=2010 |doi=10.1007/978-3-642-15381-5_13 |url=https://wikipediaquality.com/wiki/Towards_Automatic_Classification_of_Wikipedia_Content |journal=Springer, Berlin, Heidelberg}}

HTML

Szymański, Julian. (2010). &quot;<a href="https://wikipediaquality.com/wiki/Towards_Automatic_Classification_of_Wikipedia_Content">Towards Automatic Classification of Wikipedia Content</a>&quot;. Springer, Berlin, Heidelberg. DOI: 10.1007/978-3-642-15381-5_13.