Difference between revisions of "Comparative Analysis of Classification Models for Quality Assessment of Wikipedia Articles"
Line 1: | Line 1: | ||
{{Infobox work | {{Infobox work | ||
− | |title = Comparative | + | |title = Comparative Analysis of Classification Models for Quality Assessment of Wikipedia Articles |
|date = 2017 | |date = 2017 | ||
|authors = [[Włodzimierz Lewoniewski]]<br />[[Krzysztof Węcel]]<br />[[Witold Abramowicz]] | |authors = [[Włodzimierz Lewoniewski]]<br />[[Krzysztof Węcel]]<br />[[Witold Abramowicz]] | ||
Line 6: | Line 6: | ||
|plink=https://www.researchgate.net/publication/317544122_Analiza_porownawcza_modeli_klasyfikacyjnych_w_kontekscie_oceny_jakosci_artykulow_Wikipedii_Comparative_analysis_of_classification_models_for_quality_assessment_of_Wikipedia_articles | |plink=https://www.researchgate.net/publication/317544122_Analiza_porownawcza_modeli_klasyfikacyjnych_w_kontekscie_oceny_jakosci_artykulow_Wikipedii_Comparative_analysis_of_classification_models_for_quality_assessment_of_Wikipedia_articles | ||
}} | }} | ||
− | '''Comparative | + | '''Comparative Analysis of Classification Models for Quality Assessment of Wikipedia Articles''' - scientific work about [[Wikipedia quality]] published in 2017, written by [[Włodzimierz Lewoniewski]], [[Krzysztof Węcel]] and [[Witold Abramowicz]] |
== Overview == | == Overview == |
Latest revision as of 21:17, 20 June 2018
Authors | Włodzimierz Lewoniewski Krzysztof Węcel Witold Abramowicz |
---|---|
Publication date | 2017 |
ISBN | 9788374179386 |
Links | Preprint |
Comparative Analysis of Classification Models for Quality Assessment of Wikipedia Articles - scientific work about Wikipedia quality published in 2017, written by Włodzimierz Lewoniewski, Krzysztof Węcel and Witold Abramowicz
Overview
In this paper authors compare the suitability of various classification models (including CART, random forest, boosting trees, C4.5, C5.0, SVM, neural networks) for automatic assessment of the quality of articles in seven language editions of Wikipedia (Belarussian, German, English, French, Polish, Russian, Ukrainian). Authors employed models available in STATISTICA, WEKA and R Studio. For the classification task authors used over 80 different features of the articles, elaborated based on state of the art analysis and our own experience. Authors also carried out a comparative analysis regarding the significance of the parameters having an impact on the quality of the papers in each language.