Automatic Quality Assessment of Content Created Collaboratively by Web Communities: A Case Study of Wikipedia

From Wikipedia Quality
Jump to: navigation, search
Automatic Quality Assessment of Content Created Collaboratively by Web Communities: A Case Study of Wikipedia
Authors
Daniel Hasan Dalip
Marcos Andre Goncalves
Marco Antônio Pinheiro de Cristo
Pável Pereira Calado
Publication date
2009
ISSN
15525996
ISBN
978-160558697-7
DOI
10.1145/1555400.1555449
Links
Original

Automatic Quality Assessment of Content Created Collaboratively by Web Communities: A Case Study of Wikipedia - scientific work about Wikipedia quality published in 2009, written by Daniel Hasan Dalip, Marcos Andre Goncalves, Marco Antônio Pinheiro de Cristo and Pável Pereira Calado.

Overview

The old dream of a universal repository containing all the human knowledge and culture is becoming possible through the Internet and the Web. Moreover, this is happening with the direct collaborative, participation of people. Wikipedia is a great example. It is an enormous repository of information with free access and edition, created by the community in a collaborative manner. However, this large amount of information, made available democratically and virtually without any control, raises questions about its relative quality. In this work authors explore a significant number of quality indicators, some of them proposed by us and used here for the first time, and study their capability to assess the quality of Wikipedia articles. Furthermore, authors explore machine learning techniques to combine these quality indicators into one single assessment judgment. Through experiments, authors show that the most important quality indicators are the easiest ones to extract, namely, textual features related to length, structure and style. Authors were also able to determine which indicators did not contribute significantly to the quality assessment. These were, coincidentally, the most complex features, such as those based on link analysis. Finally, authors compare their combination method with state-of-the-art solution and show significant improvements in terms of effective quality prediction.

Embed

Wikipedia Quality

Dalip, Daniel Hasan; Gonçalves, Marcos A.; Cristo, Marco Antônio Pinheiro; Calado, Pável Pereira. (2009). "[[Automatic Quality Assessment of Content Created Collaboratively by Web Communities: A Case Study of Wikipedia]]". International Conference on Information and Knowledge Management, Proceedings 2009, pp. 215-224. ISBN: 978-160558697-7. ISSN: 15525996. DOI: 10.1145/1555400.1555449.

English Wikipedia

{{cite journal |last1=Dalip |first1=Daniel Hasan |last2=Gonçalves |first2=Marcos A. |last3=Cristo |first3=Marco Antônio Pinheiro |last4=Calado |first4=Pável Pereira |title=Automatic Quality Assessment of Content Created Collaboratively by Web Communities: A Case Study of Wikipedia |date=2009 |isbn=978-160558697-7 |issn=15525996 |doi=10.1145/1555400.1555449 |url=https://wikipediaquality.com/wiki/Automatic_Quality_Assessment_of_Content_Created_Collaboratively_by_Web_Communities:_A_Case_Study_of_Wikipedia |journal=International Conference on Information and Knowledge Management, Proceedings 2009, pp. 215-224}}

HTML

Dalip, Daniel Hasan; Gonçalves, Marcos A.; Cristo, Marco Antônio Pinheiro; Calado, Pável Pereira. (2009). &quot;<a href="https://wikipediaquality.com/wiki/Automatic_Quality_Assessment_of_Content_Created_Collaboratively_by_Web_Communities:_A_Case_Study_of_Wikipedia">Automatic Quality Assessment of Content Created Collaboratively by Web Communities: A Case Study of Wikipedia</a>&quot;. International Conference on Information and Knowledge Management, Proceedings 2009, pp. 215-224. ISBN: 978-160558697-7. ISSN: 15525996. DOI: 10.1145/1555400.1555449.