Learning Simple Wikipedia: a Cogitation in Ascertaining Abecedarian Language

From Wikipedia Quality
Jump to: navigation, search


Learning Simple Wikipedia: a Cogitation in Ascertaining Abecedarian Language
Authors
Courtney Napoles
Mark Dredze
Publication date
2010
Links
Original

Learning Simple Wikipedia: a Cogitation in Ascertaining Abecedarian Language - scientific work related to Wikipedia quality published in 2010, written by Courtney Napoles and Mark Dredze.

Overview

Text simplification is the process of changing vocabulary and grammatical structure to create a more accessible version of the text while maintaining the underlying information and content. Automated tools for text simplification are a practical way to make large corpora of text accessible to a wider audience lacking high levels of fluency in the corpus language. In this work, authors investigate the potential of Simple Wikipedia to assist automatic text simplification by building a statistical classification system that discriminates simple English from ordinary English. Most text simplification systems are based on hand-written rules (e.g., PEST (Carroll et al., 1999) and its module SYSTAR (Canning et al., 2000)), and therefore face limitations scaling and transferring across domains. The potential for using Simple Wikipedia for text simplification is significant; it contains nearly 60,000 articles with revision histories and aligned articles to ordinary English Wikipedia. Using articles from Simple Wikipedia and ordinary Wikipedia, authors evaluated different classifiers and feature sets to identify the most discriminative features of simple English for use across domains. These findings help further understanding of what makes text simple and can be applied as a tool to help writers craft simple text.

Embed

Wikipedia Quality

Napoles, Courtney; Dredze, Mark. (2010). "[[Learning Simple Wikipedia: a Cogitation in Ascertaining Abecedarian Language]]". Association for Computational Linguistics.

English Wikipedia

{{cite journal |last1=Napoles |first1=Courtney |last2=Dredze |first2=Mark |title=Learning Simple Wikipedia: a Cogitation in Ascertaining Abecedarian Language |date=2010 |url=https://wikipediaquality.com/wiki/Learning_Simple_Wikipedia:_a_Cogitation_in_Ascertaining_Abecedarian_Language |journal=Association for Computational Linguistics}}

HTML

Napoles, Courtney; Dredze, Mark. (2010). &quot;<a href="https://wikipediaquality.com/wiki/Learning_Simple_Wikipedia:_a_Cogitation_in_Ascertaining_Abecedarian_Language">Learning Simple Wikipedia: a Cogitation in Ascertaining Abecedarian Language</a>&quot;. Association for Computational Linguistics.