For the Sake of Simplicity: Unsupervised Extraction of Lexical Simplifications from Wikipedia

From Wikipedia Quality
Jump to: navigation, search
For the Sake of Simplicity: Unsupervised Extraction of Lexical Simplifications from Wikipedia
Authors
Mark Yatskar
Bo Pang
Cristian Danescu-Niculescu-mizil
Lillian Lee
Publication date
2010
ISBN
1932432655;978-193243265-7
Links

For the Sake of Simplicity: Unsupervised Extraction of Lexical Simplifications from Wikipedia - scientific work about Wikipedia quality published in 2010, written by Mark Yatskar, Bo Pang, Cristian Danescu-Niculescu-mizil and Lillian Lee.

Overview

Autors report on work in progress on extracting lexical simplifications (e.g., "collaborate" → "work together"), focusing on utilizing edit histories in Simple English Wikipedia for this task. Authors consider two main approaches: (1) deriving simplification probabilities via an edit model that accounts for a mixture of different operations, and (2) using metadata to focus on edits that are more likely to be simplification operations. Authors find their methods to outperform a reasonable baseline and yield many high-quality lexical simplifications not included in an independently-created manually prepared list.

Embed

Wikipedia Quality

Yatskar, Mark; Pang, Bo; Danescu-Niculescu-mizil, Cristian; Lee, Lillian. (2010). "[[For the Sake of Simplicity: Unsupervised Extraction of Lexical Simplifications from Wikipedia]]". International Conference on Information and Knowledge Management, Proceedings 2010, pp. 929-938. ISBN: 1932432655;978-193243265-7.

English Wikipedia

{{cite journal |last1=Yatskar |first1=Mark |last2=Pang |first2=Bo |last3=Danescu-Niculescu-mizil |first3=Cristian |last4=Lee |first4=Lillian |title=For the Sake of Simplicity: Unsupervised Extraction of Lexical Simplifications from Wikipedia |date=2010 |isbn=1932432655;978-193243265-7 |url=https://wikipediaquality.com/wiki/For_the_Sake_of_Simplicity:_Unsupervised_Extraction_of_Lexical_Simplifications_from_Wikipedia |journal=International Conference on Information and Knowledge Management, Proceedings 2010, pp. 929-938}}

HTML

Yatskar, Mark; Pang, Bo; Danescu-Niculescu-mizil, Cristian; Lee, Lillian. (2010). &quot;<a href="https://wikipediaquality.com/wiki/For_the_Sake_of_Simplicity:_Unsupervised_Extraction_of_Lexical_Simplifications_from_Wikipedia">For the Sake of Simplicity: Unsupervised Extraction of Lexical Simplifications from Wikipedia</a>&quot;. International Conference on Information and Knowledge Management, Proceedings 2010, pp. 929-938. ISBN: 1932432655;978-193243265-7.