Aligning Sentences from Standard Wikipedia to Simple Wikipedia

From Wikipedia Quality
Revision as of 10:41, 3 August 2019 by Sophie (talk | contribs) (Aligning Sentences from Standard Wikipedia to Simple Wikipedia - creating a new article)

Aligning Sentences from Standard Wikipedia to Simple Wikipedia is a scientific work related to Wikipedia quality, published in 2015 and written by William D. Hwang, Hannaneh Hajishirzi, Mari Ostendorf and Wei Wu.

Overview

This work improves monolingual sentence alignment for text simplification, specifically for text in standard and simple Wikipedia. The authors introduce a method that improves on past efforts in two ways: it uses a greedy (rather than ordered) search over the document, and it uses a word-level semantic similarity score based on Wiktionary (rather than WordNet) that also accounts for structural similarity through syntactic dependencies. Experiments show improved performance on a hand-aligned set, with the largest gain coming from structural similarity. The resulting datasets of manually and automatically aligned sentence pairs are made available.
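
The idea of a greedy, order-free search can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (tokenize, word_overlap, greedy_align), the one-to-one matching rule and the threshold value are all assumptions made for illustration, and a plain word-overlap (Jaccard) score stands in for the Wiktionary-based semantic and structural similarity described above.

```python
from itertools import product


def tokenize(sentence):
    """Lowercase a sentence and return its set of words without punctuation."""
    words = {w.strip(".,;:!?()\"'").lower() for w in sentence.split()}
    return words - {""}


def word_overlap(a, b):
    """Jaccard word overlap; a hypothetical stand-in for the paper's
    Wiktionary-based similarity score."""
    ta, tb = tokenize(a), tokenize(b)
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def greedy_align(standard, simple, similarity=word_overlap, threshold=0.3):
    """Greedily pair sentences by descending similarity, ignoring their order.

    Assumption: each sentence is aligned at most once and pairs scoring
    below `threshold` are dropped. Returns (standard_idx, simple_idx, score).
    """
    scored = [
        (similarity(s, t), i, j)
        for (i, s), (j, t) in product(enumerate(standard), enumerate(simple))
    ]
    # Best score first; indices break ties deterministically.
    scored.sort(key=lambda x: (-x[0], x[1], x[2]))

    used_standard, used_simple, pairs = set(), set(), []
    for score, i, j in scored:
        if score < threshold:
            break  # every remaining candidate scores lower
        if i in used_standard or j in used_simple:
            continue
        used_standard.add(i)
        used_simple.add(j)
        pairs.append((i, j, score))
    return pairs


if __name__ == "__main__":
    standard = ["The cat sat on the mat.", "Paris is the capital of France."]
    simple = ["Paris is the capital of France.", "A cat sat on a mat."]
    for i, j, score in greedy_align(standard, simple):
        print(f"standard[{i}] <-> simple[{j}]  score={score:.2f}")
```

Because candidates are ranked only by score, the sketch can recover crossing alignments (the first standard sentence matching the second simple one), which an ordered search that assumes both documents present content in the same sequence would not allow.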