High-Precision Person Name Extraction from Turkish Texts Using Wikipedia

From Wikipedia Quality
Revision as of 11:11, 8 November 2019 by Agnieszka (talk | contribs) (wikilinks)
Jump to: navigation, search

High-Precision Person Name Extraction from Turkish Texts Using Wikipedia - scientific work related to Wikipedia quality published in 2015, written by Dilek Küçük and Doğan Küçük.

Overview

In this paper, authors focus on person name extraction from diverse text types in Turkish and have compiled a large set of person names from Turkish Wikipedia. After automated post-processing to clean and extend it, authors have performed extraction experiments using this resource on data sets of considerable sizes and achieved high precision rates. Next, authors have shown that the use of non-local dependencies together with this Wikipedia resource improves recall, and hence F-Measure, considerably. Finally, authors have tested the contribution of the resource and the scheme based on non-local dependencies to the person name extraction performance of a full-fledged named entity recognizer.