High-Precision Person Name Extraction from Turkish Texts Using Wikipedia
In this paper, authors focus on person name extraction from diverse text types in Turkish and have compiled a large set of person names from Turkish Wikipedia. After automated post-processing to clean and extend it, authors have performed extraction experiments using this resource on data sets of considerable sizes and achieved high precision rates. Next, authors have shown that the use of non-local dependencies together with this Wikipedia resource improves recall, and hence F-Measure, considerably. Finally, authors have tested the contribution of the resource and the scheme based on non-local dependencies to the person name extraction performance of a full-fledged named entity recognizer.