News Auto-Tagging Using Wikipedia

From Wikipedia Quality
Revision as of 19:58, 10 July 2019 by Kaylee (talk | contribs) (Links)
Jump to: navigation, search

News Auto-Tagging Using Wikipedia - scientific work related to Wikipedia quality published in 2013, written by Shaimaa Shams Eldin and Samhaa R. El-Beltagy.

Overview

This paper presents an efficient method for automatically annotating Arabic news stories with tags using Wikipedia. The idea of the system is to use Wikipedia article names, properties, and re-directs to build a pool of meaningful tags. Sophisticated and efficient matching methods are then used to detect text fragments in input news stories that correspond to entries in the constructed tag pool. Generated tags represent real life entities or concepts such as the names of popular places, known organizations, celebrities, etc. These tags can be used indirectly by a news site for indexing, clustering, classification, statistics generation or directly to give a news reader an overview of news story contents. Evaluation of the system has shown that the tags it generates are better than those generated by MSN Arabic news.