Wikitrends: Unstructured Wikipedia-Based Text Analytics Framework

From Wikipedia Quality
Revision as of 05:31, 10 June 2019 by Audrey (talk | contribs) (Wikitrends: Unstructured Wikipedia-Based Text Analytics Framework - new page)

Wikitrends: Unstructured Wikipedia-Based Text Analytics Framework is a scientific work related to Wikipedia quality, published in 2017 and written by Michel Naim Gerguis, Cherif Salama and M. Watheq El-Kharashi.

Overview

WikiTrends is a new analytics framework for Wikipedia articles. It adds temporal and spatial dimensions to Wikipedia and visualizes the extracted information, turning the large static encyclopedia into a vibrant one: users can generate aggregated views, as timelines or heat maps, for any user-defined collection drawn from unstructured text. Data mining techniques were applied to detect the location, start and end year of existence, gender, and entity class for 4.85 million pages. The authors evaluated the extractors on a small, manually tagged random set of articles. Sample WikiTrends maps include heat maps of notable football players’ counts over history and of the dominant occupations in a specific era, while timelines can illustrate fame battles over history between male and female actors, between music genres, or between American, Italian, and Indian films. Through information visualization and simple configuration, WikiTrends offers a new experience of answering questions with a figure.
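The timeline views described above can be pictured with a minimal Python sketch. It is not the authors' implementation: the `Entity` record, the `timeline` function, and the placeholder records are all hypothetical, and the sketch only assumes that each page has already been reduced to the extracted fields the overview lists (start and end year of existence, gender, entity class).

```python
from collections import Counter
from dataclasses import dataclass
from typing import Dict, Iterable, Optional


@dataclass
class Entity:
    """Hypothetical record holding the fields extracted for one page."""
    title: str
    start_year: int
    end_year: Optional[int]  # None means the entity still exists
    gender: str
    entity_class: str


def timeline(entities: Iterable[Entity], group_by: str,
             first_year: int, last_year: int) -> Dict[str, Counter]:
    """Count, per group and per year, the entities existing in that year."""
    counts: Dict[str, Counter] = {}
    for entity in entities:
        key = getattr(entity, group_by)
        end = entity.end_year if entity.end_year is not None else last_year
        # Clip the entity's lifespan to the requested window.
        lo = max(entity.start_year, first_year)
        hi = min(end, last_year)
        series = counts.setdefault(key, Counter())
        for year in range(lo, hi + 1):
            series[year] += 1
    return counts


if __name__ == "__main__":
    # Placeholder records, invented purely for illustration.
    collection = [
        Entity("Actor A", 1950, 1990, "female", "actor"),
        Entity("Actor B", 1980, None, "male", "actor"),
        Entity("Actor C", 1985, 2000, "female", "actor"),
    ]
    for group, series in timeline(collection, "gender", 1980, 1990).items():
        print(group, sorted(series.items()))
```

Grouping by `entity_class` instead of `gender` would give the per-occupation view, and replacing the year key with a location key would give the counts behind a heat map.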