Visual and Predictive Analytics on Singapore News: Experiments on Gdelt, Wikipedia, and ^Sti

From Wikipedia Quality
Revision as of 22:55, 21 May 2019 by Sofia (talk | contribs) (Overview + infobox - Visual and Predictive Analytics on Singapore News: Experiments on Gdelt, Wikipedia, and ^Sti)


Visual and Predictive Analytics on Singapore News: Experiments on Gdelt, Wikipedia, and ^Sti
Authors
Clifton Phua
Yuzhang Feng
Junyao Ji
Timothy Soh
Publication date
2014
Links
Original Preprint

Visual and Predictive Analytics on Singapore News: Experiments on Gdelt, Wikipedia, and ^Sti is a scientific work related to Wikipedia quality, published in 2014 and written by Clifton Phua, Yuzhang Feng, Junyao Ji and Timothy Soh.

Overview

The open-source Global Database of Events, Language, and Tone (GDELT) is the most comprehensive and up-to-date Big Data source of important terms extracted from international news articles. The authors focus only on GDELT's Singapore events to better understand the data quality of its news articles, the accuracy of its term extraction, and its potential for prediction. To test news completeness and validity, the authors visually compared GDELT (terms from Singapore news articles from 1979 to 2013) with Wikipedia's timeline of Singaporean history. To test term extraction accuracy, the authors visually compared GDELT (CAMEO codes and the TABARI extraction system applied to the text of Singapore news articles from April to December 2013) with SAS Text Miner's term and topic extraction. To perform predictive analytics, the authors propose a novel feature engineering method that transforms row-level GDELT data from individual articles to a user-specified temporal resolution. For example, the authors apply a decision tree that uses daily counts of GDELT feature values to predict the Straits Times Index (^STI) of the Singapore stock market. Of practical interest among these results is the ability of SAS Visual Analytics to highlight the various impacts of the June 2013 Southeast Asian haze and the December 2013 Little India riot on Singapore. Although Singapore is unique as a sovereign city-state and a leading financial centre with strong international influence and a highly multi-cultural population, the visual and predictive analytics reported here are highly applicable to another country's GDELT data.
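The step of moving from row-level records to a chosen temporal resolution can be illustrated with a minimal sketch. This is not the authors' implementation: the field names (`day`, `event_code`, `actor`), the sample rows, and the helper functions `period_key`, `aggregate_counts` and `to_matrix` are all hypothetical and do not reflect the real GDELT schema. The sketch only counts how often each feature value occurs per day, month or year.

```python
from collections import Counter, defaultdict
from datetime import date

# Hypothetical row-level records, one per extracted event.
# Field names and values are illustrative placeholders, not real GDELT data.
ROWS = [
    {"day": date(2013, 6, 20), "event_code": "020", "actor": "GOV"},
    {"day": date(2013, 6, 20), "event_code": "020", "actor": "MED"},
    {"day": date(2013, 6, 21), "event_code": "190", "actor": "GOV"},
    {"day": date(2013, 12, 8), "event_code": "145", "actor": "CVL"},
]


def period_key(d, resolution):
    """Map a date to a sortable label at the requested temporal resolution."""
    if resolution == "day":
        return d.isoformat()
    if resolution == "month":
        return f"{d.year:04d}-{d.month:02d}"
    if resolution == "year":
        return f"{d.year:04d}"
    raise ValueError(f"unsupported resolution: {resolution}")


def aggregate_counts(rows, fields, resolution="day"):
    """Count occurrences of each 'field=value' pair within each period."""
    counts = defaultdict(Counter)
    for row in rows:
        key = period_key(row["day"], resolution)
        for field in fields:
            counts[key][f"{field}={row[field]}"] += 1
    return counts


def to_matrix(counts):
    """Turn per-period counters into a dense matrix (one row per period)."""
    periods = sorted(counts)
    columns = sorted({col for ctr in counts.values() for col in ctr})
    matrix = [[counts[p][col] for col in columns] for p in periods]
    return periods, columns, matrix


if __name__ == "__main__":
    daily = aggregate_counts(ROWS, ["event_code", "actor"], "day")
    periods, columns, matrix = to_matrix(daily)
    print(columns)
    for period, row in zip(periods, matrix):
        print(period, row)
```

Each matrix row could then be paired with a label for the same period, for example the direction of the ^STI on that day, and passed to any decision tree learner. How the labels are defined and aligned with the counts is left open here, since the overview does not specify it.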

Embed

Wikipedia Quality

Phua, Clifton; Feng, Yuzhang; Ji, Junyao; Soh, Timothy. (2014). "[[Visual and Predictive Analytics on Singapore News: Experiments on Gdelt, Wikipedia, and ^Sti]]".

English Wikipedia

{{cite journal |last1=Phua |first1=Clifton |last2=Feng |first2=Yuzhang |last3=Ji |first3=Junyao |last4=Soh |first4=Timothy |title=Visual and Predictive Analytics on Singapore News: Experiments on Gdelt, Wikipedia, and ^Sti |date=2014 |url=https://wikipediaquality.com/wiki/Visual_and_Predictive_Analytics_on_Singapore_News:_Experiments_on_Gdelt,_Wikipedia,_and_^Sti}}

HTML

Phua, Clifton; Feng, Yuzhang; Ji, Junyao; Soh, Timothy. (2014). &quot;<a href="https://wikipediaquality.com/wiki/Visual_and_Predictive_Analytics_on_Singapore_News:_Experiments_on_Gdelt,_Wikipedia,_and_^Sti">Visual and Predictive Analytics on Singapore News: Experiments on Gdelt, Wikipedia, and ^Sti</a>&quot;.