A Cloud-Enabled Automatic Disaster Analysis System of Multi-Sourced Data Streams: an Example Synthesizing Social Media, Remote Sensing and Wikipedia Data

From Wikipedia Quality
Revision as of 08:23, 10 October 2019 by Aubree (talk | contribs) (A Cloud-Enabled Automatic Disaster Analysis System of Multi-Sourced Data Streams: an Example Synthesizing Social Media, Remote Sensing and Wikipedia Data - creating a new article)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

A Cloud-Enabled Automatic Disaster Analysis System of Multi-Sourced Data Streams: an Example Synthesizing Social Media, Remote Sensing and Wikipedia Data - scientific work related to Wikipedia quality published in 2017, written by Qunying Huang, Guido Cervone and Guiming Zhang.

Overview

Abstract Social media streams and remote sensing data have emerged as new sources for tracking disaster events, and assessing their damages. Previous studies focus on a case-by-case approach, where a specific event was first chosen and filtering criteria (e.g., keywords, spatiotemporal information) are manually designed and used to retrieve relevant data for disaster analysis. This paper presents a framework that synthesizes multi-sourced data (e.g., social media, remote sensing, Wikipedia, and Web), spatial data mining and text mining technologies to build an architecturally resilient and elastic solution to support disaster analysis of historical and future events. Within the proposed framework, Wikipedia is used as a primary source of different historical disaster events, which are extracted to build an event database. Such a database characterizes the salient spatiotemporal patterns and characteristics of each type of disaster. Additionally, it can provide basic semantics, such as event name (e.g., Hurricane Sandy) and type (e.g., flooding) and spatiotemporal scopes, which are then tuned by the proposed procedures to extract additional information (e.g., hashtags for searching tweets), to query and retrieve relevant social media and remote sensing data for a specific disaster. Besides historical event analysis and pattern mining, the cloud-based framework can also support real-time event tracking and monitoring by providing on-demand and elastic computing power and storage capabilities. A prototype is implemented and tested with data relative to the 2011 Hurricane Sandy and the 2013 Colorado flooding.