Portability of semantic and spatial-temporal machine learning methods to analyse social media for near-real-time disaster monitoring

Nat Hazards (Dordr). 2021;108(3):2939-2969. doi: 10.1007/s11069-021-04808-4. Epub 2021 Jul 10.

Abstract

Up-to-date information about an emergency is crucial for effective disaster management. However, severe restrictions impede the creation of spatiotemporal information by current remote sensing-based monitoring systems, especially at the beginning of a disaster. Multiple publications have shown promising results in complementing these monitoring systems with spatiotemporal information extracted from social media data. However, various monitoring system criteria, such as near-real-time capability or applicability to different disaster types and use cases, have not yet been addressed. This paper presents an improved version of a recently proposed methodology for identifying disaster-impacted areas (hot spots and cold spots) by combining semantic and geospatial machine learning methods. The process of identifying impacted areas is automated using semi-supervised topic models for various kinds of natural disasters. We validated the portability of our approach through experiments with multiple natural disasters and disaster types with differing characteristics, whereby one use case served to demonstrate the near-real-time capability of our approach. We demonstrated the validity of the produced information by comparing the results with official authority datasets provided by the United States Geological Survey and the National Hurricane Center. The validation shows that our approach produces reliable results that match the official authority datasets. Furthermore, we present the analysis results and compare them with the outputs of the remote sensing-based Copernicus Emergency Management Service. The results indicate that information derived from different sources can reliably reveal disaster-impacted areas that were not detected by the Copernicus Emergency Management Service, particularly in densely populated cities.

Keywords: Disaster management; Geospatial analysis; Machine learning; Semantic topic analysis; Social media.
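
The hot spot and cold spot identification mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not name the spatial statistic used, so the Getis-Ord Gi* statistic below is an assumption (it is a common choice for hot spot analysis), as are the function names, the binary distance-band weights, and the toy grid of per-cell counts of disaster-related posts. It is not the authors' implementation.

```python
import numpy as np

def getis_ord_gi_star(coords, values, radius):
    """Gi* z-scores with binary distance-band weights (each cell includes itself)."""
    coords = np.asarray(coords, dtype=float)
    x = np.asarray(values, dtype=float)
    n = x.size

    # Pairwise Euclidean distances between cell centres.
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))

    # w[i, j] = 1 if cell j lies within `radius` of cell i (dist[i, i] = 0, so i counts).
    w = (dist <= radius).astype(float)

    mean = x.mean()
    s = x.std()  # population standard deviation, as in the Gi* definition
    w_sum = w.sum(axis=1)
    w_sq_sum = (w ** 2).sum(axis=1)

    numer = w @ x - mean * w_sum
    denom = s * np.sqrt((n * w_sq_sum - w_sum ** 2) / (n - 1))

    # Leave NaN where the statistic is undefined (constant values or all-inclusive window).
    return np.divide(numer, denom, out=np.full(n, np.nan), where=denom > 0)

def classify(z, threshold=1.96):
    """Label cells as hot, cold, or not significant (assumed two-sided 95% cut-off)."""
    z = np.asarray(z, dtype=float)
    labels = np.full(z.shape, "not significant", dtype=object)
    labels[z >= threshold] = "hot"
    labels[z <= -threshold] = "cold"
    return labels

# Hypothetical 6 x 6 grid: counts of disaster-related posts per cell,
# with a cluster of high counts in one corner.
grid = np.zeros((6, 6))
grid[:2, :2] = 10
coords = [(i, j) for i in range(6) for j in range(6)]

z = getis_ord_gi_star(coords, grid.ravel(), radius=1.0)
labels = classify(z).reshape(grid.shape)
print(labels)
```

In a pipeline like the one described, the per-cell values would presumably come from posts that a topic model assigned to a disaster-related topic; that coupling is assumed here rather than taken from the abstract.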