Harnessing heterogeneity in space with statistically guided meta-learning

Yiqun Xie; Weiye Chen; Erhu He; Xiaowei Jia; Han Bao; Xun Zhou; Rahul Ghosh; Praveen Ravirathinam

doi:10.1007/s10115-023-01847-0

Knowl Inf Syst. 2023;65(6):2699-2729. doi: 10.1007/s10115-023-01847-0. Epub 2023 Mar 8.

Authors

Yiqun Xie^#¹, Weiye Chen^#¹, Erhu He^#², Xiaowei Jia², Han Bao³, Xun Zhou³, Rahul Ghosh⁴, Praveen Ravirathinam⁴

Affiliations

¹ University of Maryland, College Park, MD USA.
² University of Pittsburgh, Pittsburgh, PA USA.
³ University of Iowa, Iowa City, IA USA.
⁴ University of Minnesota, Minneapolis, MN USA.

^# Contributed equally.

Abstract

Spatial data are ubiquitous, massively collected, and widely used to support critical decision-making in many societal domains, including public health (e.g., COVID-19 pandemic control), agricultural crop monitoring, and transportation. While recent advances in machine learning and deep learning offer promising new ways to mine such rich datasets (e.g., satellite imagery, COVID statistics), spatial heterogeneity, an intrinsic characteristic embedded in spatial data, poses a major challenge because data distributions or generative processes often vary across space at different scales, with their spatial extents unknown. Recent studies (e.g., SVANN, spatial ensemble) targeting this difficult problem either require a known space partitioning as input or can support only a very limited number of partitions or classes (e.g., two) due to the decrease in training data size and the complexity of analysis. To address these limitations, we propose a model-agnostic framework to automatically transform a deep learning model into a spatial-heterogeneity-aware architecture, where the learning of arbitrary space partitionings is guided by a learning-engaged generalization of the multivariate scan statistic and parameters are shared based on spatial relationships. Moreover, we propose a spatial moderator to generalize learned space partitionings to new test regions. Finally, we extend the framework by integrating meta-learning-based training strategies into both spatial transformation and moderation to enhance knowledge sharing and adaptation among different processes. Experimental results on real-world datasets show that the framework can effectively capture flexibly shaped heterogeneous footprints and substantially improve prediction performance.
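
The idea of letting a scan statistic guide a space partitioning can be illustrated with a much simpler stand-in than the one the abstract describes. The sketch below is not the authors' learning-engaged multivariate generalization; it assumes a classic univariate Poisson (Kulldorff-style) log-likelihood ratio, and the names `poisson_llr`, `best_partition`, and the toy cell counts are hypothetical. It scores candidate regions by how strongly a model's error count inside the region exceeds what the global error rate would predict, then picks the highest-scoring region as a candidate split.

```python
import math


def poisson_llr(c_in, e_in, c_total, e_total):
    """Poisson log-likelihood ratio for one candidate region.

    c_in / e_in: observed and expected error counts inside the region.
    c_total / e_total: observed and expected counts over the whole study area.
    Returns 0.0 unless the error rate inside is higher than outside.
    """
    c_out = c_total - c_in
    e_out = e_total - e_in
    if e_in <= 0 or e_out <= 0:
        return 0.0
    if c_in / e_in <= c_out / e_out:
        return 0.0

    def xlogy(x, y):
        # Treat 0 * log(0) as 0 so empty regions do not raise an error.
        return x * math.log(y) if x > 0 else 0.0

    return (xlogy(c_in, c_in / e_in)
            + xlogy(c_out, c_out / e_out)
            - xlogy(c_total, c_total / e_total))


def best_partition(errors, samples, candidates):
    """Return the candidate region (a set of cell ids) with the highest score."""
    c_total = sum(errors.values())
    n_total = sum(samples.values())
    best, best_score = None, 0.0
    for region in candidates:
        c_in = sum(errors[k] for k in region)
        # Expected errors if the global error rate held everywhere.
        e_in = c_total * sum(samples[k] for k in region) / n_total
        score = poisson_llr(c_in, e_in, c_total, c_total)
        if score > best_score:
            best, best_score = region, score
    return best, best_score


# Toy data (made up): per-cell misclassification counts of a single global model.
errors = {"a": 1, "b": 1, "c": 9, "d": 9}
samples = {"a": 10, "b": 10, "c": 10, "d": 10}
candidates = [{"a", "b"}, {"c", "d"}, {"a", "c"}]

best, score = best_partition(errors, samples, candidates)
print(best, round(score, 2))
```

In this toy setting the cells where the global model performs poorly form the highest-scoring region, which suggests training a separate model branch there. A practical system would also need a significance test (for example, Monte Carlo replication) before accepting a split.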

Keywords: Deep learning; Heterogeneity; Meta-learning; Mobility; Remote sensing; Spatial; Statistics.

© The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.