Using POI and multisource satellite datasets for mainland China's population spatialization and spatiotemporal changes based on regional heterogeneity

Sci Total Environ. 2024 Feb 20:912:169499. doi: 10.1016/j.scitotenv.2023.169499. Epub 2023 Dec 19.

Abstract

Geospatial big data and remote sensing data are widely used in population spatialization studies. However, the relationship between them and population distribution has regional heterogeneity in different geographic contexts. It is necessary to improve data processing methods and spatialization models in areas with large geographical differences. We used land cover data to extract human activity, nighttime light and point-of-interest (POI) data to represent human activity intensity, and considered differences in geographical context to divide mainland China into northern, southern and western regions. We constructed random forest models to generate gridded population distribution datasets with a resolution of 500 m, and quantitatively evaluated the importance of auxiliary data in different geographical contexts. The street-level accuracy assessment showed that our population dataset is more accurate than WorldPop, with a higher R2 and smaller deviation. The improved datasets provided broad potential for exploring the spatial-temporal changes in grid-level population distribution in China from 2010 to 2020. The results indicated that the population density and settlement area have increased, and the overall pattern of population distribution has remained highly stable, but there are significant differences in population change patterns among cities with different urbanization processes. The importance of the ancillary data to the models varied significantly, with POI contributing the most to the southern region and the least to the western region. Moreover, POI had relatively less influence on model improvement in undeveloped areas. Our study could provide a reference for predicting social and economic spatialized data in different geographical context areas using POI and multisource satellite data.

Keywords: Point of interest; Population spatialization; Random forest; Regional heterogeneity; Spatial-temporal pattern.