Influencing Indicators and Spatial Variation of Diabetes Mellitus Prevalence in Shandong, China: A Framework for Using Data-Driven and Spatial Methods

Geohealth. 2021 Mar 1;5(3):e2020GH000320. doi: 10.1029/2020GH000320. eCollection 2021 Mar.

Abstract

To control and prevent the risk of diabetes, diabetes studies have identified the need to better understand and evaluate the associations between influencing indicators and the prevalence of diabetes. One constraint has been that influencing indicators have been selected mainly based on subjective judgment and tested using traditional statistical modeling methods. We proposed a framework new to diabetes studies using data-driven and spatial methods to identify the most significant influential determinants of diabetes automatically and estimated their relationships. We used data from diabetes mellitus patients' health insurance records in Shandong province, China, and collected influencing indicators of diabetes prevalence at the county level in the sociodemographic, economic, education, and geographical environment domains. We specified a framework to identify automatically the most influential determinants of diabetes, and then established the relationship between these selected influencing indicators and diabetes prevalence. Our autocorrelation results showed that the diabetes prevalence in 12 Shandong cities was significantly clustered (Moran's I = 0.328, p < 0.01). In total, 17 significant influencing indicators were selected by executing binary linear regressions and lasso regressions. The spatial error regressions in different subgroups were subject to different diabetes indicators. Some positive indicators existed significantly like per capita fruit production and other indicators correlated with diabetes prevalence negatively like the proportion of green space. Diabetes prevalence was mainly subjected to the joint effects of influencing indicators. This framework can help public health officials to inform the implementation of improved treatment and policies to attenuate diabetes diseases.

Keywords: diabetes mellitus; indicators; lasso regression; spatial regression.

Associated data

  • figshare/10.6084/m9.figshare.14061104