Modeling geostatistical incomplete spatially correlated survival data with applications to COVID-19 mortality in Ghana

Spat Stat. 2023 Apr:54:100730. doi: 10.1016/j.spasta.2023.100730. Epub 2023 Feb 20.

Abstract

Survival models which incorporate frailties are common in time-to-event data collected over distinct spatial regions. While incomplete data are unavoidable and a common complication in statistical analysis of spatial survival research, most researchers still ignore the missing data problem. In this paper, we propose a geostatistical modeling approach for incomplete spatially correlated survival data. We achieve this by exploring missingness in outcome, covariates, and spatial locations. In the process, we analyze incomplete spatially-referenced survival data using a Weibull model for the baseline hazard function and correlated log-Gaussian frailties to model spatial correlation. We illustrate the proposed method with simulated data and an application to geo-referenced COVID-19 data from Ghana. There are several disagreements between parameter estimates and credible intervals widths obtained using our proposed approach and complete case analysis. Based on these findings, we argue that our approach provides more reliable parameter estimates and has higher predictive accuracy.

Keywords: Bayesian modeling; COVID-19; Frailties; Incomplete data; Multiple imputation; Spatial survival.