Assessing spatial confounding in cancer disease mapping using R

Cancer Rep (Hoboken). 2020 Aug;3(4):e1263. doi: 10.1002/cnr2.1263. Epub 2020 Jul 28.

Abstract

Background: Exploring spatial patterns in the context of cancer disease mapping (DM) is a decisive approach to bring evidence of geographical tendencies in assessing disease status and progression. However, this framework is not insulated from spatial confounding, a topic of significant interest in cancer epidemiology, where the latent correlation between the spatial random effects and fixed effects (such as covariates), often lead to misleading interpretation.

Aims: To introduce three popular approaches (RHZ, HH and SPOCK; details in paper) often employed to tackle spatial confounding, and illustrate their implementation in cancer research via the popular statistical software R.

Methods: As a solution to alleviate spatial confounding, restricted spatial regressions are constructed by either projecting the latent effect onto the orthogonal space of covariates, or by displacing the spatial locations. Popular parametric count data models, such as the Poisson, generalized Poisson and negative binomial, were considered for the areal count responses, while the spatial association is quantified via the conditional autoregressive (CAR) model. Our method of inference in Bayesian, sometimes aided by the integrated nested Laplace approximation (INLA) to accelerate computing. The methods are implemented in the R package RASCO available from the first author's GitHub page.

Results: The results reveal that all three methods perform well in alleviating the bias and variance inflation present in the spatial models. The effects of spatial confounding were also explored, which, if ignored in practice, may lead to wrong conclusions.

Conclusion: Spatial confounding continues to remain a critical bottleneck in deriving precise inference from spatial DM models. Hence, its effects must be investigated, and mitigated. Several approaches are available in the literature, and they produce trustworthy results. The central contribution of this paper is providing the practitioners the R package RASCO, capable of fitting a large number of spatial models, as well as their restricted versions.

Keywords: Bayesian inference; RASCO; areal modeling; integrated nested Laplace approximation; spatial confounding; variance inflation.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binomial Distribution
  • Humans
  • Neoplasms / epidemiology*
  • Poisson Distribution
  • Software*