Analyzing spatial aggregation error in statistical models of late-stage cancer risk: a Monte Carlo simulation approach

Int J Health Geogr. 2010 Oct 19:9:51. doi: 10.1186/1476-072X-9-51.

Abstract

Purpose: This paper examines the effect of spatial aggregation error on statistical estimates of the association between spatial access to health care and late-stage cancer.

Methods: Monte Carlo simulation was used to disaggregate cancer cases for two Illinois counties from zip code to census block in proportion to the age-race composition of the block population. After the disaggregation, a hierarchical logistic model was estimated examining the relationship between late-stage breast cancer and risk factors including travel distance to mammography, at both the zip code and census block levels. Model coefficients were compared between the two levels to assess the impact of spatial aggregation error.

Results: We found that spatial aggregation error influences the coefficients of regression-type models at the zip code level, and this impact is highly dependent on the study area. In one study area (Kane County), block-level coefficients were very similar to those estimated on the basis of zip code data; whereas in the other study area (Peoria County), the two sets of coefficients differed substantially raising the possibility of drawing inaccurate inferences about the association between distance to mammography and late-stage cancer risk.

Conclusions: Spatial aggregation error can significantly affect the coefficient values and inferences drawn from statistical models of the association between cancer outcomes and spatial and non-spatial variables. Relying on data at the zip code level may lead to inaccurate findings on health risk factors.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Aged
  • Bias
  • Breast Neoplasms / epidemiology*
  • Breast Neoplasms / pathology*
  • Censuses
  • Cluster Analysis
  • Computer Simulation
  • Female
  • Geographic Information Systems
  • Health Services Accessibility / statistics & numerical data*
  • Humans
  • Illinois / epidemiology
  • Mammography / statistics & numerical data
  • Middle Aged
  • Models, Statistical*
  • Monte Carlo Method
  • Neoplasm Staging
  • Registries