A simulation study for geographic cluster detection analysis on population-based health survey data using spatial scan statistics

Int J Health Geogr. 2022 Sep 9;21(1):11. doi: 10.1186/s12942-022-00311-6.

Abstract

Background: In public health and epidemiology, spatial scan statistics can be used to identify spatial cluster patterns of health-related outcomes from population-based health survey data. Although it is appropriate to consider the complex sample design and sampling weight when analyzing complex sample survey data, the observed survey responses without these considerations are often used in many studies related to spatial cluster detection.

Methods: We conducted a simulation study to investigate which data type from complex survey data is more suitable for use by comparing the spatial cluster detection results of three approaches: (1) individual-level data, (2) weighted individual-level data, and (3) aggregated data.

Results: The results of the spatial cluster detection varied depending on the data type. To compare the performance of spatial cluster detection, sensitivity and positive predictive value (PPV) were evaluated over 100 iterations. The average sensitivity was high for all three approaches, but the average PPV was higher when using aggregated data than when using individual-level data with or without sampling weights.

Conclusions: Through the simulation study, we found that use of aggregate-level data is more appropriate than other types of data, when searching for spatial clusters using spatial scan statistics on population-based health survey data.

Keywords: Geographic surveillance; Health survey; Sampling design; Sampling weight; Spatial cluster detection.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cluster Analysis
  • Computer Simulation
  • Health Surveys
  • Humans
  • Public Health*
  • Research Design*