The geography of power: statistical performance of tests of clusters and clustering in heterogeneous populations

Stat Med. 2006 Mar 15;25(5):853-65. doi: 10.1002/sim.2418.

Abstract

Heterogeneous population densities complicate comparisons of statistical power between hypothesis tests evaluating spatial clusters or clustering of disease. Specifically, the location of a cluster within a heterogeneously distributed population at risk impacts power properties, complicating comparisons of tests, and allowing one to map spatial variations in statistical power for different tests. Such maps provide insight into the overall power of a particular test, and also indicate areas within the study area where tests are more or less likely to detect the same local increase in relative risk. While such maps are largely driven by local sample size, we also find differences due to features of the statistics themselves. We illustrate these concepts using two tests: Tango's index of clustering and the spatial scan statistic. Furthermore, assessments of the accuracy of the 'most likely cluster' involve not only statistical power, but also spatial accuracy in identifying the location of a true underlying cluster. We illustrate these concepts via induction of artificial clusters within the observed incidence of severe cardiac birth defects in Santa Clara County, CA in 1981.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • California / epidemiology
  • Cardiovascular Abnormalities / epidemiology
  • Cluster Analysis*
  • Computer Simulation
  • Congenital Abnormalities / epidemiology
  • Data Interpretation, Statistical*
  • Disease Outbreaks*
  • Humans
  • Infant, Newborn
  • Population Dynamics
  • Water Pollutants, Chemical / adverse effects

Substances

  • Water Pollutants, Chemical