Generating Health Estimates by Zip Code: A Semiparametric Small Area Estimation Approach Using the California Health Interview Survey

Am J Public Health. 2015 Dec;105(12):2534-40. doi: 10.2105/AJPH.2015.302810.

Abstract

Objectives: We propose a method to meet challenges in generating health estimates for granular geographic areas in which the survey sample size is extremely small.

Methods: Our generalized linear mixed model predicts health outcomes using both individual-level and neighborhood-level predictors. The model's feature of nonparametric smoothing function on neighborhood-level variables better captures the association between neighborhood environment and the outcome. Using 2011 to 2012 data from the California Health Interview Survey, we demonstrate an empirical application of this method to estimate the fraction of residents without health insurance for Zip Code Tabulation Areas (ZCTAs).

Results: Our method generated stable estimates of uninsurance for 1519 of 1765 ZCTAs (86%) in California. For some areas with great socioeconomic diversity across adjacent neighborhoods, such as Los Angeles County, the modeled uninsured estimates revealed much heterogeneity among geographically adjacent ZCTAs.

Conclusions: The proposed method can increase the value of health surveys by providing modeled estimates for health data at a granular geographic level. It can account for variations in health outcomes at the neighborhood level as a result of both socioeconomic characteristics and geographic locations.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • California / epidemiology
  • Health Status
  • Health Surveys / methods*
  • Humans
  • Interviews as Topic
  • Models, Statistical
  • Reproducibility of Results
  • Statistics as Topic