Using Multilevel Regression and Poststratification to Estimate Physical Activity Levels from Health Surveys

Int J Environ Res Public Health. 2021 Jul 13;18(14):7477. doi: 10.3390/ijerph18147477.

Abstract

Background: Large-scale health surveys often consider sociodemographic characteristics and several health indicators influencing physical activity that often vary across subpopulations. Data in a survey for some small subpopulations are often not representative of the larger population.

Objective: We developed a multilevel regression and poststratification (MRP) model to estimate leisure-time physical activity across Brazilian state capitals and evaluated whether the MRP outperforms single-level regression estimates based on the Brazilian cross-sectional national survey VIGITEL (2018).

Methods: We used various approaches to compare the MRP and single-level model (complete-pooling) estimates, including cross-validation with various subsample proportions tested.

Results: MRP consistently had predictions closer to the estimation target than single-level regression estimations. The mean absolute errors were smaller for the MRP estimates than single-level regression estimates with smaller sample sizes. MRP presented substantially smaller uncertainty estimates compared to single-level regression estimates. Overall, the MRP was superior to single-level regression estimates, particularly with smaller sample sizes, yielding smaller errors and more accurate estimates.

Conclusion: The MRP is a promising strategy to predict subpopulations' physical activity indicators from large surveys. The observations present in this study highlight the need for further research, which could, potentially, incorporate more information in the models to better interpret interactions and types of activities across target populations.

Keywords: Bayesian analysis; public health surveillance; selection bias; statistical models; survey methods.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cross-Sectional Studies
  • Exercise*
  • Health Surveys
  • Humans
  • Multilevel Analysis
  • Sample Size