A Bayesian Approach to Account for Misclassification and Overdispersion in Count Data

Int J Environ Res Public Health. 2015 Aug 28;12(9):10648-61. doi: 10.3390/ijerph120910648.

Abstract

Count data are subject to considerable sources of what is often referred to as non-sampling error. Errors such as misclassification, measurement error and unmeasured confounding can lead to substantially biased estimators. It is strongly recommended that epidemiologists not only acknowledge these sorts of errors in data, but incorporate sensitivity analyses into part of the total data analysis. We extend previous work on Poisson regression models that allow for misclassification by thoroughly discussing the basis for the models and allowing for extra-Poisson variability in the form of random effects. Via simulation we show the improvements in inference that are brought about by accounting for both the misclassification and the overdispersion.

Keywords: count data; misclassification; overdispersion.

MeSH terms

  • Bayes Theorem*
  • Epidemiologic Methods*
  • Models, Theoretical*
  • Poisson Distribution*