A random-censoring Poisson model for underreported data

Stat Med. 2017 Dec 30;36(30):4873-4892. doi: 10.1002/sim.7456. Epub 2017 Oct 24.

Abstract

A major challenge when monitoring risks in socially deprived areas of under developed countries is that economic, epidemiological, and social data are typically underreported. Thus, statistical models that do not take the data quality into account will produce biased estimates. To deal with this problem, counts in suspected regions are usually approached as censored information. The censored Poisson model can be considered, but all censored regions must be precisely known a priori, which is not a reasonable assumption in most practical situations. We introduce the random-censoring Poisson model (RCPM) which accounts for the uncertainty about both the count and the data reporting processes. Consequently, for each region, we will be able to estimate the relative risk for the event of interest as well as the censoring probability. To facilitate the posterior sampling process, we propose a Markov chain Monte Carlo scheme based on the data augmentation technique. We run a simulation study comparing the proposed RCPM with 2 competitive models. Different scenarios are considered. RCPM and censored Poisson model are applied to account for potential underreporting of early neonatal mortality counts in regions of Minas Gerais State, Brazil, where data quality is known to be poor.

Keywords: Bayesian inference; censoring; data augmentation; infant mortality; underreporting.

MeSH terms

  • Algorithms
  • Bayes Theorem
  • Biostatistics
  • Brazil / epidemiology
  • Computer Simulation
  • Humans
  • Infant
  • Infant Mortality
  • Infant, Newborn
  • Markov Chains
  • Models, Statistical*
  • Monte Carlo Method
  • Poisson Distribution*
  • Probability