Nonparametric estimation of the cumulative incidence function under outcome misclassification using external validation data

Stat Med. 2019 Dec 20;38(29):5512-5527. doi: 10.1002/sim.8380. Epub 2019 Oct 24.

Abstract

Misclassification of outcomes or event types is common in health sciences research and can lead to serious bias when estimating the cumulative incidence functions in settings with competing risks. Recent work has shown how to estimate nonparametric cumulative incidence functions in the presence of nondifferential outcome misclassification when the misclassification probabilities are known. Here, we extend this approach to account for misclassification that is differential with respect to important predictors of the outcome using misclassification probabilities estimated from external validation data. Moreover, we propose a bootstrap approach in which the observations from both the main study data and the external validation study are resampled to allow the uncertainty in the misclassification probabilities to propagate through the analysis into the final confidence intervals, ensuring appropriate confidence interval coverage probabilities. The proposed estimator is shown to be uniformly consistent and simulation studies indicate that both the estimator and the standard error estimation approach perform well in finite samples. The methodology is applied to estimate the cumulative incidence of death and disengagement from HIV care in a large cohort of HIV infected individuals in sub-Saharan Africa, where a significant death underreporting issue leads to outcome misclassification. This analysis uses external validation data from a separate study conducted in the same country.

Keywords: competing risks; cumulative incidence; external validation data; misclassification.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Bias
  • Biostatistics
  • Computer Simulation
  • Confidence Intervals
  • HIV Infections / therapy
  • Humans
  • Incidence
  • Kenya / epidemiology
  • Models, Statistical*
  • Monte Carlo Method
  • Outcome Assessment, Health Care / classification
  • Outcome Assessment, Health Care / statistics & numerical data*
  • Statistics, Nonparametric
  • Validation Studies as Topic