Nonparametric discrete survival function estimation with uncertain endpoints using an internal validation subsample

Biometrics. 2015 Sep;71(3):772-81. doi: 10.1111/biom.12316. Epub 2015 Apr 27.

Abstract

When a true survival endpoint cannot be assessed for some subjects, an alternative endpoint that measures the true endpoint with error may be collected, which often occurs when obtaining the true endpoint is too invasive or costly. We develop an estimated likelihood function for the situation where we have both uncertain endpoints for all participants and true endpoints for only a subset of participants. We propose a nonparametric maximum estimated likelihood estimator of the discrete survival function of time to the true endpoint. We show that the proposed estimator is consistent and asymptotically normal. We demonstrate through extensive simulations that the proposed estimator has little bias compared to the naïve Kaplan-Meier survival function estimator, which uses only uncertain endpoints, and more efficient with moderate missingness compared to the complete-case Kaplan-Meier survival function estimator, which uses only available true endpoints. Finally, we apply the proposed method to a data set for estimating the risk of detecting Alzheimer's disease from the Alzheimer's Disease Neuroimaging Initiative.

Keywords: Measurement error; Missing data; Nonparametric survival analysis; Uncertain endpoints; Validation sample.

Publication types

  • Evaluation Study
  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Validation Study

MeSH terms

  • Alzheimer Disease / diagnosis*
  • Alzheimer Disease / mortality*
  • Computer Simulation
  • Data Interpretation, Statistical
  • Humans
  • Likelihood Functions*
  • Models, Statistical*
  • Neuroimaging / statistics & numerical data*
  • Prevalence
  • Reproducibility of Results
  • Risk Assessment / methods
  • Sample Size
  • Sensitivity and Specificity
  • Survival Analysis*