The Wally plot approach to assess the calibration of clinical prediction models

Lifetime Data Anal. 2019 Jan;25(1):150-167. doi: 10.1007/s10985-017-9414-3. Epub 2017 Dec 6.

Abstract

A prediction model is calibrated if, roughly, for any percentage x we can expect that x subjects out of 100 experience the event among all subjects that have a predicted risk of x%. Typically, the calibration assumption is assessed graphically but in practice it is often challenging to judge whether a "disappointing" calibration plot is the consequence of a departure from the calibration assumption, or alternatively just "bad luck" due to sampling variability. We propose a graphical approach which enables the visualization of how much a calibration plot agrees with the calibration assumption to address this issue. The approach is mainly based on the idea of generating new plots which mimic the available data under the calibration assumption. The method handles the common non-trivial situations in which the data contain censored observations and occurrences of competing events. This is done by building on ideas from constrained non-parametric maximum likelihood estimation methods. Two examples from large cohort data illustrate our proposal. The 'wally' R package is provided to make the methodology easily usable.

Keywords: Censoring; Competing risks; Model validation; Prediction modeling; Residual plot; Survival analysis.

Publication types

  • Review

MeSH terms

  • Algorithms
  • Calibration*
  • Computer Simulation*
  • Data Analysis
  • Dementia / diagnosis
  • Dementia / mortality*
  • Dementia / therapy
  • Humans
  • Kidney Transplantation / methods
  • Kidney Transplantation / mortality*
  • Models, Statistical*
  • Predictive Value of Tests*
  • Sensitivity and Specificity
  • Survival Analysis