The Wally plot approach to assess the calibration of clinical prediction models

Paul Blanche; Thomas A Gerds; Claus T Ekstrøm

doi:10.1007/s10985-017-9414-3

The Wally plot approach to assess the calibration of clinical prediction models

Lifetime Data Anal. 2019 Jan;25(1):150-167. doi: 10.1007/s10985-017-9414-3. Epub 2017 Dec 6.

Authors

Paul Blanche¹, Thomas A Gerds², Claus T Ekstrøm²

Affiliations

¹ LMBA, University of South Brittany, Vannes, France. paul.blanche@univ-ubs.fr.
² Department of biostatistics, University of Copenhagen, Copenhagen, Denmark.

PMID: 29214550
DOI: 10.1007/s10985-017-9414-3

Abstract

A prediction model is calibrated if, roughly, for any percentage x we can expect that x subjects out of 100 experience the event among all subjects that have a predicted risk of x%. Typically, the calibration assumption is assessed graphically but in practice it is often challenging to judge whether a "disappointing" calibration plot is the consequence of a departure from the calibration assumption, or alternatively just "bad luck" due to sampling variability. We propose a graphical approach which enables the visualization of how much a calibration plot agrees with the calibration assumption to address this issue. The approach is mainly based on the idea of generating new plots which mimic the available data under the calibration assumption. The method handles the common non-trivial situations in which the data contain censored observations and occurrences of competing events. This is done by building on ideas from constrained non-parametric maximum likelihood estimation methods. Two examples from large cohort data illustrate our proposal. The 'wally' R package is provided to make the methodology easily usable.

Keywords: Censoring; Competing risks; Model validation; Prediction modeling; Residual plot; Survival analysis.

Publication types

Review

MeSH terms

Algorithms
Calibration*
Computer Simulation*
Data Analysis
Dementia / diagnosis
Dementia / mortality*
Dementia / therapy
Humans
Kidney Transplantation / methods
Kidney Transplantation / mortality*
Models, Statistical*
Predictive Value of Tests*
Sensitivity and Specificity
Survival Analysis