Calibration Drift Among Regression and Machine Learning Models for Hospital Mortality

AMIA Annu Symp Proc. 2018 Apr 16:2017:625-634. eCollection 2017.

Abstract

Advanced regression and machine learning models can provide personalized risk predictions to support clinical decision-making. We aimed to understand whether modeling methods impact the tendency of calibration to deteriorate as patient populations shift over time, with the goal of informing model updating practices. We developed models for 30-day hospital mortality using seven common regression and machine learning methods. Models were developed on 2006 admissions to Department of Veterans Affairs hospitals and validated on admissions in 2007-2013. All models maintained discrimination. Calibration was stable for the neural network model and declined for all other models. The L-2 penalized logistic regression and random forest models experienced smaller magnitudes of calibration drift than the other regression models. Calibration drift was linked with a changing case mix rather than shifts in predictoroutcome associations or outcome rate. Model updating protocols will need to be tailored to variations in calibration drift across methods.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Adult
  • Area Under Curve
  • Calibration
  • Female
  • Hospital Mortality*
  • Hospitals, Veterans
  • Humans
  • Logistic Models
  • Machine Learning*
  • Male
  • Models, Statistical*
  • Neural Networks, Computer*
  • Proportional Hazards Models
  • Risk
  • United States
  • United States Department of Veterans Affairs