Discrete-time survival analysis in the critically ill: a deep learning approach using heterogeneous data

Hans-Christian Thorsen-Meyer; Davide Placido; Benjamin Skov Kaas-Hansen; Anna P Nielsen; Theis Lange; Annelaura B Nielsen; Palle Toft; Jens Schierbeck; Thomas Strøm; Piotr J Chmura; Marc Heimann; Kirstine Belling; Anders Perner; Søren Brunak

doi:10.1038/s41746-022-00679-6

Discrete-time survival analysis in the critically ill: a deep learning approach using heterogeneous data

NPJ Digit Med. 2022 Sep 14;5(1):142. doi: 10.1038/s41746-022-00679-6.

Authors

Hans-Christian Thorsen-Meyer^{1

2}, Davide Placido¹, Benjamin Skov Kaas-Hansen^{1

3

4}, Anna P Nielsen¹, Theis Lange⁴, Annelaura B Nielsen¹, Palle Toft^{5

6}, Jens Schierbeck^{5

6}, Thomas Strøm^{5

6

7}, Piotr J Chmura¹, Marc Heimann⁸, Kirstine Belling¹, Anders Perner², Søren Brunak⁹

Affiliations

¹ Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200, Copenhagen, Denmark.
² Department of Intensive Care, Rigshospitalet, Copenhagen University Hospital, DK-2100, Copenhagen, Denmark.
³ Clinical Pharmacology Unit, Zealand University Hospital, DK-4000, Roskilde, Denmark.
⁴ Department of Public Health, Section of Biostatistics, University of Copenhagen, DK-1014, Copenhagen, Denmark.
⁵ Department of Anaesthesiology and Intensive Care, Odense University Hospital, DK-5000, Odense, Denmark.
⁶ Department of Clinical Research, University of Southern Denmark, DK-5000, Odense, Denmark.
⁷ Department of Anaesthesia and Critical Care Medicine, Hospital Sønderjylland, University Hospital of Southern Denmark, Odense, Denmark.
⁸ Centre for IT, Medical Technology and Telephony Services, Capital Region of Denmark, DK-2100, Copenhagen, Denmark.
⁹ Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200, Copenhagen, Denmark. soren.brunak@cpr.ku.dk.

Abstract

Prediction of survival for patients in intensive care units (ICUs) has been subject to intense research. However, no models exist that embrace the multiverse of data in ICUs. It is an open question whether deep learning methods using automated data integration with minimal pre-processing of mixed data domains such as free text, medical history and high-frequency data can provide discrete-time survival estimates for individual ICU patients. We trained a deep learning model on data from patients admitted to ten ICUs in the Capital Region of Denmark and the Region of Southern Denmark between 2011 and 2018. Inspired by natural language processing we mapped the electronic patient record data to an embedded representation and fed the data to a recurrent neural network with a multi-label output layer representing the chance of survival at different follow-up times. We evaluated the performance using the time-dependent concordance index. In addition, we quantified and visualized the drivers of survival predictions using the SHAP methodology. We included 37,355 admissions of 29,417 patients in our study. Our deep learning models outperformed traditional Cox proportional-hazard models with concordance index in the ranges 0.72-0.73, 0.71-0.72, 0.71, and 0.69-0.70, for models applied at baseline 0, 24, 48, and 72 h, respectively. Deep learning models based on a combination of entity embeddings and survival modelling is a feasible approach to obtain individualized survival estimates in data-rich settings such as the ICU. The interpretable nature of the models enables us to understand the impact of the different data domains.

Abstract

Grants and funding