Utilizing time series data embedded in electronic health records to develop continuous mortality risk prediction models using hidden Markov models: A sepsis case study

Stat Methods Med Res. 2020 Nov;29(11):3409-3423. doi: 10.1177/0962280220929045. Epub 2020 Jun 17.

Abstract

Continuous mortality risk monitoring is instrumental to manage a patient's care and to efficiently utilize the limited hospital resources. Due to incompleteness and irregularities of electronic health records (EHR), developing continuous mortality risk prediction using EHR data is a challenge. In this study, we propose a framework to continuously monitor mortality risk, and apply it to the real-world EHR data. The proposed method employs hidden Markov models (temporal technique) that take account of both the previous state of patient's health and the current value of clinical signs. Following the Sepsis-3 definition, we selected 3898 encounters of patients with suspected infection to compare the performance of temporal and non-temporal methods (Decision Tree (DT), Logistic Regression (LR), Naive Bayes (NB), Random Forest (RF), and Support Vector Machine (SVM)). The area under receiver operating characteristics (AUROC) curve, sensitivity, specificity and G-mean were used as performance measures. On the selected data, the AUROC of the proposed temporal framework (0.87) is 9-12% greater than the nontemporal methods (DT: 0.78, NB: 0.79, SVM: 0.79, LR: 0.80 and RF: 0.80). The results also show that our model (G-mean=0.78) provides a better balance between sensitivity and specificity compared to clinically acceptable bed-side criteria (G-mean=0.71). The proposed framework leverages the longitudinal data available in EHR and performs better than the non-temporal methods. The proposed method facilitates information related to the time of change of the patient's health that may help practitioners to plan early and develop effective treatment strategies.

Keywords: Statistical models; decision support system; hidden Markov model; infection; sepsis; time series model.

MeSH terms

  • Bayes Theorem
  • Electronic Health Records*
  • Humans
  • Logistic Models
  • Machine Learning
  • Sepsis*