Machine-learning-derived predictive score for early estimation of COVID-19 mortality risk in hospitalized patients

PLoS One. 2022 Sep 22;17(9):e0274171. doi: 10.1371/journal.pone.0274171. eCollection 2022.

Abstract

The clinical course of COVID-19 is highly variable. It is therefore essential to predict as early and accurately as possible the severity level of the disease in a COVID-19 patient who is admitted to the hospital. This means identifying the contributing factors of mortality and developing an easy-to-use score that could enable a fast assessment of the mortality risk using only information recorded at the hospitalization. A large database of adult patients with a confirmed diagnosis of COVID-19 (n = 15,628; with 2,846 deceased) admitted to Spanish hospitals between December 2019 and July 2020 was analyzed. By means of multiple machine learning algorithms, we developed models that could accurately predict their mortality. We used the information about classifiers' performance metrics and about importance and coherence among the predictors to define a mortality score that can be easily calculated using a minimal number of mortality predictors and yielded accurate estimates of the patient severity status. The optimal predictive model encompassed five predictors (age, oxygen saturation, platelets, lactate dehydrogenase, and creatinine) and yielded a satisfactory classification of survived and deceased patients (area under the curve: 0.8454 with validation set). These five predictors were additionally used to define a mortality score for COVID-19 patients at their hospitalization. This score is not only easy to calculate but also to interpret since it ranges from zero to eight, along with a linear increase in the mortality risk from 0% to 80%. A simple risk score based on five commonly available clinical variables of adult COVID-19 patients admitted to hospital is able to accurately discriminate their mortality probability, and its interpretation is straightforward and useful.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • COVID-19* / diagnosis
  • Creatinine
  • Hospital Mortality
  • Hospitalization
  • Humans
  • Lactate Dehydrogenases
  • Machine Learning
  • Retrospective Studies
  • Risk Assessment

Substances

  • Creatinine
  • Lactate Dehydrogenases

Grants and funding

The authors acknowledge the support provided by the Spanish Ministry of Science and Innovation (PID2020-119262RB-I00), the Generalitat Valenciana (AICO/2021/111), the UPV Research and Development Support Programme PAID-01-17, and the European Social Fund (ACIF/2018/165).