A generalizable and easy-to-use COVID-19 stratification model for the next pandemic via immune-phenotyping and machine learning

Front Immunol. 2024 Mar 27:15:1372539. doi: 10.3389/fimmu.2024.1372539. eCollection 2024.

Abstract

Introduction: The coronavirus disease 2019 (COVID-19) pandemic has affected billions of people worldwide, and the lessons learned need to be concluded to get better prepared for the next pandemic. Early identification of high-risk patients is important for appropriate treatment and distribution of medical resources. A generalizable and easy-to-use COVID-19 severity stratification model is vital and may provide references for clinicians.

Methods: Three COVID-19 cohorts (one discovery cohort and two validation cohorts) were included. Longitudinal peripheral blood mononuclear cells were collected from the discovery cohort (n = 39, mild = 15, critical = 24). The immune characteristics of COVID-19 and critical COVID-19 were analyzed by comparison with those of healthy volunteers (n = 16) and patients with mild COVID-19 using mass cytometry by time of flight (CyTOF). Subsequently, machine learning models were developed based on immune signatures and the most valuable laboratory parameters that performed well in distinguishing mild from critical cases. Finally, single-cell RNA sequencing data from a published study (n = 43) and electronic health records from a prospective cohort study (n = 840) were used to verify the role of crucial clinical laboratory and immune signature parameters in the stratification of COVID-19 severity.

Results: Patients with COVID-19 were determined with disturbed glucose and tryptophan metabolism in two major innate immune clusters. Critical patients were further characterized by significant depletion of classical dendritic cells (cDCs), regulatory T cells (Tregs), and CD4+ central memory T cells (Tcm), along with increased systemic interleukin-6 (IL-6), interleukin-12 (IL-12), and lactate dehydrogenase (LDH). The machine learning models based on the level of cDCs and LDH showed great potential for predicting critical cases. The model performances in severity stratification were validated in two cohorts (AUC = 0.77 and 0.88, respectively) infected with different strains in different periods. The reference limits of cDCs and LDH as biomarkers for predicting critical COVID-19 were 1.2% and 270.5 U/L, respectively.

Conclusion: Overall, we developed and validated a generalizable and easy-to-use COVID-19 severity stratification model using machine learning algorithms. The level of cDCs and LDH will assist clinicians in making quick decisions during future pandemics.

Keywords: COVID-19; classical dendritic cells; decision-making; lactate dehydrogenase; machine learning; mass cytometry by time of flight (CyTOF); severity stratification.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • COVID-19*
  • Humans
  • L-Lactate Dehydrogenase
  • Leukocytes, Mononuclear
  • Machine Learning
  • Pandemics
  • Prospective Studies
  • SARS-CoV-2

Substances

  • L-Lactate Dehydrogenase

Grants and funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was supported by the National Natural Science Foundation of China (82172126, 82372219), Capital’s Funds for Health Improvement and Research (2022-2G-40911), Natural Science Foundation of Beijing Municipality (7232215, M22036), the special fund of the Beijing Clinical Key Specialty Construction Program (2021), and Peking University Third Hospital Cohort Study Project (BYSYZD2022010).