A MACHINE LEARNING MODEL DERIVED FROM ANALYSIS OF TIME-COURSE GENE-EXPRESSION DATASETS REVEALS TEMPORALLY STABLE GENE MARKERS PREDICTIVE OF SEPSIS MORTALITY

Shock. 2023 Nov 1;60(5):671-677. doi: 10.1097/SHK.0000000000002226. Epub 2023 Sep 23.

Abstract

Sepsis is associated with significant mortality and morbidity among critically ill patients admitted to intensive care units and represents a major health challenge globally. Given the significant clinical and biological heterogeneity among patients and the dynamic nature of the host immune response, identifying those at high risk of poor outcomes remains a critical challenge. Here, we performed secondary analysis of publicly available time-series gene-expression datasets from peripheral blood of patients admitted to the intensive care unit to elucidate temporally stable gene-expression markers between sepsis survivors and nonsurvivors. Using a limited set of genes that were determined to be temporally stable, we derived a dynamical model using a Support Vector Machine classifier to accurately predict the mortality of sepsis patients. Our model had robust performance in a test dataset, where patients' transcriptome was sampled at alternate time points, with an area under the curve of 0.89 (95% CI, 0.82-0.96) upon 5-fold cross-validation. We also identified 7 potential biomarkers of sepsis mortality (STAT5A, CX3CR1, LCP1, SNRPG, RPS27L, LSM5, SHCBP1) that require future validation. Pending prospective testing, our model may be used to identify sepsis patients with high risk of mortality accounting for the dynamic nature of the disease and with potential therapeutic implications.

MeSH terms

  • Biomarkers
  • Humans
  • Intensive Care Units
  • Machine Learning
  • Prospective Studies
  • Sepsis*
  • Shc Signaling Adaptor Proteins / genetics
  • Transcriptome
  • snRNP Core Proteins / genetics

Substances

  • Biomarkers
  • SNRPG protein, human
  • snRNP Core Proteins
  • SHCBP1 protein, human
  • Shc Signaling Adaptor Proteins