Machine Learning Classifier Models Can Identify Acute Respiratory Distress Syndrome Phenotypes Using Readily Available Clinical Data

Am J Respir Crit Care Med. 2020 Oct 1;202(7):996-1004. doi: 10.1164/rccm.202002-0347OC.

Abstract

Rationale: Two distinct phenotypes of acute respiratory distress syndrome (ARDS) with differential clinical outcomes and responses to randomly assigned treatment have consistently been identified in randomized controlled trial cohorts using latent class analysis. Plasma biomarkers, key components in phenotype identification, currently lack point-of-care assays and represent a barrier to the clinical implementation of phenotypes.Objectives: The objective of this study was to develop models to classify ARDS phenotypes using readily available clinical data only.Methods: Three randomized controlled trial cohorts served as the training data set (ARMA [High vs. Low Vt], ALVEOLI [Assessment of Low Vt and Elevated End-Expiratory Pressure to Obviate Lung Injury], and FACTT [Fluids and Catheter Treatment Trial]; n = 2,022), and a fourth served as the validation data set (SAILS [Statins for Acutely Injured Lungs from Sepsis]; n = 745). A gradient-boosted machine algorithm was used to develop classifier models using 24 variables (demographics, vital signs, laboratory, and respiratory variables) at enrollment. In two secondary analyses, the ALVEOLI and FACTT cohorts each, individually, served as the validation data set, and the remaining combined cohorts formed the training data set for each analysis. Model performance was evaluated against the latent class analysis-derived phenotype.Measurements and Main Results: For the primary analysis, the model accurately classified the phenotypes in the validation cohort (area under the receiver operating characteristic curve [AUC], 0.95; 95% confidence interval [CI], 0.94-0.96). Using a probability cutoff of 0.5 to assign class, inflammatory biomarkers (IL-6, IL-8, and sTNFR-1; P < 0.0001) and 90-day mortality (38% vs. 24%; P = 0.0002) were significantly higher in the hyperinflammatory phenotype as classified by the model. Model accuracy was similar when ALVEOLI (AUC, 0.94; 95% CI, 0.92-0.96) and FACTT (AUC, 0.94; 95% CI, 0.92-0.95) were used as the validation cohorts. Significant treatment interactions were observed with the clinical classifier model-assigned phenotypes in both ALVEOLI (P = 0.0113) and FACTT (P = 0.0072) cohorts.Conclusions: ARDS phenotypes can be accurately identified using machine learning models based on readily available clinical data and may enable rapid phenotype identification at the bedside.

Keywords: ARDS phenotypes; classifier models; machine learning.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Age Factors
  • Area Under Curve
  • Bicarbonates / metabolism
  • Bilirubin / metabolism
  • Biomarkers, Tumor
  • Blood Pressure
  • Carbon Dioxide / metabolism
  • Creatinine / metabolism
  • Humans
  • Inflammation
  • Intercellular Adhesion Molecule-1 / metabolism
  • Interleukin-6 / metabolism
  • Interleukin-8 / metabolism
  • Latent Class Analysis
  • Leukocyte Count
  • Machine Learning*
  • Mortality
  • Oxygen / metabolism
  • Partial Pressure
  • Phenotype
  • Plasminogen Activator Inhibitor 1 / metabolism
  • Platelet Count
  • Prognosis
  • Protein C / metabolism
  • Pulmonary Ventilation
  • Randomized Controlled Trials as Topic
  • Receptors, Tumor Necrosis Factor, Type I / metabolism
  • Respiratory Distress Syndrome / classification*
  • Respiratory Distress Syndrome / immunology
  • Respiratory Distress Syndrome / physiopathology
  • Respiratory Distress Syndrome / therapy
  • Serum Albumin / metabolism
  • Tidal Volume
  • Vasoconstrictor Agents / therapeutic use
  • Vital Signs

Substances

  • Bicarbonates
  • Biomarkers, Tumor
  • CXCL8 protein, human
  • ICAM1 protein, human
  • IL6 protein, human
  • Interleukin-6
  • Interleukin-8
  • Plasminogen Activator Inhibitor 1
  • Protein C
  • Receptors, Tumor Necrosis Factor, Type I
  • SERPINE1 protein, human
  • Serum Albumin
  • Vasoconstrictor Agents
  • human epithelial antigen-125
  • Intercellular Adhesion Molecule-1
  • Carbon Dioxide
  • Creatinine
  • Bilirubin
  • Oxygen