Development of a peripheral blood transcriptomic gene signature to predict bronchopulmonary dysplasia

Am J Physiol Lung Cell Mol Physiol. 2023 Jan 1;324(1):L76-L87. doi: 10.1152/ajplung.00250.2022. Epub 2022 Dec 6.

Abstract

Bronchopulmonary dysplasia (BPD) is the most common lung disease of extreme prematurity, yet mechanisms that associate with or identify neonates with increased susceptibility for BPD are largely unknown. Combining artificial intelligence with gene expression data is a novel approach that may assist in better understanding mechanisms underpinning chronic lung disease and in stratifying patients at greater risk for BPD. The objective of this study is to develop an early peripheral blood transcriptomic signature that can predict preterm neonates at risk for developing BPD. Secondary analysis of whole blood microarray data from 97 very low birth weight neonates on day of life 5 was performed. BPD was defined as positive pressure ventilation or oxygen requirement at 28 days of age. Participants were randomly assigned to a training (70%) and testing cohort (30%). Four gene-centric machine learning models were built, and their discriminatory abilities were compared with gestational age or birth weight. This study adheres to the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD) statement. Neonates with BPD (n = 62 subjects) exhibited a lower median gestational age (26.0 wk vs. 30.0 wk, P < 0.01) and birth weight (800 g vs. 1,280 g, P < 0.01) compared with non-BPD neonates. From an initial pool (33,252 genes/patient), 4,523 genes exhibited a false discovery rate (FDR) <1%. The area under the receiver operating characteristic curve (AUC) for predicting BPD utilizing gestational age or birth weight was 87.8% and 87.2%, respectively. The machine learning models, using a combination of five genes, revealed AUCs ranging between 85.8% and 96.1%. Pathways integral to T cell development and differentiation were associated with BPD. A derived five-gene whole blood signature can accurately predict BPD in the first week of life.

Keywords: bioinformatics; bronchopulmonary dysplasia; machine learning; prediction; whole microarray.

Publication types

  • Randomized Controlled Trial
  • Research Support, N.I.H., Extramural
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Artificial Intelligence
  • Birth Weight
  • Bronchopulmonary Dysplasia* / diagnosis
  • Bronchopulmonary Dysplasia* / genetics
  • Gestational Age
  • Humans
  • Infant, Newborn
  • Infant, Premature
  • Transcriptome / genetics