Immediate postnatal prediction of death or bronchopulmonary dysplasia among very preterm and very low birth weight infants based on gradient boosting decision trees algorithm: A nationwide database study in Japan

PLoS One. 2024 Mar 27;19(3):e0300817. doi: 10.1371/journal.pone.0300817. eCollection 2024.

Abstract

Introduction: Bronchopulmonary dysplasia (BPD) poses a substantial global health burden. Individualized treatment strategies based on early prediction of the development of BPD can mitigate preterm birth complications; however, previously suggested predictive models lack early postnatal applicability. We aimed to develop predictive models for BPD and mortality based on immediate postnatal clinical data.

Methods: Clinical information on very preterm and very low birth weight infants born between 2008 and 2018 was extracted from a nationwide Japanese database. The gradient boosting decision trees (GBDT) algorithm was adopted to predict BPD and mortality, using predictors within the first 6 h postpartum. We assessed the temporal validity and evaluated model adequacy using Shapley additive explanations (SHAP) values.

Results: We developed three predictive models using data from 39,488, 39,096, and 40,291 infants to predict "death or BPD," "death or severe BPD," and "death before discharge," respectively. These well-calibrated models achieved areas under the receiver operating characteristic curve of 0.828 (95% CI: 0.828-0.828), 0.873 (0.873-0.873), and 0.887 (0.887-0.888), respectively, outperforming the multivariable logistic regression models. SHAP value analysis identified predictors of BPD, including gestational age, size at birth, male sex, and persistent pulmonary hypertension. In SHAP value-based case clustering, the "death or BPD" prediction model stratified infants by gestational age and persistent pulmonary hypertension, whereas the other models for "death or severe BPD" and "death before discharge" commonly formed clusters of low mortality, extreme prematurity, low Apgar scores, and persistent pulmonary hypertension of the newborn.

Conclusions: GBDT models for predicting BPD and mortality, designed for use within 6 h postpartum, demonstrated superior prognostic performance. SHAP value-based clustering, a data-driven approach, formed clusters of clinical relevance. These findings suggest the efficacy of a GBDT algorithm for the early postnatal prediction of BPD.

MeSH terms

  • Bronchopulmonary Dysplasia* / complications
  • Bronchopulmonary Dysplasia* / diagnosis
  • Bronchopulmonary Dysplasia* / epidemiology
  • Decision Trees
  • Female
  • Gestational Age
  • Humans
  • Hypertension, Pulmonary* / complications
  • Infant
  • Infant, Extremely Premature
  • Infant, Newborn
  • Infant, Very Low Birth Weight
  • Japan / epidemiology
  • Pregnancy
  • Premature Birth*

Grants and funding

The author(s) received no specific funding for this work.