Explainable machine learning model for predicting the occurrence of postoperative malnutrition in children with congenital heart disease

Clin Nutr. 2022 Jan;41(1):202-210. doi: 10.1016/j.clnu.2021.11.006. Epub 2021 Nov 10.

Abstract

Background & aims: Malnutrition is persistent in 50%-75% of children with congenital heart disease (CHD) after surgery, and early prediction is crucial for nutritional intervention. The aim of this study was to develop and validate machine learning (ML) models to predict the malnutrition status of children with CHD. We used explainable ML methods to provide insight into the model's predictions and outcomes.

Methods: This prospective cohort study included consecutive children with CHD admitted to the hospital from December 2017 to May 2020. The cohort data were divided into the training and test data sets based on the follow-up time. The outcome of the study was CHD child malnutrition 1 year after surgery, the primary outcome was an underweight status, and the secondary outcomes were stunted and wasting status. We used five ML algorithms with multiple features to construct prediction models, and the performance of these ML models was measured by an area under the receiver operating characteristic curve (AUC) analysis. We also used the permutation importance and SHapley Additive exPlanations (SHAP) to determine the importance of the selected features and interpret the ML models.

Results: We enrolled 536 children with CHD who underwent complete repair. The proportions of children with an underweight, stunted, or wasting status 1 year after surgery were 18.1% (97/536), 12.1% (65/536), and 17.5% (94/536), respectively. All patients contributed to the generation of 115 useable features, which allowed us to build models to predict malnutrition. Five prediction algorithms were used, and the XGBoost model achieved the greatest AUC in all outcomes. The results obtained from the permutation importance and SHAP analyses showed that the 1-month postoperative WAZ-score, discharge WAZ score and preoperative WAZ score were the top 3 important features in predicting an underweight status in the XGBoost algorithm. Regarding the stunted status, the top 3 important features were the 1-month postoperative HAZ score, discharge HAZ score, and aortic clamping time. Regarding the wasting status, the top 3 important features were the hospital length of stay, formula intake, and discharge WHZ-score. We also used a narrative case report as an example to describe the clinical manifestations and predicted the primary outcomes of two children.

Conclusions: We developed an ML model (XGBoost) that provides accurate early predictions of malnutrition 1-year postoperatively in children with CHD. Because the ML model is explainable, it may better enable clinicians to better understand the reasoning underlying the outcome. Our study could aid in determining individual treatment and nutritional follow-up strategies for children with CHD.

Keywords: Children; Congenital heart disease; Interpretability; Machine learning; Malnutrition.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms
  • Clinical Decision Rules*
  • Female
  • Heart Defects, Congenital / physiopathology*
  • Heart Defects, Congenital / surgery
  • Humans
  • Infant
  • Machine Learning / standards*
  • Male
  • Malnutrition / diagnosis*
  • Malnutrition / etiology
  • Postoperative Complications / diagnosis*
  • Postoperative Complications / etiology
  • Postoperative Period
  • Predictive Value of Tests
  • Prospective Studies
  • ROC Curve