Prediction of coronary artery lesions in children with Kawasaki syndrome based on machine learning

BMC Pediatr. 2024 Mar 5;24(1):158. doi: 10.1186/s12887-024-04608-2.

Abstract

Objective: Kawasaki syndrome (KS) is an acute vasculitis that affects children < 5 years of age and leads to coronary artery lesions (CAL) in about 20-25% of untreated cases. Machine learning (ML) is a branch of artificial intelligence (AI) that integrates complex data sets on a large scale and uses huge data to predict future events. The purpose of the present study was to use ML to present the model for early risk assessment of CAL in children with KS by different algorithms.

Methods: A total of 158 children were enrolled from Women and Children's Hospital, Qingdao University, and divided into 70-30% as the training sets and the test sets for modeling and validation studies. There are several classifiers are constructed for models including the random forest (RF), the logistic regression (LR), and the eXtreme Gradient Boosting (XGBoost). Data preprocessing is analyzed before applying the classifiers to modeling. To avoid the problem of overfitting, the 5-fold cross validation method was used throughout all the data.

Results: The area under the curve (AUC) of the RF model was 0.925 according to the validation of the test set. The average accuracy was 0.930 (95% CI, 0.905 to 0.956). The AUC of the LG model was 0.888 and the average accuracy was 0.893 (95% CI, 0,837 to 0.950). The AUC of the XGBoost model was 0.879 and the average accuracy was 0.935 (95% CI, 0.891 to 0.980).

Conclusion: The RF algorithm was used in the present study to construct a prediction model for CAL effectively, with an accuracy of 0.930 and AUC of 0.925. The novel model established by ML may help guide clinicians in the initial decision to make a more aggressive initial anti-inflammatory therapy. Due to the limitations of external validation and regional population characteristics, additional research is required to initiate a further application in the clinic.

Keywords: Coronary artery lesions; Kawasaki syndrome; Machine learning; Random forest.

MeSH terms

  • Aggression
  • Artificial Intelligence
  • Child
  • Coronary Vessels / diagnostic imaging
  • Female
  • Humans
  • Machine Learning
  • Mucocutaneous Lymph Node Syndrome* / complications
  • Mucocutaneous Lymph Node Syndrome* / diagnosis