Prediction of prognosis in immunoglobulin a nephropathy patients with focal crescent by machine learning

PLoS One. 2022 Mar 9;17(3):e0265017. doi: 10.1371/journal.pone.0265017. eCollection 2022.

Abstract

Background and objectives: Immunoglobulin a nephropathy (IgAN) is the most common primary glomerular disease in the world, with different clinical manifestations, varying severity of pathological changes, common complications of crescent formation in different proportions, and great individual heterogeneous in clinical outcomes. Therefore, we aim to develop a machine learning (ML) based predictive model for predicting the prognosis of IgAN with focal crescent formation and without obvious chronic renal lesions (glomerulosclerosis <25%).

Materials: We retrospectively reviewed biopsy-proven IgAN patients in our hospital and cooperative hospital from 2005 to 2017. The method of feature importance of random forest (RF) was applied to conduct feature exploration of feature variables to establish the characteristic variables that are closely related to the prognosis of focal crescent IgAN. Multiple ML algorithms were attempted to establish the prediction models. The area under the precision-recall curve (AUPRC) and the area under the receiver operating characteristic curve (AUROC) were applied to evaluate the predictive performance via three-fold cross validation (namely 2 training sets and 1 validation set).

Results: RF was used to screen the important features, the top three of which were baseline estimated glomerular filtration rate (eGFR), serum creatine and triglyceride. Ten important features were selected as important predictors for modeling on the basis of data-driven and medical selection, predictors include: age, baseline eGFR, serum creatine, serum triglycerides, complement 3(C3), proteinuria, mean arterial pressure (MAP) and Hematuria, crescents proportion of glomeruli, Global crescent proportion of glomeruli. In a variety of ML algorithms, the support vector machine (SVM) algorithm displayed better predictive performance, with Precision of 0.77, Recall of 0.77, F1-score of 0.73, accuracy of 0.77, AUROC of 79.57%, and AUPRC of 76.5%.

Conclusions: The SVM model is potentially useful for predicting the prognosis of IgAN patients with focal crescent shape and without obvious chronic renal lesions.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Creatine
  • Female
  • Glomerulonephritis, IGA* / pathology
  • Humans
  • Machine Learning
  • Male
  • Prognosis
  • Retrospective Studies

Substances

  • Creatine

Associated data

  • figshare/10.6084/m9.figshare.19127399
  • figshare/10.6084/m9.figshare.19127342

Grants and funding

The study was supported by the Research Project for Practice Development of National TCM Clinical Research Bases (Project No. JDZX2015202), the 2020 Guangdong Provincial Science and Technology Innovation Strategy Special FundGuangdong-Hong Kong-Macau Joint Lab(2020B1212030006), Industry Special of the State Administration of traditional Chinese Medicine(201407005) and Guangzhou University of Traditional Chinese Medicine double first-class and high-level university discipline collaborative innovation team project (2021xk66).