A machine learning model for identifying systemic lupus erythematosus through laboratory information system and electronic medical record

Clin Exp Rheumatol. 2024 Mar;42(3):702-712. doi: 10.55563/clinexprheumatol/jvdrpc. Epub 2023 Nov 15.

Abstract

Objectives: Systemic lupus erythematosus (SLE) is a heterogeneous autoimmune disease. Its diagnosis is challenging, especially at early stages and in atypical cases. The aim of this study was to develop a machine learning model based on common laboratory tests to aid SLE diagnosis.

Methods: A standard protocol was developed to collect data from patients with SLE and from controls with other immune diseases. Ten-fold cross-validation was performed on the modeling dataset (n=862), and an external dataset (n=198) was used for model validation. Machine learning algorithms were applied to construct a diagnostic model. Performance was evaluated using the area under the curve (AUC), F1-score, negative predictive value, positive predictive value, accuracy, sensitivity, and specificity.
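
The threshold-based metrics listed in the Methods all derive from the four cells of a binary confusion matrix. The following is a minimal sketch of those standard definitions, not the authors' code; the function name `diagnostic_metrics` and the example counts are hypothetical and chosen only for illustration.

```python
import math


def _ratio(numerator, denominator):
    """Return numerator/denominator, or NaN when the denominator is zero."""
    return numerator / denominator if denominator else math.nan


def diagnostic_metrics(tp, fp, tn, fn):
    """Compute standard diagnostic metrics from confusion-matrix counts.

    tp/fn: SLE cases classified as SLE / as control.
    tn/fp: controls classified as control / as SLE.
    """
    return {
        "sensitivity": _ratio(tp, tp + fn),        # true positive rate (recall)
        "specificity": _ratio(tn, tn + fp),        # true negative rate
        "ppv": _ratio(tp, tp + fp),                # positive predictive value
        "npv": _ratio(tn, tn + fn),                # negative predictive value
        "accuracy": _ratio(tp + tn, tp + fp + tn + fn),
        "f1": _ratio(2 * tp, 2 * tp + fp + fn),    # harmonic mean of PPV and sensitivity
    }


if __name__ == "__main__":
    # Hypothetical counts, not taken from the study.
    for name, value in diagnostic_metrics(tp=40, fp=10, tn=45, fn=5).items():
        print(f"{name}: {value:.4f}")
```

Unlike the AUC, each of these values depends on the probability cut-off used to turn model scores into SLE/control labels.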

Results: The optimal model was based on a random forest algorithm with 10 clinical features. Thrombin time, prothrombin activity, and uric acid contributed most to the diagnostic model. The SLE diagnostic model showed sufficient predictive accuracy, with an AUC of 0.8286 in the validation dataset.
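
For readers less familiar with the AUC, it can be read as the probability that a randomly chosen SLE case receives a higher model score than a randomly chosen control, with ties counted as one half. The sketch below computes it directly from that pairwise definition; it is an illustration only, the function name `roc_auc` and the toy scores are hypothetical, and it does not reproduce the study's data or pipeline.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison of scores.

    labels: iterable of 1 (case) or 0 (control).
    scores: iterable of model scores, higher meaning more likely a case.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one case and one control")

    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0      # case ranked above control
            elif p == n:
                wins += 0.5      # tie counts as half
    return wins / (len(positives) * len(negatives))


if __name__ == "__main__":
    # Toy example, not study data.
    print(roc_auc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1]))
```

The pairwise loop is quadratic in the number of samples, which is fine for datasets of the size reported here; rank-based formulas give the same value more efficiently.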

Conclusions: Our diagnostic model based on 10 common laboratory tests identified patients with SLE with high accuracy. An online version of the model could potentially be applied in clinical settings for the differential diagnosis of SLE.

MeSH terms

  • Algorithms
  • Clinical Laboratory Information Systems*
  • Electronic Health Records
  • Humans
  • Lupus Erythematosus, Systemic* / diagnosis
  • Machine Learning