Raman spectroscopy combined with machine learning algorithms for rapid detection Primary Sjögren's syndrome associated with interstitial lung disease

Photodiagnosis Photodyn Ther. 2022 Dec:40:103057. doi: 10.1016/j.pdpdt.2022.103057. Epub 2022 Aug 6.

Abstract

Background: Interstitial lung disease (ILD) is a major complication of Primary Sjögren's syndrome (pSS) patients.It is one of the main factors leading to death. The aim of this study is to evaluate the value of serum Raman spectroscopy combined with machine learning algorithms in the discriminatory diagnosis of patients with Primary Sjögren's syndrome associated with interstitial lung disease (pSS-ILD).

Methods: Raman spectroscopy was performed on the serum of 30 patients with pSS, 28 patients with pSS-ILD and 30 healthy controls (HC). First, the data were pre-processed using baseline correction, smoothing, outlier removal and normalization operations. Then principal component analysis (PCA) is used to reduce the dimension of data. Finally, support vector machine(SVM), k nearest neighbor (KNN) and random forest (RF) models are established for classification.

Results: In this study, SVM, KNN and RF were used as classification models, where SVM chooses polynomial kernel function (poly). The average accuracy, sensitivity, and precision of the three models were obtained after dimensionality reduction. The Accuracy of SVM (poly) was 5.71% higher than KNN and 6.67% higher than RF; Sensitivity was 5.79% higher than KNN and 8.56% higher than RF; Precision was 6.19% higher than KNN and 7.45% higher than RF. It can be seen that the SVM (poly) had better discriminative effect. In summary, SVM (poly) had a fine classification effect, and the average accuracy, sensitivity and precision of this model reached 89.52%, 91.27% and 89.52%, respectively, with an AUC value of 0.921.

Conclusions: This study demonstrates that serum RS combined with machine learning algorithms is a valuable tool for diagnosing patients with pSS-ILD. It has promising applications.

Keywords: Interstitial lung disease; Machine learning; Primary Sjögren's syndrome; Raman spectroscopy.

MeSH terms

  • Algorithms
  • Humans
  • Lung Diseases, Interstitial* / complications
  • Lung Diseases, Interstitial* / diagnosis
  • Lung Diseases, Interstitial* / drug therapy
  • Machine Learning
  • Photochemotherapy* / methods
  • Sjogren's Syndrome* / complications
  • Sjogren's Syndrome* / diagnosis
  • Spectrum Analysis, Raman
  • Support Vector Machine