Machine learning-based prediction model for responses of bDMARDs in patients with rheumatoid arthritis and ankylosing spondylitis

Seulkee Lee; Seonyoung Kang; Yeonghee Eun; Hong-Hee Won; Hyungjin Kim; Jaejoon Lee; Eun-Mi Koh; Hoon-Suk Cha

doi:10.1186/s13075-021-02635-3

Machine learning-based prediction model for responses of bDMARDs in patients with rheumatoid arthritis and ankylosing spondylitis

Arthritis Res Ther. 2021 Oct 9;23(1):254. doi: 10.1186/s13075-021-02635-3.

Authors

Seulkee Lee¹, Seonyoung Kang¹, Yeonghee Eun¹, Hong-Hee Won², Hyungjin Kim¹, Jaejoon Lee¹, Eun-Mi Koh¹, Hoon-Suk Cha³

Affiliations

¹ Department of Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea.
² Samsung Advanced Institute for Health Sciences & Technology (SAIHST), Sungkyunkwan University, Samsung Medical Center, Seoul, Republic of Korea.
³ Department of Medicine, Samsung Medical Center, Sungkyunkwan University School of Medicine, 81 Irwon-ro, Gangnam-gu, Seoul, 06351, Republic of Korea. hoonsuk.cha@samsung.com.

Abstract

Background: Few studies on rheumatoid arthritis (RA) have generated machine learning models to predict biologic disease-modifying antirheumatic drugs (bDMARDs) responses; however, these studies included insufficient analysis on important features. Moreover, machine learning is yet to be used to predict bDMARD responses in ankylosing spondylitis (AS). Thus, in this study, machine learning was used to predict such responses in RA and AS patients.

Methods: Data were retrieved from the Korean College of Rheumatology Biologics therapy (KOBIO) registry. The number of RA and AS patients in the training dataset were 625 and 611, respectively. We prepared independent test datasets that did not participate in any process of generating machine learning models. Baseline clinical characteristics were used as input features. Responders were defined as those who met the ACR 20% improvement response criteria (ACR20) and ASAS 20% improvement response criteria (ASAS20) in RA and AS, respectively, at the first follow-up. Multiple machine learning methods, including random forest (RF-method), were used to generate models to predict bDMARD responses, and we compared them with the logistic regression model.

Results: The RF-method model had superior prediction performance to logistic regression model (accuracy: 0.726 [95% confidence interval (CI): 0.725-0.730] vs. 0.689 [0.606-0.717], area under curve (AUC) of the receiver operating characteristic curve (ROC) 0.638 [0.576-0.658] vs. 0.565 [0.493-0.605], F1 score 0.841 [0.837-0.843] vs. 0.803 [0.732-0.828], AUC of the precision-recall curve 0.808 [0.763-0.829] vs. 0.754 [0.714-0.789]) with independent test datasets in patients with RA. However, machine learning and logistic regression exhibited similar prediction performance in AS patients. Furthermore, the patient self-reporting scales, which are patient global assessment of disease activity (PtGA) in RA and Bath Ankylosing Spondylitis Functional Index (BASFI) in AS, were revealed as the most important features in both diseases.

Conclusions: RF-method exhibited superior prediction performance for responses of bDMARDs to a conventional statistical method, i.e., logistic regression, in RA patients. In contrast, despite the comparable size of the dataset, machine learning did not outperform in AS patients. The most important features of both diseases, according to feature importance analysis were patient self-reporting scales.

Keywords: Ankylosing spondylitis; Machine learning; Rheumatoid arthritis; TNFi.

MeSH terms

Antirheumatic Agents* / therapeutic use
Arthritis, Rheumatoid* / diagnosis
Arthritis, Rheumatoid* / drug therapy
Humans
Machine Learning
Rheumatology*
Spondylitis, Ankylosing* / diagnosis
Spondylitis, Ankylosing* / drug therapy

Substances

Antirheumatic Agents