Machine Learning Can Predict the Probability of Biologic Therapy in Patients with Inflammatory Bowel Disease

J Clin Med. 2022 Aug 5;11(15):4586. doi: 10.3390/jcm11154586.

Abstract

Background: Inflammatory bowel disease (IBD) is of high medical and socioeconomic relevance. Moderate and severe disease courses often require treatment with biologics. The aim of this study was to evaluate machine learning (ML)-based methods for the prediction of biologic therapy in IBD patients using a large prescription database.

Methods: The present retrospective cohort study utilized a longitudinal prescription database (LRx). Patients with at least one prescription for an intestinal anti-inflammatory agent from a gastroenterologist between January 2015 and July 2021 were included. Patients who had received an initial biologic therapy prescription (infliximab, adalimumab, golimumab, vedolizumab, or ustekinumab) were categorized as the "biologic group". The potential predictors included in the machine learning-based models were age, sex, and the 100 most frequently prescribed drugs within 12 months prior to the index date. Six machine learning-based methods were used for the prediction of biologic therapy.

Results: A total of 122,089 patients were included in this study. Of these, 15,824 (13.0%) received at least one prescription for a biologic drug. The Light Gradient Boosting Machine had the best performance (accuracy = 74%) and was able to correctly identify 78.5% of the biologics patients and 72.6% of the non-biologics patients in the testing dataset. The most important variable was prednisolone, followed by lower age, mesalazine, budesonide, and ferric iron.

Conclusions: In summary, this study reveals the advantages of ML-based models in predicting biologic therapy in IBD patients based on pre-treatment and demographic variables. There is a need for further studies in this regard that take into account individual patient characteristics, i.e., genetics and gut microbiota, to adequately address the challenges of finding optimal treatment strategies for patients with IBD.

Keywords: Light Gradient Boosting Machine; biologics; inflammatory bowel disease; machine learning.

Grants and funding

No specific funding was received for the completion of this study. In general, work in the lab of T.L. was funded by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program through the ERC Consolidator Grant PhaseControl (Grant Agreement n° 771083). The lab of T.L. was further supported by German Cancer Aid (Deutsche Krebshilfe 110043 and a Mildred Scheel Professorship) and the German Research Foundation (SFB-TRR57/P06, LU 1360/3-1, CRC1380/A01, and CA 830/3-1).