Algorithmic Identification of Treatment-Emergent Adverse Events From Clinical Notes Using Large Language Models: A Pilot Study in Inflammatory Bowel Disease

Anna L Silverman; Madhumita Sushil; Balu Bhasuran; Dana Ludwig; James Buchanan; Rebecca Racz; Mahalakshmi Parakala; Samer El-Kamary; Ohenewaa Ahima; Artur Belov; Lauren Choi; Monisha Billings; Yan Li; Nadia Habal; Qi Liu; Jawahar Tiwari; Atul J Butte; Vivek A Rudrapatna

doi:10.1002/cpt.3226

Algorithmic Identification of Treatment-Emergent Adverse Events From Clinical Notes Using Large Language Models: A Pilot Study in Inflammatory Bowel Disease

Clin Pharmacol Ther. 2024 Jun;115(6):1391-1399. doi: 10.1002/cpt.3226. Epub 2024 Mar 8.

Authors

Anna L Silverman^#^{1

2}, Madhumita Sushil^#³, Balu Bhasuran^#³, Dana Ludwig³, James Buchanan³, Rebecca Racz⁴, Mahalakshmi Parakala⁵, Samer El-Kamary⁴, Ohenewaa Ahima⁴, Artur Belov⁴, Lauren Choi⁴, Monisha Billings⁴, Yan Li⁴, Nadia Habal⁴, Qi Liu⁴, Jawahar Tiwari⁴, Atul J Butte^{3

6}, Vivek A Rudrapatna^{3

7}

Affiliations

¹ Division of Gastroenterology and Hepatology, Department of Medicine, Mayo Clinic, Phoenix, Arizona, USA.
² Department of Medicine, University of California, San Diego, La Jolla, California, USA.
³ Bakar Computational Health Sciences Institute, San Francisco, California, USA.
⁴ United States Food and Drug Administration, Silver Spring, Maryland, USA.
⁵ Department of Public Health, University of California, Berkeley, Berkeley, California, USA.
⁶ Center for Data-Driven Insights and Innovation, University of California Health, Oakland, California, USA.
⁷ Division of Gastroenterology and Hepatology, Department of Medicine, University of California, San Francisco, San Francisco, California, USA.

^# Contributed equally.

PMID: 38459719
PMCID: PMC11090709 (available on 2025-06-01)
DOI: 10.1002/cpt.3226

Abstract

Outpatient clinical notes are a rich source of information regarding drug safety. However, data in these notes are currently underutilized for pharmacovigilance due to methodological limitations in text mining. Large language models (LLMs) like Bidirectional Encoder Representations from Transformers (BERT) have shown progress in a range of natural language processing tasks but have not yet been evaluated on adverse event (AE) detection. We adapted a new clinical LLM, University of California - San Francisco (UCSF)-BERT, to identify serious AEs (SAEs) occurring after treatment with a non-steroid immunosuppressant for inflammatory bowel disease (IBD). We compared this model to other language models that have previously been applied to AE detection. We annotated 928 outpatient IBD notes corresponding to 928 individual patients with IBD for all SAE-associated hospitalizations occurring after treatment with a non-steroid immunosuppressant. These notes contained 703 SAEs in total, the most common of which was failure of intended efficacy. Out of eight candidate models, UCSF-BERT achieved the highest numerical performance on identifying drug-SAE pairs from this corpus (accuracy 88-92%, macro F1 61-68%), with 5-10% greater accuracy than previously published models. UCSF-BERT was significantly superior at identifying hospitalization events emergent to medication use (P < 0.01). LLMs like UCSF-BERT achieve numerically superior accuracy on the challenging task of SAE detection from clinical notes compared with prior methods. Future work is needed to adapt this methodology to improve model performance and evaluation using multicenter data and newer architectures like Generative pre-trained transformer (GPT). Our findings support the potential value of using large language models to enhance pharmacovigilance.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Adverse Drug Reaction Reporting Systems
Algorithms*
Data Mining / methods
Drug-Related Side Effects and Adverse Reactions / diagnosis
Electronic Health Records
Female
Hospitalization / statistics & numerical data
Humans
Immunosuppressive Agents* / adverse effects
Inflammatory Bowel Diseases* / drug therapy
Male
Natural Language Processing*
Pharmacovigilance*
Pilot Projects

Substances

Immunosuppressive Agents

Abstract

Publication types

MeSH terms

Substances

Grants and funding