» Articles » PMID: 38459719

Algorithmic Identification of Treatment-Emergent Adverse Events From Clinical Notes Using Large Language Models: A Pilot Study in Inflammatory Bowel Disease

Abstract

Outpatient clinical notes are a rich source of information regarding drug safety. However, data in these notes are currently underutilized for pharmacovigilance due to methodological limitations in text mining. Large language models (LLMs) like Bidirectional Encoder Representations from Transformers (BERT) have shown progress in a range of natural language processing tasks but have not yet been evaluated on adverse event (AE) detection. We adapted a new clinical LLM, University of California - San Francisco (UCSF)-BERT, to identify serious AEs (SAEs) occurring after treatment with a non-steroid immunosuppressant for inflammatory bowel disease (IBD). We compared this model to other language models that have previously been applied to AE detection. We annotated 928 outpatient IBD notes corresponding to 928 individual patients with IBD for all SAE-associated hospitalizations occurring after treatment with a non-steroid immunosuppressant. These notes contained 703 SAEs in total, the most common of which was failure of intended efficacy. Out of eight candidate models, UCSF-BERT achieved the highest numerical performance on identifying drug-SAE pairs from this corpus (accuracy 88-92%, macro F1 61-68%), with 5-10% greater accuracy than previously published models. UCSF-BERT was significantly superior at identifying hospitalization events emergent to medication use (P < 0.01). LLMs like UCSF-BERT achieve numerically superior accuracy on the challenging task of SAE detection from clinical notes compared with prior methods. Future work is needed to adapt this methodology to improve model performance and evaluation using multicenter data and newer architectures like Generative pre-trained transformer (GPT). Our findings support the potential value of using large language models to enhance pharmacovigilance.

Citing Articles

Large Language Models Outperform Traditional Natural Language Processing Methods in Extracting Patient-Reported Outcomes in Inflammatory Bowel Disease.

Patel P, Davis C, Ralbovsky A, Tinoco D, Williams C, Slatter S Gastro Hep Adv. 2025; 4(2):100563.

PMID: 39877865 PMC: 11772946. DOI: 10.1016/j.gastha.2024.10.003.


Large language models outperform traditional natural language processing methods in extracting patient-reported outcomes in IBD.

Patel P, Davis C, Ralbovsky A, Tinoco D, Williams C, Slatter S medRxiv. 2024; .

PMID: 39281744 PMC: 11398594. DOI: 10.1101/2024.09.05.24313139.


How Artificial Intelligence Will Transform Clinical Care, Research, and Trials for Inflammatory Bowel Disease.

Silverman A, Shung D, Stidham R, Kochhar G, Iacucci M Clin Gastroenterol Hepatol. 2024; 23(3):428-439.e4.

PMID: 38992406 PMC: 11719376. DOI: 10.1016/j.cgh.2024.05.048.


A comparative study of zero-shot inference with large language models and supervised modeling in breast cancer pathology classification.

Sushil M, Zack T, Mandair D, Zheng Z, Wali A, Yu Y Res Sq. 2024; .

PMID: 38405831 PMC: 10889046. DOI: 10.21203/rs.3.rs-3914899/v1.

References
1.
Chaparro M, Garre A, Ricart E, Iborra M, Mesonero F, Vera I . Short and long-term effectiveness and safety of vedolizumab in inflammatory bowel disease: results from the ENEIDA registry. Aliment Pharmacol Ther. 2018; 48(8):839-851. DOI: 10.1111/apt.14930. View

2.
Anderson K, Moss K, Campbell B, Moote D, Kakazu K, Hyams J . Follicular Dendritic Cell Sarcoma in a Patient With Adolescent-Onset Crohn's Disease Exposed to Multiple Immunomodulator and Biologic Therapies. JPGN Rep. 2023; 3(3):e231. PMC: 10158454. DOI: 10.1097/PG9.0000000000000231. View

3.
Hochreiter S, Schmidhuber J . Long short-term memory. Neural Comput. 1997; 9(8):1735-80. DOI: 10.1162/neco.1997.9.8.1735. View

4.
Norgeot B, Muenzen K, Peterson T, Fan X, Glicksberg B, Schenk G . Protected Health Information filter (Philter): accurately and securely de-identifying free-text clinical notes. NPJ Digit Med. 2020; 3:57. PMC: 7156708. DOI: 10.1038/s41746-020-0258-y. View

5.
Hazell L, Shakir S . Under-reporting of adverse drug reactions : a systematic review. Drug Saf. 2006; 29(5):385-96. DOI: 10.2165/00002018-200629050-00003. View