» Articles » PMID: 35834334

Language-agnostic Pharmacovigilant Text Mining to Elicit Side Effects from Clinical Notes and Hospital Medication Records

Abstract

We sought to craft a drug safety signalling pipeline associating latent information in clinical free text with exposures to single drugs and drug pairs. Data arose from 12 secondary and tertiary public hospitals in two Danish regions, comprising approximately half the Danish population. Notes were operationalised with a fastText embedding, based on which we trained 10 270 neural-network models (one for each distinct single-drug/drug-pair exposure) predicting the risk of exposure given an embedding vector. We included 2 905 251 admissions between May 2008 and June 2016, with 13 740 564 distinct drug prescriptions; the median number of prescriptions was 5 (IQR: 3-9) and in 1 184 340 (41%) admissions patients used ≥5 drugs concomitantly. A total of 10 788 259 clinical notes were included, with 179 441 739 tokens retained after pruning. Of 345 single-drug signals reviewed, 28 (8.1%) represented possibly undescribed relationships; 186 (54%) signals were clinically meaningful. Sixteen (14%) of the 115 drug-pair signals were possible interactions, and two (1.7%) were known. In conclusion, we built a language-agnostic pipeline for mining associations between free-text information and medication exposure without manual curation, predicting not the likely outcome of a range of exposures but also the likely exposures for outcomes of interest. Our approach may help overcome limitations of text mining methods relying on curated data in English and can help leverage non-English free text for pharmacovigilance.

Citing Articles

Year 2022 in Medical Natural Language Processing: Availability of Language Models as a Step in the Democratization of NLP in the Biomedical Area.

Grouin C, Grabar N Yearb Med Inform. 2023; 32(1):244-252.

PMID: 38147866 PMC: 10751107. DOI: 10.1055/s-0043-1768752.


Use of Electronic Health Record Data for Drug Safety Signal Identification: A Scoping Review.

Davis S, Zabotka L, Desai R, Wang S, Maro J, Coughlin K Drug Saf. 2023; 46(8):725-742.

PMID: 37340238 PMC: 11635839. DOI: 10.1007/s40264-023-01325-0.


Language-agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records.

Kaas-Hansen B, Placido D, Rodriguez C, Thorsen-Meyer H, Gentile S, Nielsen A Basic Clin Pharmacol Toxicol. 2022; 131(4):282-293.

PMID: 35834334 PMC: 9541191. DOI: 10.1111/bcpt.13773.

References
1.
Lombardi N, Bettiol A, Crescioli G, Ravaldi C, Bonaiuti R, Venegoni M . Risk of hospitalisation associated with benzodiazepines and z-drugs in Italy: a nationwide multicentre study in emergency departments. Intern Emerg Med. 2020; 15(7):1291-1302. DOI: 10.1007/s11739-020-02339-7. View

2.
Moride Y, Haramburu F, Requejo A, Begaud B . Under-reporting of adverse drug reactions in general practice. Br J Clin Pharmacol. 1997; 43(2):177-81. PMC: 2042725. DOI: 10.1046/j.1365-2125.1997.05417.x. View

3.
Alvaro N, Conway M, Doan S, Lofi C, Overington J, Collier N . Crowdsourcing Twitter annotations to identify first-hand experiences of prescription drug use. J Biomed Inform. 2015; 58:280-287. DOI: 10.1016/j.jbi.2015.11.004. View

4.
Rezaallah B, Lewis D, Pierce C, Zeilhofer H, Berg B . Social Media Surveillance of Multiple Sclerosis Medications Used During Pregnancy and Breastfeeding: Content Analysis. J Med Internet Res. 2019; 21(8):e13003. PMC: 6702799. DOI: 10.2196/13003. View

5.
Patrignani A, Palmieri G, Ciampani N, Moretti V, Mariani A, Racca L . [Under-reporting of adverse drug reactions, a problem that also involves medicines subject to additional monitoring. Preliminary data from a single-center experience on novel oral anticoagulants]. G Ital Cardiol (Rome). 2018; 19(1):54-61. DOI: 10.1714/2852.28779. View