» Articles » PMID: 35005697

The Food and Drug Administration Biologics Effectiveness and Safety Initiative Facilitates Detection of Vaccine Administrations From Unstructured Data in Medical Records Through Natural Language Processing

Overview
Date 2022 Jan 10
PMID 35005697
Authors
Affiliations
Soon will be listed here.
Abstract

The Food and Drug Administration Center for Biologics Evaluation and Research conducts post-market surveillance of biologic products to ensure their safety and effectiveness. Studies have found that common vaccine exposures may be missing from structured data elements of electronic health records (EHRs), instead being captured in clinical notes. This impacts monitoring of adverse events following immunizations (AEFIs). For example, COVID-19 vaccines have been regularly administered outside of traditional medical settings. We developed a natural language processing (NLP) algorithm to mine unstructured clinical notes for vaccinations not captured in structured EHR data. A random sample of 1,000 influenza vaccine administrations, representing 995 unique patients, was extracted from a large U.S. EHR database. NLP techniques were used to detect administrations from the clinical notes in the training dataset [80% ( = 797) of patients]. The algorithm was applied to the validation dataset [20% ( = 198) of patients] to assess performance. Full medical charts for 28 randomly selected administration events in the validation dataset were reviewed by clinicians. The NLP algorithm was then applied across the entire dataset ( = 995) to quantify the number of additional events identified. A total of 3,199 administrations were identified in the structured data and clinical notes combined. Of these, 2,740 (85.7%) were identified in the structured data, while the NLP algorithm identified 1,183 (37.0%) administrations in clinical notes; 459 were not also captured in the structured data. This represents a 16.8% increase in the identification of vaccine administrations compared to using structured data alone. The validation of 28 vaccine administrations confirmed 27 (96.4%) as "definite" vaccine administrations; 18 (64.3%) had evidence of a vaccination event in the structured data, while 10 (35.7%) were found solely in the unstructured notes. We demonstrated the utility of an NLP algorithm to identify vaccine administrations not captured in structured EHR data. NLP techniques have the potential to improve detection of vaccine administrations not otherwise reported without increasing the analysis burden on physicians or practitioners. Future applications could include refining estimates of vaccine coverage and detecting other exposures, population characteristics, and outcomes not reliably captured in structured EHR data.

Citing Articles

Development of Interoperable Computable Phenotype Algorithms for Adverse Events of Special Interest to Be Used for Biologics Safety Surveillance: Validation Study.

Holdefer A, Pizarro J, Saunders-Hastings P, Beers J, Sang A, Hettinger A JMIR Public Health Surveill. 2024; 10:e49811.

PMID: 39008361 PMC: 11287092. DOI: 10.2196/49811.


Adverse events after first and second doses of COVID-19 vaccination in England: a national vaccine surveillance platform self-controlled case series study.

Tsang R, Agrawal U, Joy M, Byford R, Robertson C, Anand S J R Soc Med. 2023; 117(4):134-148.

PMID: 37921538 PMC: 11100448. DOI: 10.1177/01410768231205430.


Adverse events following first and second dose COVID-19 vaccination in England, October 2020 to September 2021: a national vaccine surveillance platform self-controlled case series study.

Tsang R, Joy M, Byford R, Robertson C, Anand S, Hinton W Euro Surveill. 2023; 28(3).

PMID: 36695484 PMC: 9853944. DOI: 10.2807/1560-7917.ES.2023.28.3.2200195.

References
1.
Shimabukuro T, Nguyen M, Martin D, DeStefano F . Safety monitoring in the Vaccine Adverse Event Reporting System (VAERS). Vaccine. 2015; 33(36):4398-405. PMC: 4632204. DOI: 10.1016/j.vaccine.2015.07.035. View

2.
Naidu R . Causality assessment: A brief insight into practices in pharmaceutical industry. Perspect Clin Res. 2013; 4(4):233-6. PMC: 3835968. DOI: 10.4103/2229-3485.120173. View

3.
Leite A, Andrews N, Thomas S . Near real-time vaccine safety surveillance using electronic health records-a systematic review of the application of statistical methods. Pharmacoepidemiol Drug Saf. 2016; 25(3):225-37. PMC: 5021108. DOI: 10.1002/pds.3966. View

4.
Zhou F, Shefer A, Wenger J, Messonnier M, Wang L, Lopez A . Economic evaluation of the routine childhood immunization program in the United States, 2009. Pediatrics. 2014; 133(4):577-85. DOI: 10.1542/peds.2013-0698. View

5.
Kimia A, Savova G, Landschaft A, Harper M . An Introduction to Natural Language Processing: How You Can Get More From Those Electronic Notes You Are Generating. Pediatr Emerg Care. 2015; 31(7):536-41. DOI: 10.1097/PEC.0000000000000484. View