» Articles » PMID: 34179681

Information Extraction From FDA Drug Labeling to Enhance Product-Specific Guidance Assessment Using Natural Language Processing

Overview
Date 2021 Jun 28
PMID 34179681
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

Towards the objectives of the UnitedStates Food and Drug Administration (FDA) generic drug science and research program, it is of vital importance in developing product-specific guidances (PSGs) with recommendations that can facilitate and guide generic product development. To generate a PSG, the assessor needs to retrieve supportive information about the drug product of interest, including from the drug labeling, which contain comprehensive information about drug products and instructions to physicians on how to use the products for treatment. Currently, although there are many drug labeling data resources, none of them including those developed by the FDA (e.g., Drugs@FDA) can cover all the FDA-approved drug products. Furthermore, these resources, housed in various locations, are often in forms that are not compatible or interoperable with each other. Therefore, there is a great demand for retrieving useful information from a large number of textual documents from different data resources to support an effective PSG development. To meet the needs, we developed a Natural Language Processing (NLP) pipeline by integrating multiple disparate publicly available data resources to extract drug product information with minimal human intervention. We provided a case study for identifying food effect information to illustrate how a machine learning model is employed to achieve accurate paragraph labeling. We showed that the pre-trained Bidirectional Encoder Representations from Transformers (BERT) model is able to outperform the traditional machine learning techniques, setting a new state-of-the-art for labelling food effect paragraphs from drug labeling and approved drug products datasets.

Citing Articles

Knowledge-guided generative artificial intelligence for automated taxonomy learning from drug labels.

Fang Y, Ryan P, Weng C J Am Med Inform Assoc. 2024; 31(9):2065-2075.

PMID: 38787964 PMC: 11339527. DOI: 10.1093/jamia/ocae105.


Development of Quantitative Comparative Approaches to Support Complex Generic Drug Development.

Gong Y, Barretto F, Tsong Y, Mousa Y, Ren K, Kozak D AAPS J. 2024; 26(1):15.

PMID: 38267593 DOI: 10.1208/s12248-024-00885-y.


AI in drug discovery and its clinical relevance.

Qureshi R, Irfan M, Gondal T, Khan S, Wu J, Hadi M Heliyon. 2023; 9(7):e17575.

PMID: 37396052 PMC: 10302550. DOI: 10.1016/j.heliyon.2023.e17575.


A natural language processing approach towards harmonisation of European medicinal product information.

Bergman E, Sherwood K, Forslund M, Arlett P, Westman G PLoS One. 2022; 17(10):e0275386.

PMID: 36264941 PMC: 9584511. DOI: 10.1371/journal.pone.0275386.

References
1.
Sharma M, Karrison T, Kell B, Wu K, Turcich M, Geary D . Evaluation of food effect on pharmacokinetics of vismodegib in advanced solid tumor patients. Clin Cancer Res. 2013; 19(11):3059-67. PMC: 3674138. DOI: 10.1158/1078-0432.CCR-12-3829. View

2.
Bisgin H, Liu Z, Fang H, Xu X, Tong W . Mining FDA drug labels using an unsupervised learning technique--topic modeling. BMC Bioinformatics. 2011; 12 Suppl 10:S11. PMC: 3236833. DOI: 10.1186/1471-2105-12-S10-S11. View

3.
Schadow G . HL7 Structured Product Labeling - electronic prescribing information for provider order entry decision support. AMIA Annu Symp Proc. 2006; :1108. PMC: 1560894. View

4.
Fang H, Harris S, Liu Z, Zhou G, Zhang G, Xu J . FDA drug labeling: rich resources to facilitate precision medicine, drug safety, and regulatory science. Drug Discov Today. 2016; 21(10):1566-1570. DOI: 10.1016/j.drudis.2016.06.006. View

5.
Fung K, Jao C, Demner-Fushman D . Extracting drug indication information from structured product labels using natural language processing. J Am Med Inform Assoc. 2013; 20(3):482-8. PMC: 3628062. DOI: 10.1136/amiajnl-2012-001291. View