Study of Effect of Drug Lexicons on Medication Extraction from Electronic Medical Records
Overview
Extraction of relevant information from free-text clinical notes is becoming increasingly important in healthcare as a means of providing personalized care to patients. The purpose of this dictionary-based NLP study was to determine how the choice of drug lexicon affects the automatic extraction of medication information from electronic medical records. A convenience training sample of 52 documents, each containing at least one medication, and a randomized test sample of 100 documents were used. The training and test documents contained a total of 681 and 641 medications, respectively. Three drug lexicons were used as sources for medication extraction: the first contained drug names and generic names; the second contained drug, generic, and short names; the third contained drug, generic, and short names and was followed by filtering techniques. Extraction with the first lexicon resulted in 83.7% sensitivity and 96.2% specificity for the training set, and 85.2% sensitivity and 96.9% specificity for the test set. Adding the list of short names used for drugs increased sensitivity to 95.0% but decreased specificity to 79.2% for the training set. Similar results, 96.4% sensitivity and 80.1% specificity, were obtained for the test set. Combining a set of filtering techniques with the second lexicon increased specificity to 98.5% and 98.8% for the training and test sets, respectively, while slightly decreasing sensitivity to 94.1% (training) and 95.8% (test). Overall, the lexicon with filtering performed best: it achieved the highest specificity with sensitivity close to that of the unfiltered second lexicon, i.e., it extracted nearly the highest number of medications while keeping the number of extracted non-medications low.
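The trade-off described above can be illustrated with a minimal Python sketch. It is not the study's implementation: the lexicon entries, the sample note, the function names, and the stop-term filter are all hypothetical, and the abstract does not say which filtering techniques were actually used. The sketch only shows how adding short names can raise sensitivity while admitting false positives, and how a simple filter can remove them again.

```python
import re


def extract_medications(text, lexicon, stop_terms=frozenset()):
    """Return the lexicon terms that occur in text as whole words.

    Terms listed in stop_terms are skipped; this stands in for the
    (unspecified) filtering techniques and is an assumption.
    """
    lowered = text.lower()
    found = set()
    for term in lexicon:
        term = term.lower()
        if term in stop_terms:
            continue
        # Whole-word match so that "asa" does not match inside "nasal".
        if re.search(r"\b" + re.escape(term) + r"\b", lowered):
            found.add(term)
    return found


def sensitivity(tp, fn):
    """Fraction of true medications that were extracted."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """Fraction of non-medications that were correctly left out."""
    return tn / (tn + fp)


# Hypothetical note and lexicons, for illustration only.
note = "Patient started on Lipitor 20 mg and ASA daily. Mom reports no side effects."
names_only = {"lipitor", "atorvastatin", "aspirin"}   # drug + generic names
with_short = names_only | {"asa", "mom"}              # plus short names
ambiguous = {"mom"}                                   # assumed filter list

first = extract_medications(note, names_only)
second = extract_medications(note, with_short)
third = extract_medications(note, with_short, stop_terms=ambiguous)

print(sorted(first))   # misses the abbreviation "ASA"
print(sorted(second))  # finds "asa" but also the non-medication "mom"
print(sorted(third))   # keeps "asa", drops "mom"
```

In this toy note the first lexicon misses "ASA", the second finds it but also flags "Mom" (a short name for milk of magnesia that here is an ordinary word), and the filtered run keeps the true medication while discarding the false positive, which mirrors the pattern of the reported sensitivity and specificity figures.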