» Articles » PMID: 24303276

Identifying Abdominal Aortic Aneurysm Cases and Controls Using Natural Language Processing of Radiology Reports

Overview
Specialty Biology
Date 2013 Dec 5
PMID 24303276
Citations 20
Authors
Affiliations
Soon will be listed here.
Abstract

Prevalence of abdominal aortic aneurysm (AAA) is increasing due to longer life expectancy and implementation of screening programs. Patient-specific longitudinal measurements of AAA are important to understand pathophysiology of disease development and modifiers of abdominal aortic size. In this paper, we applied natural language processing (NLP) techniques to process radiology reports and developed a rule-based algorithm to identify AAA patients and also extract the corresponding aneurysm size with the examination date. AAA patient cohorts were determined by a hierarchical approach that: 1) selected potential AAA reports using keywords; 2) classified reports into AAA-case vs. non-case using rules; and 3) determined the AAA patient cohort based on a report-level classification. Our system was built in an Unstructured Information Management Architecture framework that allows efficient use of existing NLP components. Our system produced an F-score of 0.961 for AAA-case report classification with an accuracy of 0.984 for aneurysm size extraction.

Citing Articles

Extracting cancer concepts from clinical notes using natural language processing: a systematic review.

Gholipour M, Khajouei R, Amiri P, Gohari S, Ahmadian L BMC Bioinformatics. 2023; 24(1):405.

PMID: 37898795 PMC: 10613366. DOI: 10.1186/s12859-023-05480-0.


Comprehensive Review of Natural Language Processing (NLP) in Vascular Surgery.

Lareyre F, Nasr B, Chaudhuri A, Di Lorenzo G, Carlier M, Raffort J EJVES Vasc Forum. 2023; 60:57-63.

PMID: 37822918 PMC: 10562666. DOI: 10.1016/j.ejvsvf.2023.09.002.


Near Real-time Natural Language Processing for the Extraction of Abdominal Aortic Aneurysm Diagnoses From Radiology Reports: Algorithm Development and Validation Study.

Gaviria-Valencia S, Murphy S, Kaggal V, McBane Ii R, Rooke T, Chaudhry R JMIR Med Inform. 2023; 11:e40964.

PMID: 36826984 PMC: 10007015. DOI: 10.2196/40964.


Evaluation of the portability of computable phenotypes with natural language processing in the eMERGE network.

Pacheco J, Rasmussen L, Wiley Jr K, Person T, Cronkite D, Sohn S Sci Rep. 2023; 13(1):1971.

PMID: 36737471 PMC: 9898520. DOI: 10.1038/s41598-023-27481-y.


Accurately Identifying Cerebroarterial Stenosis from Angiography Reports Using Natural Language Processing Approaches.

Lin C, Hsu K, Liang C, Lee T, Shih C, Fann Y Diagnostics (Basel). 2022; 12(8).

PMID: 36010232 PMC: 9406429. DOI: 10.3390/diagnostics12081882.


References
1.
Helgadottir A, Thorleifsson G, Magnusson K, Gretarsdottir S, Steinthorsdottir V, Manolescu A . The same sequence variant on 9p21 associates with myocardial infarction, abdominal aortic aneurysm and intracranial aneurysm. Nat Genet. 2008; 40(2):217-24. DOI: 10.1038/ng.72. View

2.
Friedman C, Shagina L, Lussier Y, Hripcsak G . Automated encoding of clinical documents based on natural language processing. J Am Med Inform Assoc. 2004; 11(5):392-402. PMC: 516246. DOI: 10.1197/jamia.M1552. View

3.
McCarty C, Chisholm R, Chute C, Kullo I, Jarvik G, Larson E . The eMERGE Network: a consortium of biorepositories linked to electronic medical records data for conducting genomic studies. BMC Med Genomics. 2011; 4:13. PMC: 3038887. DOI: 10.1186/1755-8794-4-13. View

4.
Kullo I, Ding K, Jouni H, Smith C, Chute C . A genome-wide association study of red blood cell traits using the electronic medical record. PLoS One. 2010; 5(9). PMC: 2946914. DOI: 10.1371/journal.pone.0013011. View

5.
Pakhomov S, Buntrock J, Chute C . Automating the assignment of diagnosis codes to patient encounters using example-based and machine learning techniques. J Am Med Inform Assoc. 2006; 13(5):516-25. PMC: 1561792. DOI: 10.1197/jamia.M2077. View