Identification of Suspected Tuberculosis Patients Based on Natural Language Processing of Chest Radiograph Reports
Overview
Authors
Affiliations
Identification of eligible patients from electronically available patient data is a key difficulty in computerizing clinical practice guidelines because a large amount of the relevant data is stored as free text. We have been using MedLEE (Medical Language Extraction and Encoding System), a natural language processing system, to encode the clinical information in all chest radiograph and mammogram reports. This paper describes a retrospective study to determine if MedLEE can identify patients at risk for having tuberculosis (TB) based on their admission chest radiographs. Reports of 171 adult inpatients with culture-positive TB during 1992 and 1993 were manually coded (by a TB specialist) using seven terms suggestive of TB, and were also encoded by MedLEE. Using manual coding as the gold standard, MedLEE agreed on the classification of 152/171 (88.9%) reports--129/142 (90.8%) suspicious for TB and 23/29 (79.3%) not suspicious for TB; and 1072/1197 (89.6%) terms indicative of TB. Analysis showed that most of the discrepancies were caused by MedLEE not finding the location of the infiltrate. By ignoring the location of the infiltrate, the agreement became 157/171 (91.8%) reports and 946/1026 (92.2%) terms. Thus, natural language processing offers a practical alternative for using free-text reports to determine patient eligibility for computerized clinical practice guidelines.
Combining text mining with clinical decision support in clinical practice: a scoping review.
van de Burgt B, Wasylewicz A, Dullemond B, Grouls R, Egberts T, Bouwman A J Am Med Inform Assoc. 2022; 30(3):588-603.
PMID: 36512578 PMC: 9933076. DOI: 10.1093/jamia/ocac240.
ACE: the Advanced Cohort Engine for searching longitudinal patient records.
Callahan A, Polony V, Posada J, Banda J, Gombar S, Shah N J Am Med Inform Assoc. 2021; 28(7):1468-1479.
PMID: 33712854 PMC: 8279796. DOI: 10.1093/jamia/ocab027.
Petch J, Batt J, Murray J, Mamdani M JMIR Med Inform. 2019; 7(4):e12575.
PMID: 31682579 PMC: 6913750. DOI: 10.2196/12575.
Cohort selection for clinical trials using hierarchical neural network.
Xiong Y, Shi X, Chen S, Jiang D, Tang B, Wang X J Am Med Inform Assoc. 2019; 26(11):1203-1208.
PMID: 31305921 PMC: 7647215. DOI: 10.1093/jamia/ocz099.
Barbour K, Hesdorffer D, Tian N, Yozawitz E, McGoldrick P, Wolf S Epilepsia. 2019; 60(6):1209-1220.
PMID: 31111463 PMC: 11771062. DOI: 10.1111/epi.15966.