» Articles » PMID: 8947725

Identification of Suspected Tuberculosis Patients Based on Natural Language Processing of Chest Radiograph Reports

Overview
Date 1996 Jan 1
PMID 8947725
Citations 39
Authors
Affiliations
Soon will be listed here.
Abstract

Identification of eligible patients from electronically available patient data is a key difficulty in computerizing clinical practice guidelines because a large amount of the relevant data is stored as free text. We have been using MedLEE (Medical Language Extraction and Encoding System), a natural language processing system, to encode the clinical information in all chest radiograph and mammogram reports. This paper describes a retrospective study to determine if MedLEE can identify patients at risk for having tuberculosis (TB) based on their admission chest radiographs. Reports of 171 adult inpatients with culture-positive TB during 1992 and 1993 were manually coded (by a TB specialist) using seven terms suggestive of TB, and were also encoded by MedLEE. Using manual coding as the gold standard, MedLEE agreed on the classification of 152/171 (88.9%) reports--129/142 (90.8%) suspicious for TB and 23/29 (79.3%) not suspicious for TB; and 1072/1197 (89.6%) terms indicative of TB. Analysis showed that most of the discrepancies were caused by MedLEE not finding the location of the infiltrate. By ignoring the location of the infiltrate, the agreement became 157/171 (91.8%) reports and 946/1026 (92.2%) terms. Thus, natural language processing offers a practical alternative for using free-text reports to determine patient eligibility for computerized clinical practice guidelines.

Citing Articles

Combining text mining with clinical decision support in clinical practice: a scoping review.

van de Burgt B, Wasylewicz A, Dullemond B, Grouls R, Egberts T, Bouwman A J Am Med Inform Assoc. 2022; 30(3):588-603.

PMID: 36512578 PMC: 9933076. DOI: 10.1093/jamia/ocac240.


ACE: the Advanced Cohort Engine for searching longitudinal patient records.

Callahan A, Polony V, Posada J, Banda J, Gombar S, Shah N J Am Med Inform Assoc. 2021; 28(7):1468-1479.

PMID: 33712854 PMC: 8279796. DOI: 10.1093/jamia/ocab027.


Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study.

Petch J, Batt J, Murray J, Mamdani M JMIR Med Inform. 2019; 7(4):e12575.

PMID: 31682579 PMC: 6913750. DOI: 10.2196/12575.


Cohort selection for clinical trials using hierarchical neural network.

Xiong Y, Shi X, Chen S, Jiang D, Tang B, Wang X J Am Med Inform Assoc. 2019; 26(11):1203-1208.

PMID: 31305921 PMC: 7647215. DOI: 10.1093/jamia/ocz099.


Automated detection of sudden unexpected death in epilepsy risk factors in electronic medical records using natural language processing.

Barbour K, Hesdorffer D, Tian N, Yozawitz E, McGoldrick P, Wolf S Epilepsia. 2019; 60(6):1209-1220.

PMID: 31111463 PMC: 11771062. DOI: 10.1111/epi.15966.


References
1.
Tierney W, McDonald C, Martin D, ROGERS M . Computerized display of past test results. Effect on outpatient testing. Ann Intern Med. 1987; 107(4):569-74. DOI: 10.7326/0003-4819-107-4-569. View

2.
Elson R, Connelly D . Computerized patient records in primary care. Their role in mediating guideline-driven physician behavior change. Arch Fam Med. 1995; 4(8):698-705. DOI: 10.1001/archfami.4.8.698. View

3.
Johnson S, Friedman C, Cimino J, Clark T, Hripcsak G, Clayton P . Conceptual data model for a central patient database. Proc Annu Symp Comput Appl Med Care. 1991; :381-5. PMC: 2247559. View

4.
Bloom B, Murray C . Tuberculosis: commentary on a reemergent killer. Science. 1992; 257(5073):1055-64. DOI: 10.1126/science.257.5073.1055. View

5.
Safran C, Rind D, Davis R, Ives D, Sands D, Currier J . Guidelines for management of HIV infection with computer-based patient's record. Lancet. 1995; 346(8971):341-6. DOI: 10.1016/s0140-6736(95)92226-1. View