» Articles » PMID: 18999156

Automated Knowledge Acquisition from Clinical Narrative Reports

Overview
Date 2008 Nov 13
PMID 18999156
Citations 22
Authors
Affiliations
Soon will be listed here.
Abstract

Knowledge of associations between biomedical entities, such as disease-symptoms, is critical for many automated biomedical applications. In this work, we develop automated methods for acquisition and discovery of medical knowledge embedded in clinical narrative reports. MedLEE, a Natural Language Processing (NLP) system, is applied to extract and encode clinical entities from narrative clinical reports obtained from New York-Presbyterian Hospital (NYPH), and associations between the clinical entities are determined based on statistical methods adjusted by volume tests. We focus on two types of entities, disease and symptom, in this study. Evaluation based on a random sample of disease-symptom associations indicates an overall recall of 90% and a precision of 92%. In conclusion, the preliminary study demonstrated that this method for knowledge acquisition of disease-symptom pairs from clinical reports is effective. The automated method is generalizable, and can be applied to detect other clinical associations, such as between diseases and medications.

Citing Articles

Real-World Insights Into Dementia Diagnosis Trajectory and Clinical Practice Patterns Unveiled by Natural Language Processing: Development and Usability Study.

Paek H, Fortinsky R, Lee K, Huang L, Maghaydah Y, Kuchel G JMIR Aging. 2025; 8:e65221.

PMID: 39999185 PMC: 11878476. DOI: 10.2196/65221.


Applying Natural Language Processing to Textual Data From Clinical Data Warehouses: Systematic Review.

Bazoge A, Morin E, Daille B, Gourraud P JMIR Med Inform. 2023; 11:e42477.

PMID: 38100200 PMC: 10757232. DOI: 10.2196/42477.


Natural language processing with machine learning methods to analyze unstructured patient-reported outcomes derived from electronic health records: A systematic review.

Sim J, Huang X, Horan M, Stewart C, Robison L, Hudson M Artif Intell Med. 2023; 146:102701.

PMID: 38042599 PMC: 10693655. DOI: 10.1016/j.artmed.2023.102701.


LeafAI: query generator for clinical cohort discovery rivaling a human programmer.

Dobbins N, Han B, Zhou W, Lan K, Kim H, Harrington R J Am Med Inform Assoc. 2023; 30(12):1954-1964.

PMID: 37550244 PMC: 10654856. DOI: 10.1093/jamia/ocad149.


Automatic illness prediction system through speech.

Abdulmohsin H, Al-Khateeb B, Hasan S, Dwivedi R Comput Electr Eng. 2022; 102:108224.

PMID: 35880184 PMC: 9302036. DOI: 10.1016/j.compeleceng.2022.108224.


References
1.
Aronson A, Bodenreider O, Chang H, Humphrey S, Mork J, Nelson S . The NLM Indexing Initiative. Proc AMIA Symp. 2000; :17-21. PMC: 2243970. View

2.
Walker J, Carayon P, Leveson N, Paulus R, Tooker J, Chin H . EHR safety: the way forward to safe and effective systems. J Am Med Inform Assoc. 2008; 15(3):272-7. PMC: 2409999. DOI: 10.1197/jamia.M2618. View

3.
Hahn U, Romacker M, Schulz S . Creating knowledge repositories from biomedical reports: the MEDSYNDIKATE text mining system. Pac Symp Biocomput. 2002; :338-49. View

4.
Friedman C, Shagina L, Lussier Y, Hripcsak G . Automated encoding of clinical documents based on natural language processing. J Am Med Inform Assoc. 2004; 11(5):392-402. PMC: 516246. DOI: 10.1197/jamia.M1552. View

5.
Baruch J . Progress in programming for processing English language medical records. Ann N Y Acad Sci. 1965; 126(2):795-804. DOI: 10.1111/j.1749-6632.1965.tb14324.x. View