Risk Stratification of ICU Patients Using Topic Models Inferred from Unstructured Progress Notes

Overview
Date 2013 Jan 11
PMID 23304322
Citations 37
Abstract

We propose a novel approach to ICU patient risk stratification that predicts hospital mortality by combining physiologic data (from SAPS-I) with the learned "topic" structure of clinical concepts (represented by UMLS codes) extracted from unstructured nursing notes. We used Hierarchical Dirichlet Processes (HDP), a non-parametric topic modeling technique, to automatically discover "topics" as shared groups of co-occurring UMLS clinical concepts. We evaluated the utility of the inferred topic structure for predicting hospital mortality using the nursing notes of 14,739 adult ICU patients (mortality 14.6%) from the MIMIC II database. Using physiologic data alone, the SAPS-I algorithm achieved an AUC of 0.72. Combining the first 24 hours of physiologic data with topics learned from the nursing notes written in the same period raised the AUC to 0.82. Thus, the extracted clinical topics significantly improved the performance of the baseline SAPS-I algorithm for hospital mortality prediction.
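For readers unfamiliar with the metric, the AUC values reported in the abstract can be read as the probability that a randomly chosen patient who died receives a higher risk score than a randomly chosen survivor. The following is a minimal sketch of that calculation and is not the authors' code. The function name `auc` and the toy arrays `died`, `baseline_risk`, and `augmented_risk` are hypothetical and invented purely for illustration; they are not MIMIC II data and do not reproduce the reported results.

```python
from typing import Sequence


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Area under the ROC curve via pairwise comparison.

    Returns the fraction of (positive, negative) pairs in which the
    positive case has the higher score; ties count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data (assumption, made up for this example):
# 1 = died in hospital, 0 = survived.
died = [0, 0, 0, 1, 0, 1, 0, 1]
# Stand-in for a risk score from physiologic data alone.
baseline_risk = [0.10, 0.30, 0.20, 0.25, 0.40, 0.60, 0.15, 0.35]
# Stand-in for a risk score that also uses note-derived topic features.
augmented_risk = [0.05, 0.20, 0.15, 0.45, 0.30, 0.70, 0.10, 0.50]

print("baseline AUC: ", auc(baseline_risk, died))
print("augmented AUC:", auc(augmented_risk, died))
```

The pairwise form is quadratic in the number of patients, so for a cohort of the size described above a rank-based implementation from a statistics library would be the more practical choice.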

Citing Articles

AD-BERT: Using pre-trained language model to predict the progression from mild cognitive impairment to Alzheimer's disease.

Mao C, Xu J, Rasmussen L, Li Y, Adekkanattu P, Pacheco J. J Biomed Inform. 2023; 144:104442.

PMID: 37429512 PMC: 11131134. DOI: 10.1016/j.jbi.2023.104442.


Classification of neurologic outcomes from medical notes using natural language processing.

Fernandes M, Valizadeh N, Alabsi H, Quadri S, Tesh R, Bucklin A. Expert Syst Appl. 2023; 214.

PMID: 36865787 PMC: 9974159. DOI: 10.1016/j.eswa.2022.119171.


Natural Language Processing of Nursing Notes: An Integrative Review.

Mitha S, Schwartz J, Hobensack M, Cato K, Woo K, Smaldone A. Comput Inform Nurs. 2023; 41(6):377-384.

PMID: 36730744 PMC: 11499545. DOI: 10.1097/CIN.0000000000000967.


Current status and trends in researches based on public intensive care databases: A scientometric investigation.

Li M, Du S. Front Public Health. 2022; 10:912151.

PMID: 36187634 PMC: 9521614. DOI: 10.3389/fpubh.2022.912151.


Predicting Intensive Care Unit Length of Stay and Mortality Using Patient Vital Signs: Machine Learning Model Development and Validation.

Alghatani K, Ammar N, Rezgui A, Shaban-Nejad A. JMIR Med Inform. 2021; 9(5):e21347.

PMID: 33949961 PMC: 8135024. DOI: 10.2196/21347.

