» Articles » PMID: 39997335

Towards Trustworthy AI in Healthcare: Epistemic Uncertainty Estimation for Clinical Decision Support

Overview
Journal J Pers Med
Date 2025 Feb 25
PMID 39997335
Authors
Affiliations
Soon will be listed here.
Abstract

Widespread adoption of AI for medical decision-making is still hindered due to ethical and safety-related concerns. For AI-based decision support systems in healthcare settings, it is paramount to be reliable and trustworthy. Common deep learning approaches, however, have the tendency towards overconfidence when faced with unfamiliar or changing conditions. Inappropriate extrapolation beyond well-supported scenarios may have dire consequences highlighting the importance of the reliable estimation of local knowledge uncertainty and its communication to the end user. While neural network ensembles (ENNs) have been heralded as a potential solution to these issues for many years, deep learning methods, specifically modeling the amount of knowledge, promise more principled and reliable behavior. This study compares their reliability in clinical applications. We centered our analysis on experiments with low-dimensional toy datasets and the exemplary case study of mortality prediction for intensive care unit hospitalizations using Electronic Health Records (EHRs) from the MIMIC3 study. For predictions on the EHR time series, Encoder-Only Transformer models were employed. Knowledge uncertainty estimation is achieved with both ensemble and Spectral Normalized Neural Gaussian Process (SNGP) variants of the common Transformer model. We designed two datasets to test their reliability in detecting token level and more subtle discrepancies both for toy datasets and an EHR dataset. While both SNGP and ENN model variants achieve similar prediction performance (AUROC: ≈0.85, AUPRC: ≈0.52 for in-hospital mortality prediction from a selected MIMIC3 benchmark), the former demonstrates improved capabilities to quantify knowledge uncertainty for individual samples/patients. Methods including a knowledge model, such as SNGP, offer superior uncertainty estimation compared to traditional stochastic deep learning, leading to more trustworthy and safe clinical decision support.

References
1.
Si Y, Du J, Li Z, Jiang X, Miller T, Wang F . Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review. J Biomed Inform. 2021; 115:103671. PMC: 11290708. DOI: 10.1016/j.jbi.2020.103671. View

2.
Liang J, Li Y, Zhang Z, Shen D, Xu J, Zheng X . Adoption of Electronic Health Records (EHRs) in China During the Past 10 Years: Consecutive Survey Data Analysis and Comparison of Sino-American Challenges and Experiences. J Med Internet Res. 2021; 23(2):e24813. PMC: 7932845. DOI: 10.2196/24813. View

3.
Ristevski B, Chen M . Big Data Analytics in Medicine and Healthcare. J Integr Bioinform. 2018; 15(3). PMC: 6340124. DOI: 10.1515/jib-2017-0030. View

4.
Shickel B, Tighe P, Bihorac A, Rashidi P . Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis. IEEE J Biomed Health Inform. 2018; 22(5):1589-1604. PMC: 6043423. DOI: 10.1109/JBHI.2017.2767063. View

5.
Raghupathi W, Raghupathi V . Big data analytics in healthcare: promise and potential. Health Inf Sci Syst. 2015; 2:3. PMC: 4341817. DOI: 10.1186/2047-2501-2-3. View