» Articles » PMID: 35652114

External Validation of Deep Learning Algorithms for Radiologic Diagnosis: A Systematic Review

Overview
Date 2022 Jun 2
PMID 35652114
Authors
Affiliations
Soon will be listed here.
Abstract

Purpose: To assess generalizability of published deep learning (DL) algorithms for radiologic diagnosis.

Materials And Methods: In this systematic review, the PubMed database was searched for peer-reviewed studies of DL algorithms for image-based radiologic diagnosis that included external validation, published from January 1, 2015, through April 1, 2021. Studies using nonimaging features or incorporating non-DL methods for feature extraction or classification were excluded. Two reviewers independently evaluated studies for inclusion, and any discrepancies were resolved by consensus. Internal and external performance measures and pertinent study characteristics were extracted, and relationships among these data were examined using nonparametric statistics.

Results: Eighty-three studies reporting 86 algorithms were included. The vast majority (70 of 86, 81%) reported at least some decrease in external performance compared with internal performance, with nearly half (42 of 86, 49%) reporting at least a modest decrease (≥0.05 on the unit scale) and nearly a quarter (21 of 86, 24%) reporting a substantial decrease (≥0.10 on the unit scale). No study characteristics were found to be associated with the difference between internal and external performance.

Conclusion: Among published external validation studies of DL algorithms for image-based radiologic diagnosis, the vast majority demonstrated diminished algorithm performance on the external dataset, with some reporting a substantial performance decrease. Meta-Analysis, Computer Applications-Detection/Diagnosis, Neural Networks, Computer Applications-General (Informatics), Epidemiology, Technology Assessment, Diagnosis, Informatics . © RSNA, 2022.

Citing Articles

Development and evaluation of a 3D ensemble framework for automatic diagnosis of early osteonecrosis of the femoral head based on MRI: a multicenter diagnostic study.

Yang M, Hsiang F, Li C, Chen X, Zhang C, Sun G Front Surg. 2025; 12:1555749.

PMID: 40026392 PMC: 11868283. DOI: 10.3389/fsurg.2025.1555749.


Deep Learning in Thoracic Oncology: Meta-Analytical Insights into Lung Nodule Early-Detection Technologies.

Wang T, Wang C, Hong J, Chao H, Chen Y, Wu Y Cancers (Basel). 2025; 17(4).

PMID: 40002216 PMC: 11853243. DOI: 10.3390/cancers17040621.


Sharing reliable information worldwide: healthcare strategies based on artificial intelligence need external validation. Position paper.

Pennestri F, Cabitza F, Picerno N, Banfi G BMC Med Inform Decis Mak. 2025; 25(1):56.

PMID: 39905337 PMC: 11796012. DOI: 10.1186/s12911-025-02883-2.


Use of AI in Cardiac CT and MRI: A Scientific Statement from the ESCR, EuSoMII, NASCI, SCCT, SCMR, SIIM, and RSNA.

Mastrodicasa D, van Assen M, Huisman M, Leiner T, Williamson E, Nicol E Radiology. 2025; 314(1):e240516.

PMID: 39873607 PMC: 11783164. DOI: 10.1148/radiol.240516.


External Validation of Deep Learning Models for Classifying Etiology of Retinal Hemorrhage Using Diverse Fundus Photography Datasets.

Khosravi P, Huck N, Shahraki K, Ghafari E, Azimi R, Kim S Bioengineering (Basel). 2025; 12(1).

PMID: 39851294 PMC: 11760437. DOI: 10.3390/bioengineering12010020.


References
1.
Rudin C . Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead. Nat Mach Intell. 2022; 1(5):206-215. PMC: 9122117. DOI: 10.1038/s42256-019-0048-x. View

2.
Mongan J, Moy L, Kahn Jr C . Checklist for Artificial Intelligence in Medical Imaging (CLAIM): A Guide for Authors and Reviewers. Radiol Artif Intell. 2021; 2(2):e200029. PMC: 8017414. DOI: 10.1148/ryai.2020200029. View

3.
Xiao L, Li P, Sun F, Zhang Y, Xu C, Zhu H . Development and Validation of a Deep Learning-Based Model Using Computed Tomography Imaging for Predicting Disease Severity of Coronavirus Disease 2019. Front Bioeng Biotechnol. 2020; 8:898. PMC: 7411489. DOI: 10.3389/fbioe.2020.00898. View

4.
Nael K, Gibson E, Yang C, Ceccaldi P, Yoo Y, Das J . Automated detection of critical findings in multi-parametric brain MRI using a system of 3D neural networks. Sci Rep. 2021; 11(1):6876. PMC: 7994311. DOI: 10.1038/s41598-021-86022-7. View

5.
. AI diagnostics need attention. Nature. 2018; 555(7696):285. DOI: 10.1038/d41586-018-03067-x. View