» Articles » PMID: 31304378

Deep Learning Predicts Hip Fracture Using Confounding Patient and Healthcare Variables

Overview
Journal NPJ Digit Med
Date 2019 Jul 16
PMID 31304378
Citations 85
Authors
Affiliations
Soon will be listed here.
Abstract

Hip fractures are a leading cause of death and disability among older adults. Hip fractures are also the most commonly missed diagnosis on pelvic radiographs, and delayed diagnosis leads to higher cost and worse outcomes. Computer-aided diagnosis (CAD) algorithms have shown promise for helping radiologists detect fractures, but the image features underpinning their predictions are notoriously difficult to understand. In this study, we trained deep-learning models on 17,587 radiographs to classify fracture, 5 patient traits, and 14 hospital process variables. All 20 variables could be individually predicted from a radiograph, with the best performances on scanner model (AUC = 1.00), scanner brand (AUC = 0.98), and whether the order was marked "priority" (AUC = 0.79). Fracture was predicted moderately well from the image (AUC = 0.78) and better when combining image features with patient data (AUC = 0.86, DeLong paired AUC comparison,  = 2e-9) or patient data plus hospital process features (AUC = 0.91,  = 1e-21). Fracture prediction on a test set that balanced fracture risk across patient variables was significantly lower than a random test set (AUC = 0.67, DeLong unpaired AUC comparison,  = 0.003); and on a test set with fracture risk balanced across patient and hospital process variables, the model performed randomly (AUC = 0.52, 95% CI 0.46-0.58), indicating that these variables were the main source of the model's fracture predictions. A single model that directly combines image features, patient, and hospital process data outperforms a Naive Bayes ensemble of an image-only model prediction, patient, and hospital process data. If CAD algorithms are inexplicably leveraging patient and process variables in their predictions, it is unclear how radiologists should interpret their predictions in the context of other known patient data. Further research is needed to illuminate deep-learning decision processes so that computers and clinicians can effectively cooperate.

Citing Articles

The effect of resizing on the natural appearance of scintigraphic images: an image similarity analysis.

Ghassel S, Jabbarpour A, Lang J, Moulton E, Klein R Front Nucl Med. 2025; 4:1505377.

PMID: 39981066 PMC: 11839826. DOI: 10.3389/fnume.2024.1505377.


Step-by-step causal analysis of EHRs to ground decision-making.

Doutreligne M, Struja T, Abecassis J, Morgand C, Celi L, Varoquaux G PLOS Digit Health. 2025; 4(2):e0000721.

PMID: 39899627 PMC: 11790099. DOI: 10.1371/journal.pdig.0000721.


Deep Learning for Discrimination of Early Spinal Tuberculosis from Acute Osteoporotic Vertebral Fracture on CT.

Liu W, Wang J, Lei Y, Liu P, Han Z, Wang S Infect Drug Resist. 2025; 18():31-42.

PMID: 39776757 PMC: 11706012. DOI: 10.2147/IDR.S482584.


A data-driven framework for identifying patient subgroups on which an AI/machine learning model may underperform.

Subbaswamy A, Sahiner B, Petrick N, Pai V, Adams R, Diamond M NPJ Digit Med. 2024; 7(1):334.

PMID: 39572755 PMC: 11582698. DOI: 10.1038/s41746-024-01275-6.


External validation of an artificial intelligence multi-label deep learning model capable of ankle fracture classification.

Olczak J, Prijs J, IJpma F, Wallin F, Akbarian E, Doornberg J BMC Musculoskelet Disord. 2024; 25(1):788.

PMID: 39367349 PMC: 11451058. DOI: 10.1186/s12891-024-07884-2.


References
1.
Rossouw J, Anderson G, Prentice R, LaCroix A, Kooperberg C, Stefanick M . Risks and benefits of estrogen plus progestin in healthy postmenopausal women: principal results From the Women's Health Initiative randomized controlled trial. JAMA. 2002; 288(3):321-33. DOI: 10.1001/jama.288.3.321. View

2.
Grimes D, Schulz K . Bias and causal associations in observational research. Lancet. 2002; 359(9302):248-52. DOI: 10.1016/S0140-6736(02)07451-2. View

3.
Kirby M, Spritzer C . Radiographic detection of hip and pelvic fractures in the emergency department. AJR Am J Roentgenol. 2010; 194(4):1054-60. DOI: 10.2214/AJR.09.3295. View

4.
Madani A, Arnaout R, Mofrad M, Arnaout R . Fast and accurate view classification of echocardiograms using deep learning. NPJ Digit Med. 2019; 1. PMC: 6395045. DOI: 10.1038/s41746-017-0013-1. View

5.
Abramoff M, Lavin P, Birch M, Shah N, Folk J . Pivotal trial of an autonomous AI-based diagnostic system for detection of diabetic retinopathy in primary care offices. NPJ Digit Med. 2019; 1:39. PMC: 6550188. DOI: 10.1038/s41746-018-0040-6. View