» Articles » PMID: 37486632

Validation of a Proprietary Deterioration Index Model and Performance in Hospitalized Adults

Abstract

Importance: The Deterioration Index (DTI), used by hospitals for predicting patient deterioration, has not been extensively validated externally, raising concerns about performance and equitable predictions.

Objective: To locally validate DTI performance and assess its potential for bias in predicting patient clinical deterioration.

Design, Setting, And Participants: This retrospective prognostic study included 13 737 patients admitted to 8 heterogenous Midwestern US hospitals varying in size and type, including academic, community, urban, and rural hospitals. Patients were 18 years or older and admitted between January 1 and May 31, 2021.

Exposure: DTI predictions made every 15 minutes.

Main Outcomes And Measures: Deterioration, defined as the occurrence of any of the following while hospitalized: mechanical ventilation, intensive care unit transfer, or death. Performance of the DTI was evaluated using area under the receiver operating characteristic curve (AUROC) and area under the precision recall curve (AUPRC). Bias measures were calculated across demographic subgroups.

Results: A total of 5 143 513 DTI predictions were made for 13 737 patients across 14 834 hospitalizations. Among 13 918 encounters, the mean (SD) age of patients was 60.3 (19.2) years; 7636 (54.9%) were female, 11 345 (81.5%) were White, and 12 392 (89.0%) were of other ethnicity than Hispanic or Latino. The prevalence of deterioration was 10.3% (n = 1436). The DTI produced AUROCs of 0.759 (95% CI, 0.756-0.762) at the observation level and 0.685 (95% CI, 0.671-0.700) at the encounter level. Corresponding AUPRCs were 0.039 (95% CI, 0.037-0.040) at the observation level and 0.248 (95% CI, 0.227-0.273) at the encounter level. Bias measures varied across demographic subgroups and were 14.0% worse for patients identifying as American Indian or Alaska Native and 19.0% worse for those who chose not to disclose their ethnicity.

Conclusions And Relevance: In this prognostic study, the DTI had modest ability to predict patient deterioration, with varying degrees of performance at the observation and encounter levels and across different demographic groups. Disparate performance across subgroups suggests the need for more transparency in model training data and reinforces the need to locally validate externally developed prediction models.

Citing Articles

Mind the Gap: Wearable Lactate and Glucose Monitors for Hospitalized Patients.

Guzzi J, Falter F, Kumar A, Perrino Jr A Cureus. 2025; 17(2):e78536.

PMID: 40062120 PMC: 11886925. DOI: 10.7759/cureus.78536.


Comparison of 1-year mortality predictions from vendor-supplied academic model for cancer patients.

Gensheimer M, Lu J, Ramchandran K PeerJ. 2025; 13:e18958.

PMID: 39959833 PMC: 11827575. DOI: 10.7717/peerj.18958.


Rebooting artificial intelligence for health.

Mitchell W, Wawira J, Celi L PLOS Glob Public Health. 2025; 5(1):e0004171.

PMID: 39823406 PMC: 11741560. DOI: 10.1371/journal.pgph.0004171.


Trustworthiness of a machine learning early warning model in medical and surgical inpatients.

Caraballo P, Meehan A, Fischer K, Rahman P, Simon G, Melton G JAMIA Open. 2025; 8(1):ooae156.

PMID: 39764169 PMC: 11702360. DOI: 10.1093/jamiaopen/ooae156.


Recommendations for Clinicians, Technologists, and Healthcare Organizations on the Use of Generative Artificial Intelligence in Medicine: A Position Statement from the Society of General Internal Medicine.

Crowe B, Shah S, Teng D, Ma S, DeCamp M, Rosenberg E J Gen Intern Med. 2024; 40(3):694-702.

PMID: 39531100 PMC: 11861482. DOI: 10.1007/s11606-024-09102-0.


References
1.
Muralitharan S, Nelson W, Di S, McGillion M, Devereaux P, Barr N . Machine Learning-Based Early Warning Systems for Clinical Deterioration: Systematic Scoping Review. J Med Internet Res. 2021; 23(2):e25187. PMC: 7892287. DOI: 10.2196/25187. View

2.
Acosta A, Garg S, Pham H, Whitaker M, Anglin O, OHalloran A . Racial and Ethnic Disparities in Rates of COVID-19-Associated Hospitalization, Intensive Care Unit Admission, and In-Hospital Death in the United States From March 2020 to February 2021. JAMA Netw Open. 2021; 4(10):e2130479. PMC: 8531997. DOI: 10.1001/jamanetworkopen.2021.30479. View

3.
Linnen D, Escobar G, Hu X, Scruth E, Liu V, Stephens C . Statistical Modeling and Aggregate-Weighted Scoring Systems in Prediction of Mortality and ICU Transfer: A Systematic Review. J Hosp Med. 2019; 14(3):161-169. PMC: 6628701. DOI: 10.12788/jhm.3151. View

4.
Mukherjee P, Shen T, Liu J, Mathai T, Shafaat O, Summers R . Confounding factors need to be accounted for in assessing bias by machine learning algorithms. Nat Med. 2022; 28(6):1159-1160. DOI: 10.1038/s41591-022-01847-7. View

5.
Verma A, Pou-Prom C, McCoy L, Murray J, Nestor B, Bell S . Developing and Validating a Prediction Model For Death or Critical Illness in Hospitalized Adults, an Opportunity for Human-Computer Collaboration. Crit Care Explor. 2023; 5(5):e0897. PMC: 10155889. DOI: 10.1097/CCE.0000000000000897. View