
Predicting Adverse Outcomes Due to Diabetes Complications with Machine Learning Using Administrative Health Data

Overview
Journal NPJ Digit Med
Date 2021 Feb 13
PMID 33580109
Citations 30
Abstract

Across jurisdictions, governments and health insurance providers hold large amounts of data from patient interactions with the healthcare system. We aimed to develop a machine learning-based model for predicting adverse outcomes due to diabetes complications using administrative health data from the single-payer health system in Ontario, Canada. A Gradient Boosting Decision Tree model was trained on data from 1,029,366 patients, validated on 272,864 patients, and tested on 265,406 patients. Discrimination was assessed using the AUC statistic, and calibration was assessed visually using calibration plots, both overall and across population subgroups. Our model predicting three-year risk of adverse outcomes due to diabetes complications (hyper/hypoglycemia, tissue infection, retinopathy, cardiovascular events, amputation) included 700 features from multiple diverse data sources and had strong discrimination (average test AUC = 77.7, range 77.7-77.9). Through the design and validation of a high-performance model to predict adverse outcomes due to diabetes complications at the population level, we demonstrate the potential of machine learning and administrative health data to inform health planning and healthcare resource allocation for diabetes management.
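The two evaluation ideas named in the abstract, discrimination (AUC) and calibration, can be illustrated with a minimal Python sketch. This is not the authors' code: the function names `auc` and `calibration_bins`, the equal-width binning, and the toy labels and scores are all assumptions made for illustration. The abstract appears to report AUC on a 0 to 100 scale, whereas the sketch returns a value between 0 and 1.

```python
from typing import List, Sequence, Tuple


def auc(y_true: Sequence[int], y_score: Sequence[float]) -> float:
    """AUC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case.
    Ties count as half a win. Quadratic time; fine for a toy example."""
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def calibration_bins(
    y_true: Sequence[int], y_score: Sequence[float], n_bins: int = 10
) -> List[Tuple[float, float, int]]:
    """Group predictions into equal-width risk bins and return, for each
    non-empty bin, (mean predicted risk, observed event rate, count).
    Plotting the first value against the second gives a calibration plot."""
    acc = [[0.0, 0, 0] for _ in range(n_bins)]
    for y, s in zip(y_true, y_score):
        idx = min(int(s * n_bins), n_bins - 1)  # score 1.0 goes in the last bin
        acc[idx][0] += s
        acc[idx][1] += y
        acc[idx][2] += 1
    return [(ps / c, ev / c, c) for ps, ev, c in acc if c > 0]


# Hypothetical toy data: 1 = adverse outcome occurred, 0 = it did not.
y_true = [0, 0, 1, 0, 1, 1, 0, 1]
y_score = [0.1, 0.2, 0.3, 0.4, 0.6, 0.7, 0.8, 0.9]

if __name__ == "__main__":
    print("AUC:", auc(y_true, y_score))
    for mean_pred, observed, count in calibration_bins(y_true, y_score, n_bins=2):
        print(f"predicted {mean_pred:.2f}  observed {observed:.2f}  n={count}")
```

A well-calibrated model has bins whose observed event rate is close to the mean predicted risk. Repeating the binning within each population subgroup would correspond to the subgroup calibration check the abstract mentions.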

Citing Articles

Pathways to chronic disease detection and prediction: Mapping the potential of machine learning to the pathophysiological processes while navigating ethical challenges.

Afrifa-Yamoah E, Adua E, Peprah-Yamoah E, Anto E, Opoku-Yamoah V, Acheampong E Chronic Dis Transl Med. 2025; 11(1):1-21.

PMID: 40051825 PMC: 11880127. DOI: 10.1002/cdt3.137.


Using machine learning to predict outcomes following transcarotid artery revascularization.

Li B, Eisenberg N, Beaton D, Lee D, Al-Omran L, Wijeysundera D Sci Rep. 2025; 15(1):3924.

PMID: 39890848 PMC: 11785798. DOI: 10.1038/s41598-024-81625-2.


Predicting lack of clinical improvement following varicose vein ablation using machine learning.

Li B, Eisenberg N, Beaton D, Lee D, Al-Omran L, Wijeysundera D J Vasc Surg Venous Lymphat Disord. 2024; 13(3):102162.

PMID: 39732288 PMC: 11803835. DOI: 10.1016/j.jvsv.2024.102162.


Global disparities in drug-related adverse events of patients with multiple myeloma: a pharmacovigilance study.

Jaberi-Douraki M, Xu X, Dima D, Ailawadhi S, Anwer F, Mazzoni S Blood Cancer J. 2024; 14(1):223.

PMID: 39706832 PMC: 11661995. DOI: 10.1038/s41408-024-01206-4.


Setting the balance of care for older adults at risk of hospitalization and delayed discharge: A mixed-methods research protocol.

Kuluski K, Jacobson D, Ghazalbash S, Baek J, Rosella L, Mansfield E PLoS One. 2024; 19(12):e0315918.

PMID: 39689096 PMC: 11651538. DOI: 10.1371/journal.pone.0315918.

