» Articles » PMID: 38834334

Identifying Subtypes of Type 2 Diabetes Mellitus with Machine Learning: Development, Internal Validation, Prognostic Validation and Medication Burden in Linked Electronic Health Records in 420 448 Individuals

Overview
Specialty Endocrinology
Date 2024 Jun 4
PMID 38834334
Authors
Affiliations
Soon will be listed here.
Abstract

Introduction: None of the studies of type 2 diabetes (T2D) subtyping to date have used linked population-level data for incident and prevalent T2D, incorporating a diverse set of variables, explainable methods for cluster characterization, or adhered to an established framework. We aimed to develop and validate machine learning (ML)-informed subtypes for type 2 diabetes mellitus (T2D) using nationally representative data.

Research Design And Methods: In population-based electronic health records (2006-2020; Clinical Practice Research Datalink) in individuals ≥18 years with incident T2D (n=420 448), we included factors (n=3787), including demography, history, examination, biomarkers and medications. Using a published framework, we identified subtypes through nine unsupervised ML methods (K-means, K-means++, K-mode, K-prototype, mini-batch, agglomerative hierarchical clustering, Birch, Gaussian mixture models, and consensus clustering). We characterized clusters using intracluster distributions and explainable artificial intelligence (AI) techniques. We evaluated subtypes for (1) internal validity (within dataset; across methods); (2) prognostic validity (prediction for 5-year all-cause mortality, hospitalization and new chronic diseases); and (3) medication burden.

Results: : We identified four T2D subtypes: metabolic, early onset, late onset and cardiometabolic. : Subtypes were predicted with high accuracy (F1 score >0.98). : 5-year all-cause mortality, hospitalization, new chronic disease incidence and medication burden differed across T2D subtypes. Compared with the metabolic subtype, 5-year risks of mortality and hospitalization in incident T2D were highest in late-onset subtype (HR 1.95, 1.85-2.05 and 1.66, 1.58-1.75) and lowest in early-onset subtype (1.18, 1.11-1.27 and 0.85, 0.80-0.90). Incidence of chronic diseases was highest in late-onset subtype and lowest in early-onset subtype. : Compared with the metabolic subtype, after adjusting for age, sex, and pre-T2D medications, late-onset subtype (1.31, 1.28-1.35) and early-onset subtype (0.83, 0.81-0.85) were most and least likely, respectively, to be prescribed medications within 5 years following T2D onset.

Conclusions: In the largest study using ML to date in incident T2D, we identified four distinct subtypes, with potential future implications for etiology, therapeutics, and risk prediction.

Citing Articles

Intra-Individual Variations in How Insulin Sensitivity Responds to Long-Term Exercise: Predictions by Machine Learning Based on Large-Scale Serum Proteomics.

Viken J, Olsen T, Drevon C, Hjorth M, Birkeland K, Norheim F Metabolites. 2024; 14(6).

PMID: 38921470 PMC: 11206077. DOI: 10.3390/metabo14060335.

References
1.
Leslie R, Ma R, Franks P, Nadeau K, Pearson E, Redondo M . Understanding diabetes heterogeneity: key steps towards precision medicine in diabetes. Lancet Diabetes Endocrinol. 2023; 11(11):848-860. DOI: 10.1016/S2213-8587(23)00159-6. View

2.
Banerjee A, Chen S, Fatemifar G, Zeina M, Lumbers R, Mielke J . Machine learning for subtype definition and risk prediction in heart failure, acute coronary syndromes and atrial fibrillation: systematic review of validity and clinical utility. BMC Med. 2021; 19(1):85. PMC: 8022365. DOI: 10.1186/s12916-021-01940-7. View

3.
Zheng Y, Ley S, Hu F . Global aetiology and epidemiology of type 2 diabetes mellitus and its complications. Nat Rev Endocrinol. 2017; 14(2):88-98. DOI: 10.1038/nrendo.2017.151. View

4.
Wang Y, Katzmarzyk P, Horswell R, Zhao W, Johnson J, Hu G . Comparison of the heart failure risk stratification performance of the CKD-EPI equation and the MDRD equation for estimated glomerular filtration rate in patients with Type 2 diabetes. Diabet Med. 2015; 33(5):609-20. PMC: 4723290. DOI: 10.1111/dme.12859. View

5.
Ahlqvist E, Prasad R, Groop L . Subtypes of Type 2 Diabetes Determined From Clinical Parameters. Diabetes. 2020; 69(10):2086-2093. DOI: 10.2337/dbi20-0001. View