Identifying Type 1 and 2 Diabetes in Research Datasets Where Classification Biomarkers Are Unavailable: Assessing the Accuracy of Published Approaches
Overview
Authors
Affiliations
Objectives: We aimed to compare the performance of approaches for classifying insulin-treated diabetes within research datasets without measured classification biomarkers, evaluated against two independent biological definitions of diabetes type.
Study Design And Setting: We compared accuracy of ten reported approaches for classifying insulin-treated diabetes into type 1 (T1D) and type 2 (T2D) diabetes in two cohorts: UK Biobank (UKBB) n = 26,399 and Diabetes Alliance for Research in England (DARE) n = 1,296. The overall performance for classifying T1D and T2D was assessed using: a T1D genetic risk score and genetic stratification method (UKBB); C-peptide measured at >3 years diabetes duration (DARE).
Results: Approaches' accuracy ranged from 71% to 88% (UKBB) and 68% to 88% (DARE). When classifying all participants, combining early insulin requirement with a T1D probability model (incorporating diagnosis age and body image issue [BMI]), and interview-reported diabetes type (UKBB available in only 15%) consistently achieved high accuracy (UKBB 87% and 87% and DARE 85% and 88%, respectively). For identifying T1D with minimal misclassification, models with high thresholds or young diagnosis age (<20 years) had highest performance. Findings were incorporated into an online tool identifying optimum approaches based on variable availability.
Conclusion: Models combining continuous features with early insulin requirement are the most accurate methods for classifying insulin-treated diabetes in research datasets without measured classification biomarkers.
Liu T, Sankareswaran A, Paterson G, Fraser D, Hodgson S, Huang Q Sci Rep. 2025; 15(1):1168.
PMID: 39805939 PMC: 11729895. DOI: 10.1038/s41598-024-80348-8.
Hopkins R, Young K, Thomas N, Jones A, Hattersley A, Shields B Diabetes Obes Metab. 2025; 27(3):1544-1553.
PMID: 39762966 PMC: 11802396. DOI: 10.1111/dom.16163.
Arni A, Fraser D, Sharp S, Oram R, Johnson M, Weedon M Sci Rep. 2024; 14(1):31044.
PMID: 39730838 PMC: 11680773. DOI: 10.1038/s41598-024-82278-x.
Ostrominski J, Cho S, Vaduganathan M, Honigberg M Diabetes Obes Metab. 2024; 27(1):422-427.
PMID: 39402726 PMC: 11620920. DOI: 10.1111/dom.16009.
Kizilkaya H, Sorensen K, Madsen J, Lindquist P, Douros J, Bork-Jensen J Nat Metab. 2024; 6(7):1268-1281.
PMID: 38871982 PMC: 11272584. DOI: 10.1038/s42255-024-01061-4.