» Articles » PMID: 36368478

Identifying Type 1 and 2 Diabetes in Research Datasets Where Classification Biomarkers Are Unavailable: Assessing the Accuracy of Published Approaches

Overview
Publisher Elsevier
Specialty Public Health
Date 2022 Nov 11
PMID 36368478
Authors
Affiliations
Soon will be listed here.
Abstract

Objectives: We aimed to compare the performance of approaches for classifying insulin-treated diabetes within research datasets without measured classification biomarkers, evaluated against two independent biological definitions of diabetes type.

Study Design And Setting: We compared accuracy of ten reported approaches for classifying insulin-treated diabetes into type 1 (T1D) and type 2 (T2D) diabetes in two cohorts: UK Biobank (UKBB) n = 26,399 and Diabetes Alliance for Research in England (DARE) n = 1,296. The overall performance for classifying T1D and T2D was assessed using: a T1D genetic risk score and genetic stratification method (UKBB); C-peptide measured at >3 years diabetes duration (DARE).

Results: Approaches' accuracy ranged from 71% to 88% (UKBB) and 68% to 88% (DARE). When classifying all participants, combining early insulin requirement with a T1D probability model (incorporating diagnosis age and body image issue [BMI]), and interview-reported diabetes type (UKBB available in only 15%) consistently achieved high accuracy (UKBB 87% and 87% and DARE 85% and 88%, respectively). For identifying T1D with minimal misclassification, models with high thresholds or young diagnosis age (<20 years) had highest performance. Findings were incorporated into an online tool identifying optimum approaches based on variable availability.

Conclusion: Models combining continuous features with early insulin requirement are the most accurate methods for classifying insulin-treated diabetes in research datasets without measured classification biomarkers.

Citing Articles

Investigating misclassification of type 1 diabetes in a population-based cohort of British Pakistanis and Bangladeshis using polygenic risk scores.

Liu T, Sankareswaran A, Paterson G, Fraser D, Hodgson S, Huang Q Sci Rep. 2025; 15(1):1168.

PMID: 39805939 PMC: 11729895. DOI: 10.1038/s41598-024-80348-8.


Treatment outcomes with oral anti-hyperglycaemic therapies in people with diabetes secondary to a pancreatic condition (type 3c diabetes): A population-based cohort study.

Hopkins R, Young K, Thomas N, Jones A, Hattersley A, Shields B Diabetes Obes Metab. 2025; 27(3):1544-1553.

PMID: 39762966 PMC: 11802396. DOI: 10.1111/dom.16163.


Type 1 diabetes genetic risk score variation across ancestries using whole genome sequencing and array-based approaches.

Arni A, Fraser D, Sharp S, Oram R, Johnson M, Weedon M Sci Rep. 2024; 14(1):31044.

PMID: 39730838 PMC: 11680773. DOI: 10.1038/s41598-024-82278-x.


Cause-specific death in adults with type 1 diabetes and type 2 diabetes: Insights from the UK Biobank.

Ostrominski J, Cho S, Vaduganathan M, Honigberg M Diabetes Obes Metab. 2024; 27(1):422-427.

PMID: 39402726 PMC: 11620920. DOI: 10.1111/dom.16009.


Characterization of genetic variants of GIPR reveals a contribution of β-arrestin to metabolic phenotypes.

Kizilkaya H, Sorensen K, Madsen J, Lindquist P, Douros J, Bork-Jensen J Nat Metab. 2024; 6(7):1268-1281.

PMID: 38871982 PMC: 11272584. DOI: 10.1038/s42255-024-01061-4.