High-throughput Genetic Clustering of Type 2 Diabetes Loci Reveals Heterogeneous Mechanistic Pathways of Metabolic Disease
Overview
Authors
Affiliations
Aims/hypothesis: Type 2 diabetes is highly polygenic and influenced by multiple biological pathways. Rapid expansion in the number of type 2 diabetes loci can be leveraged to identify such pathways.
Methods: We developed a high-throughput pipeline to enable clustering of type 2 diabetes loci based on variant-trait associations. Our pipeline extracted summary statistics from genome-wide association studies (GWAS) for type 2 diabetes and related traits to generate a matrix of 323 variants × 64 trait associations and applied Bayesian non-negative matrix factorisation (bNMF) to identify genetic components of type 2 diabetes. Epigenomic enrichment analysis was performed in 28 cell types and single pancreatic cells. We generated cluster-specific polygenic scores and performed regression analysis in an independent cohort (N=25,419) to assess for clinical relevance.
Results: We identified ten clusters of genetic loci, recapturing the five from our prior analysis as well as novel clusters related to beta cell dysfunction, pronounced insulin secretion, and levels of alkaline phosphatase, lipoprotein A and sex hormone-binding globulin. Four clusters related to mechanisms of insulin deficiency, five to insulin resistance and one had an unclear mechanism. The clusters displayed tissue-specific epigenomic enrichment, notably with the two beta cell clusters differentially enriched in functional and stressed pancreatic beta cell states. Additionally, cluster-specific polygenic scores were differentially associated with patient clinical characteristics and outcomes. The pipeline was applied to coronary artery disease and chronic kidney disease, identifying multiple overlapping clusters with type 2 diabetes.
Conclusions/interpretation: Our approach stratifies type 2 diabetes loci into physiologically interpretable genetic clusters associated with distinct tissues and clinical outcomes. The pipeline allows for efficient updating as additional GWAS become available and can be readily applied to other conditions, facilitating clinical translation of GWAS findings. Software to perform this clustering pipeline is freely available.
Buraczynska M, Boczkowska S, Zaluska W Diabetes Metab Syndr Obes. 2025; 18:653-661.
PMID: 40034481 PMC: 11874986. DOI: 10.2147/DMSO.S506639.
Loesch D, Garg M, Matelska D, Vitsios D, Jiang X, Ritchie S Nat Commun. 2025; 16(1):2124.
PMID: 40032831 PMC: 11876343. DOI: 10.1038/s41467-025-56695-z.
Identifying chronic obstructive pulmonary disease subtypes using multi-trait genetics.
Ziyatdinov A, Hobbs B, Kanaan-Izquierdo S, Moll M, Sakornsakolpat P, Shrine N EBioMedicine. 2025; 113:105609.
PMID: 40010152 PMC: 11905855. DOI: 10.1016/j.ebiom.2025.105609.
Lessons Learned From Epidemiology of Type 2 Diabetes in South Asians: Kelly West Award Lecture 2024.
Mohan V Diabetes Care. 2025; 48(2):153-163.
PMID: 39841965 PMC: 11770170. DOI: 10.2337/dci24-0046.
Genetic basis of early onset and progression of type 2 diabetes in South Asians.
Hodgson S, Williamson A, Bigossi M, Stow D, Jacobs B, Samuel M Nat Med. 2024; 31(1):323-331.
PMID: 39592779 PMC: 11750703. DOI: 10.1038/s41591-024-03317-8.