» Articles » PMID: 31548641

Model-based Clustering for Identifying Disease-associated SNPs in Case-control Genome-wide Association Studies

Overview
Journal Sci Rep
Specialty Science
Date 2019 Sep 25
PMID 31548641
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Genome-wide association studies (GWASs) aim to detect genetic risk factors for complex human diseases by identifying disease-associated single-nucleotide polymorphisms (SNPs). The traditional SNP-wise approach along with multiple testing adjustment is over-conservative and lack of power in many GWASs. In this article, we proposed a model-based clustering method that transforms the challenging high-dimension-small-sample-size problem to low-dimension-large-sample-size problem and borrows information across SNPs by grouping SNPs into three clusters. We pre-specify the patterns of clusters by minor allele frequencies of SNPs between cases and controls, and enforce the patterns with prior distributions. In the simulation studies our proposed novel model outperforms traditional SNP-wise approach by showing better controls of false discovery rate (FDR) and higher sensitivity. We re-analyzed two real studies to identifying SNPs associated with severe bortezomib-induced peripheral neuropathy (BiPN) in patients with multiple myeloma (MM). The original analysis in the literature failed to identify SNPs after FDR adjustment. Our proposed method not only detected the reported SNPs after FDR adjustment but also discovered a novel BiPN-associated SNP rs4351714 that has been reported to be related to MM in another study.

Citing Articles

An Extended Application of the Fast Multi-Locus Ridge Regression Algorithm in Genome-Wide Association Studies of Categorical Phenotypes.

Zhang J, Shen B, Zhou Z, Cai M, Wu X, Han L Plants (Basel). 2024; 13(17).

PMID: 39274004 PMC: 11397509. DOI: 10.3390/plants13172520.


A machine learning-based SNP-set analysis approach for identifying disease-associated susceptibility loci.

Silva P, Gaudillo J, Vilela J, Roxas-Villanueva R, Tiangco B, Domingo M Sci Rep. 2022; 12(1):15817.

PMID: 36138111 PMC: 9499949. DOI: 10.1038/s41598-022-19708-1.


Radiomics Models Based on Magnetic Resonance Imaging for Prediction of the Response to Bortezomib-Based Therapy in Patients with Multiple Myeloma.

Li Y, Yin P, Liu Y, Hao C, Chen L, Sun C Biomed Res Int. 2022; 2022:6911246.

PMID: 36105939 PMC: 9467708. DOI: 10.1155/2022/6911246.


Paeoniflorin Ameliorates BiPN by Reducing IL6 Levels and Regulating PARKIN-Mediated Mitochondrial Autophagy.

Sun R, Liu J, Yu M, Xia M, Zhang Y, Sun X Drug Des Devel Ther. 2022; 16:2241-2259.

PMID: 35860525 PMC: 9289176. DOI: 10.2147/DDDT.S369111.


Application of Deep Learning in Plant-Microbiota Association Analysis.

Deng Z, Zhang J, Li J, Zhang X Front Genet. 2021; 12:697090.

PMID: 34691142 PMC: 8531731. DOI: 10.3389/fgene.2021.697090.


References
1.
Broyl A, Corthals S, Jongen J, van der Holt B, Kuiper R, de Knegt Y . Mechanisms of peripheral neuropathy associated with bortezomib and vincristine in patients with newly diagnosed multiple myeloma: a prospective analysis of data from the HOVON-65/GMMG-HD4 trial. Lancet Oncol. 2010; 11(11):1057-65. DOI: 10.1016/S1470-2045(10)70206-0. View

2.
Corthals S, Kuiper R, Johnson D, Sonneveld P, Hajek R, van der Holt B . Genetic factors underlying the risk of bortezomib induced peripheral neuropathy in multiple myeloma patients. Haematologica. 2011; 96(11):1728-32. PMC: 3208695. DOI: 10.3324/haematol.2011.041434. View

3.
Lo K, Gottardo R . Flexible empirical Bayes models for differential gene expression. Bioinformatics. 2006; 23(3):328-35. DOI: 10.1093/bioinformatics/btl612. View

4.
Fernando R, Garrick D . Bayesian methods applied to GWAS. Methods Mol Biol. 2013; 1019:237-74. DOI: 10.1007/978-1-62703-447-0_10. View

5.
Raab M, Podar K, Breitkreutz I, Richardson P, Anderson K . Multiple myeloma. Lancet. 2009; 374(9686):324-39. DOI: 10.1016/S0140-6736(09)60221-X. View