Powerful SNP-set Analysis for Case-control Genome-wide Association Studies

Overview
Journal: Am J Hum Genet
Publisher: Cell Press
Specialty: Genetics
Date: 2010 Jun 22
PMID: 20560208
Citations: 360
Abstract

Genome-wide association studies (GWAS) have emerged as popular tools for identifying genetic variants that are associated with disease risk. Standard analysis of a case-control GWAS assesses the association between each individual genotyped SNP and disease risk. However, this approach suffers from limited reproducibility and has difficulty detecting multi-SNP and epistatic effects. As an alternative analytical strategy, we propose grouping SNPs into SNP sets on the basis of proximity to genomic features such as genes or haplotype blocks, then testing the joint effect of each SNP set. Each SNP set is tested with the logistic kernel-machine-based test, which rests on a statistical framework that allows flexible modeling of epistatic and nonlinear SNP effects. This flexibility, together with the ability to adjust naturally for covariate effects, makes our test appealing in comparison to individual-SNP tests and existing multimarker tests. Using simulated data based on the International HapMap Project, we show that SNP-set testing can have improved power over standard individual-SNP analysis under a wide range of settings. In particular, we find that our approach has higher power than individual-SNP analysis when the median correlation between the disease-susceptibility variant and the genotyped SNPs is moderate to high. When the correlation is low, both individual-SNP analysis and SNP-set analysis tend to have low power. We apply SNP-set analysis to the Cancer Genetic Markers of Susceptibility (CGEMS) breast cancer GWAS discovery-phase data.
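The two steps described in the abstract (forming SNP sets by proximity to genes, then testing each set jointly) can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a linear kernel, an intercept-only null model with no covariates, and a permutation p-value in place of the analytic null distribution used by the kernel-machine test. The function names, the `flank` parameter, and the toy data are all hypothetical.

```python
import random


def group_snps_by_gene(snp_positions, genes, flank=0):
    """Map each gene name to the indices of SNPs lying within
    [start - flank, end + flank] of that gene (hypothetical helper)."""
    sets = {}
    for name, (start, end) in genes.items():
        sets[name] = [i for i, pos in enumerate(snp_positions)
                      if start - flank <= pos <= end + flank]
    return sets


def kernel_score_statistic(y, genotypes, snp_idx):
    """Score-type statistic Q = r' K r with linear kernel K = G G'.

    Assumption: the null model has an intercept only, so the residuals
    are r = y - mean(y). With a linear kernel, Q is the sum over SNPs in
    the set of the squared residual-genotype cross-products.
    """
    ybar = sum(y) / len(y)
    resid = [yi - ybar for yi in y]
    q = 0.0
    for j in snp_idx:
        s = sum(r * row[j] for r, row in zip(resid, genotypes))
        q += s * s
    return q


def permutation_pvalue(y, genotypes, snp_idx, n_perm=999, seed=0):
    """Permutation p-value for Q: shuffle case-control labels and count
    how often the permuted statistic is at least the observed one."""
    rng = random.Random(seed)
    observed = kernel_score_statistic(y, genotypes, snp_idx)
    y_perm = list(y)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(y_perm)
        if kernel_score_statistic(y_perm, genotypes, snp_idx) >= observed:
            hits += 1
    return (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Toy data: 3 SNP positions, one hypothetical gene, 4 subjects.
    positions = [100, 250, 900]
    genes = {"GENE_A": (200, 300)}
    snp_sets = group_snps_by_gene(positions, genes, flank=100)

    y = [1, 0, 1, 0]                      # 1 = case, 0 = control
    genotypes = [[2, 0, 1], [0, 1, 0],    # minor-allele counts per SNP
                 [1, 0, 2], [0, 2, 1]]
    for gene, idx in snp_sets.items():
        print(gene, idx, permutation_pvalue(y, genotypes, idx, n_perm=199))
```

Permuting labels is only valid here because the sketch has no covariates. With covariates, the null model would be a fitted logistic regression and the residuals would be taken from that fit, which is the setting the abstract describes.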

Citing Articles

Genetic Architecture Underlying Response to the Fungal Pathogen in Lodgepole Pine, Jack Pine, and Their Hybrids.

Lu M, Feau N, Lind B, Obreht Vidakovic D, Singh P, Aitken S. Evol Appl. 2025; 18(2):e70078.

PMID: 39925618 PMC: 11802335. DOI: 10.1111/eva.70078.


A powerful framework for differential co-expression analysis of general risk factors.

Bass A, Cutler D, Epstein M. bioRxiv. 2024.

PMID: 39677786 PMC: 11642831. DOI: 10.1101/2024.11.29.626006.


Screening the Best Risk Model and Susceptibility SNPs for Chronic Obstructive Pulmonary Disease (COPD) Based on Machine Learning Algorithms.

Yang Z, Zheng Y, Zhang L, Zhao J, Xu W, Wu H. Int J Chron Obstruct Pulmon Dis. 2024; 19:2397-2414.

PMID: 39525518 PMC: 11549878. DOI: 10.2147/COPD.S478634.


Multiome-wide Association Studies: Novel Approaches for Understanding Diseases.

Shao M, Chen K, Zhang S, Tian M, Shen Y, Cao C. Genomics Proteomics Bioinformatics. 2024; 22(5).

PMID: 39471467 PMC: 11630051. DOI: 10.1093/gpbjnl/qzae077.


Spatial pattern and differential expression analysis with spatial transcriptomic data.

Qin F, Luo X, Lu Q, Cai B, Xiao F, Cai G. Nucleic Acids Res. 2024; 52(21):e101.

PMID: 39470725 PMC: 11602167. DOI: 10.1093/nar/gkae962.

