» Articles » PMID: 24995866

Rare-variant Association Analysis: Study Designs and Statistical Tests

Overview
Journal Am J Hum Genet
Publisher Cell Press
Specialty Genetics
Date 2014 Jul 5
PMID 24995866
Citations 540
Authors
Affiliations
Soon will be listed here.
Abstract

Despite the extensive discovery of trait- and disease-associated common variants, much of the genetic contribution to complex traits remains unexplained. Rare variants can explain additional disease risk or trait variability. An increasing number of studies are underway to identify trait- and disease-associated rare variants. In this review, we provide an overview of statistical issues in rare-variant association studies with a focus on study designs and statistical tests. We present the design and analysis pipeline of rare-variant studies and review cost-effective sequencing designs and genotyping platforms. We compare various gene- or region-based association tests, including burden tests, variance-component tests, and combined omnibus tests, in terms of their assumptions and performance. Also discussed are the related topics of meta-analysis, population-stratification adjustment, genotype imputation, follow-up studies, and heritability due to rare variants. We provide guidelines for analysis and discuss some of the challenges inherent in these studies and future research directions.

Citing Articles

A Minimax Optimal Ridge-Type Set Test for Global Hypothesis with Applications in Whole Genome Sequencing Association Studies.

Liu Y, Li Z, Lin X J Am Stat Assoc. 2025; 117(538):897-908.

PMID: 40017563 PMC: 11865954. DOI: 10.1080/01621459.2020.1831926.


Ensemble methods for testing a global null.

Liu Y, Liu Z, Lin X J R Stat Soc Series B Stat Methodol. 2025; 86(2):461-486.

PMID: 40012608 PMC: 11864748. DOI: 10.1093/jrsssb/qkad131.


Assessment of the functionality and usability of open-source rare variant analysis pipelines.

Riccio C, Jansen M, Thalen F, Koliopanos G, Link V, Ziegler A Brief Bioinform. 2025; 26(1).

PMID: 39907318 PMC: 11795309. DOI: 10.1093/bib/bbaf044.


Exome sequencing of UK birth cohorts.

Koko M, Fabian L, Popov I, Eberhardt R, Zakharov G, Huang Q Wellcome Open Res. 2025; 9:390.

PMID: 39839975 PMC: 11747307. DOI: 10.12688/wellcomeopenres.22697.2.


The Role of and Ultra-Rare Variants in Hirschsprung Disease (HSCR): Extended Gene Discovery for Risk Profiling of Patients.

Fu M, Berk-Rauch H, Chatterjee S, Chakravarti A medRxiv. 2025; .

PMID: 39830246 PMC: 11741498. DOI: 10.1101/2025.01.07.25320162.


References
1.
Pruitt K, Harrow J, Harte R, Wallin C, Diekhans M, Maglott D . The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes. Genome Res. 2009; 19(7):1316-23. PMC: 2704439. DOI: 10.1101/gr.080531.108. View

2.
Lin D, Zeng D, Tang Z . Quantitative trait analysis in sequencing studies under trait-dependent sampling. Proc Natl Acad Sci U S A. 2013; 110(30):12247-52. PMC: 3725118. DOI: 10.1073/pnas.1221713110. View

3.
Lee J, Parkes M . Genome-wide association studies and Crohn's disease. Brief Funct Genomics. 2011; 10(2):71-6. DOI: 10.1093/bfgp/elr009. View

4.
Lee S, Wright F, Zou F . Control of population stratification by correlation-selected principal components. Biometrics. 2010; 67(3):967-74. PMC: 3117098. DOI: 10.1111/j.1541-0420.2010.01520.x. View

5.
Quintana M, Berstein J, Thomas D, Conti D . Incorporating model uncertainty in detecting rare variants: the Bayesian risk index. Genet Epidemiol. 2011; 35(7):638-49. PMC: 3936341. DOI: 10.1002/gepi.20613. View