» Articles » PMID: 33608043

Performance of Model-based Multifactor Dimensionality Reduction Methods for Epistasis Detection by Controlling Population Structure

Overview
Journal BioData Min
Publisher Biomed Central
Specialty Biology
Date 2021 Feb 20
PMID 33608043
Citations 3
Authors
Affiliations
Soon will be listed here.
Abstract

Background: In genome-wide association studies the extent and impact of confounding due to population structure have been well recognized. Inadequate handling of such confounding is likely to lead to spurious associations, hampering replication, and the identification of causal variants. Several strategies have been developed for protecting associations against confounding, the most popular one is based on Principal Component Analysis. In contrast, the extent and impact of confounding due to population structure in gene-gene interaction association epistasis studies are much less investigated and understood. In particular, the role of nonlinear genetic population substructure in epistasis detection is largely under-investigated, especially outside a regression framework.

Methods: To identify causal variants in synergy, to improve interpretability and replicability of epistasis results, we introduce three strategies based on a model-based multifactor dimensionality reduction approach for structured populations, namely MBMDR-PC, MBMDR-PG, and MBMDR-GC.

Results: Simulation results comparing the performance of various approaches show that in the presence of population structure MBMDR-PC and MBMDR-PG consistently better control type I error rate at the nominal level than MBMDR-GC. Moreover, our proposed three methods of population structure correction outperform MDR-SP in terms of statistical power.

Conclusion: We demonstrate through extensive simulation studies the effect of various degrees of genetic population structure and relatedness on epistasis detection and propose appropriate remedial measures based on linear and nonlinear sample genetic similarity.

Citing Articles

Considerations in the search for epistasis.

Balvert M, Cooper-Knock J, Stamp J, Byrne R, Mourragui S, van Gils J Genome Biol. 2024; 25(1):296.

PMID: 39563431 PMC: 11574992. DOI: 10.1186/s13059-024-03427-z.


* and DOCK1* gene-gene interactions associated with rheumatoid arthritis in the focal adhesion pathway.

Veyssiere M, Rodriguez Ordonez M, Chalabi S, Michou L, Cornelis F, Boland A Front Genet. 2024; 15:1375036.

PMID: 38803542 PMC: 11128622. DOI: 10.3389/fgene.2024.1375036.


Roles of interacting stress-related genes in lifespan regulation: insights for translating experimental findings to humans.

Yashin A, Wu D, Arbeev K, Yashkin A, Akushevich I, Bagley O J Transl Genet Genom. 2021; 5(4):357-379.

PMID: 34825130 PMC: 8612394.

References
1.
Horvath S, Xu X, Laird N . The family based association test method: strategies for studying general genotype--phenotype associations. Eur J Hum Genet. 2001; 9(4):301-6. DOI: 10.1038/sj.ejhg.5200625. View

2.
Abegaz F, Chaichoompu K, Genin E, Fardo D, Konig I, Mahachie John J . Principals about principal components in statistical genetics. Brief Bioinform. 2018; 20(6):2200-2216. DOI: 10.1093/bib/bby081. View

3.
Zhang B, Zhang J, Liu J . BLOCK-BASED BAYESIAN EPISTASIS ASSOCIATION MAPPING WITH APPLICATION TO WTCCC TYPE 1 DIABETES DATA. Ann Appl Stat. 2011; 5(3):2052-2077. PMC: 3226821. DOI: 10.1214/11-AOAS469. View

4.
Wan X, Yang C, Yang Q, Xue H, Fan X, Tang N . BOOST: A fast approach to detecting gene-gene interactions in genome-wide case-control studies. Am J Hum Genet. 2010; 87(3):325-40. PMC: 2933337. DOI: 10.1016/j.ajhg.2010.07.021. View

5.
Kang H, Sul J, Service S, Zaitlen N, Kong S, Freimer N . Variance component model to account for sample structure in genome-wide association studies. Nat Genet. 2010; 42(4):348-54. PMC: 3092069. DOI: 10.1038/ng.548. View