
A COMPARISON OF PRINCIPAL COMPONENT METHODS BETWEEN MULTIPLE PHENOTYPE REGRESSION AND MULTIPLE SNP REGRESSION IN GENETIC ASSOCIATION STUDIES

Overview
Journal Ann Appl Stat
Date 2023 Jul 3
PMID 37398898
Abstract

Principal component analysis (PCA) is a popular method for dimension reduction in unsupervised multivariate analysis. However, existing ad hoc uses of PCA in both multivariate regression (multiple outcomes) and multiple regression (multiple predictors) lack theoretical justification, and the differences in the statistical properties of PCA between these two regression settings are not well understood. In this paper we provide theoretical results on the power of PCA in genetic association testing in both the multiple phenotype and SNP-set settings. The multiple phenotype setting refers to the case in which one is interested in the association between a single SNP and multiple phenotypes as outcomes. The SNP-set setting refers to the case in which one is interested in the association between multiple SNPs in a SNP set and a single phenotype as the outcome. We demonstrate analytically that the properties of PC-based analysis in these two regression settings are substantially different. We show that the lower-order PCs, that is, PCs with large eigenvalues, are generally preferred and lead to higher power in the SNP-set setting, while the higher-order PCs, that is, PCs with small eigenvalues, are generally preferred in the multiple phenotype setting. We also investigate the power of three other popular statistical methods, the Wald test, the variance component test, and the minimum p-value test, in both the multiple phenotype and SNP-set settings. We use theoretical power, simulation studies, and two real data analyses to validate our findings.
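
The two settings described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the simulated genotypes and phenotypes, the effect sizes, and the use of a per-PC simple-regression t-statistic are all assumptions made for illustration. In the multiple phenotype setting the PCs are computed from the phenotype matrix and each PC is regressed on the single SNP; in the SNP-set setting the PCs are computed from the genotype matrix and the single phenotype is regressed on each PC. PCs are ordered so that index 0 is the lowest-order PC (largest eigenvalue).

```python
import numpy as np

def principal_components(M):
    """Return (eigenvalues, PC scores) for the columns of M, largest eigenvalue first."""
    Mc = M - M.mean(axis=0)                    # center each column
    cov = np.cov(Mc, rowvar=False)             # sample covariance of the columns
    evals, evecs = np.linalg.eigh(cov)         # eigh returns ascending eigenvalues
    order = np.argsort(evals)[::-1]            # reorder to descending
    evals, evecs = evals[order], evecs[:, order]
    return evals, Mc @ evecs                   # scores: one column per PC

def slope_t_stat(x, y):
    """t-statistic for the slope in a simple linear regression of y on x."""
    n = len(x)
    r = np.corrcoef(x, y)[0, 1]
    return r * np.sqrt((n - 2) / (1.0 - r ** 2))

def pc_tests_multiple_phenotype(snp, Y):
    """One SNP, many phenotypes: regress each phenotype PC on the SNP."""
    evals, pcs = principal_components(Y)
    t = np.array([slope_t_stat(snp, pcs[:, k]) for k in range(pcs.shape[1])])
    return evals, t

def pc_tests_snp_set(G, y):
    """Many SNPs, one phenotype: regress the phenotype on each genotype PC."""
    evals, pcs = principal_components(G)
    t = np.array([slope_t_stat(pcs[:, k], y) for k in range(pcs.shape[1])])
    return evals, t

# Simulated data; every number below is a hypothetical choice for illustration.
rng = np.random.default_rng(0)
n = 500

# Multiple phenotype setting: 3 correlated phenotypes, one SNP coded 0/1/2.
snp = rng.binomial(2, 0.3, size=n).astype(float)
shared = rng.normal(size=n)                    # shared factor inducing correlation
Y = np.column_stack([shared + rng.normal(size=n) for _ in range(3)])
Y[:, 0] += 0.2 * snp                           # assumed SNP effect on phenotype 1
evals_y, t_y = pc_tests_multiple_phenotype(snp, Y)

# SNP-set setting: 4 SNPs, one phenotype.
G = rng.binomial(2, 0.3, size=(n, 4)).astype(float)
y = G @ np.array([0.2, 0.0, 0.0, 0.1]) + rng.normal(size=n)
evals_g, t_g = pc_tests_snp_set(G, y)

print("phenotype PCs: eigenvalues", evals_y, "t-statistics", t_y)
print("SNP-set PCs:   eigenvalues", evals_g, "t-statistics", t_g)
```

Comparing the per-PC statistics against the eigenvalue ordering across many simulated replicates is one way to explore empirically which PCs carry the association signal in each setting; a single replicate such as this one is not evidence either way.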

Citing Articles

A Bayesian fine-mapping model using a continuous global-local shrinkage prior with applications in prostate cancer analysis.

Li X, Sham P, Zhang Y. Am J Hum Genet. 2024; 111(2):213-226.

PMID: 38171363 PMC: 10870138. DOI: 10.1016/j.ajhg.2023.12.007.


Similarity-based multimodal regression.

Chen A, Weinstein S, Adebimpe A, Gur R, Gur R, Merikangas K. Biostatistics. 2023; 25(4):1122-1139.

PMID: 38058018 PMC: 11471965. DOI: 10.1093/biostatistics/kxad033.


Differences in set-based tests for sparse alternatives when testing sets of outcomes compared to sets of explanatory factors in genetic association studies.

Sun R, Shi A, Lin X. Biostatistics. 2022; 25(1):171-187.

PMID: 36000269 PMC: 10724113. DOI: 10.1093/biostatistics/kxac036.


A Multi-Marker Test for Analyzing Paired Genetic Data in Transplantation.

Arthur V, Li Z, Cao R, Oetting W, Israni A, Jacobson P. Front Genet. 2021; 12:745773.

PMID: 34721531 PMC: 8548646. DOI: 10.3389/fgene.2021.745773.


An Omnibus Test for Detecting Multiple Phenotype Associations Based on GWAS Summary Level Data.

Liu W, Guo Y, Liu Z. Front Genet. 2021; 12:644419.

PMID: 33815478 PMC: 8009968. DOI: 10.3389/fgene.2021.644419.
