» Articles » PMID: 29040274

A Quadratically Regularized Functional Canonical Correlation Analysis for Identifying the Global Structure of Pleiotropy with NGS Data

Overview
Specialty Biology
Date 2017 Oct 18
PMID 29040274
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

Investigating the pleiotropic effects of genetic variants can increase statistical power, provide important information to achieve deep understanding of the complex genetic structures of disease, and offer powerful tools for designing effective treatments with fewer side effects. However, the current multiple phenotype association analysis paradigm lacks breadth (number of phenotypes and genetic variants jointly analyzed at the same time) and depth (hierarchical structure of phenotype and genotypes). A key issue for high dimensional pleiotropic analysis is to effectively extract informative internal representation and features from high dimensional genotype and phenotype data. To explore correlation information of genetic variants, effectively reduce data dimensions, and overcome critical barriers in advancing the development of novel statistical methods and computational algorithms for genetic pleiotropic analysis, we proposed a new statistic method referred to as a quadratically regularized functional CCA (QRFCCA) for association analysis which combines three approaches: (1) quadratically regularized matrix factorization, (2) functional data analysis and (3) canonical correlation analysis (CCA). Large-scale simulations show that the QRFCCA has a much higher power than that of the ten competing statistics while retaining the appropriate type 1 errors. To further evaluate performance, the QRFCCA and ten other statistics are applied to the whole genome sequencing dataset from the TwinsUK study. We identify a total of 79 genes with rare variants and 67 genes with common variants significantly associated with the 46 traits using QRFCCA. The results show that the QRFCCA substantially outperforms the ten other statistics.

Citing Articles

DNA Methylation Changes and Phenotypic Adaptations Induced Repeated Extreme Altitude Exposure at 8848 Meters.

Zhang S, Yang L, Duoji Z, Qiangba D, Hu X, Jiang Z Int J Mol Sci. 2024; 25(23).

PMID: 39684363 PMC: 11641581. DOI: 10.3390/ijms252312652.


UGT1A1 genetic variants are associated with increases in bilirubin levels in rheumatoid arthritis patients treated with sarilumab.

Lin N, Damask A, Boyapati A, Hamilton J, Hamon S, Ternes N Pharmacogenomics J. 2022; 22(3):160-165.

PMID: 35149777 PMC: 9151390. DOI: 10.1038/s41397-022-00269-5.


Integrative functional linear model for genome-wide association studies with multiple traits.

Li Y, Wang F, Wu M, Ma S Biostatistics. 2020; 23(2):574-590.

PMID: 33040145 PMC: 9007435. DOI: 10.1093/biostatistics/kxaa043.


Bivariate Causal Discovery and Its Applications to Gene Expression and Imaging Data Analysis.

Jiao R, Lin N, Hu Z, Bennett D, Jin L, Xiong M Front Genet. 2018; 9:347.

PMID: 30233639 PMC: 6127271. DOI: 10.3389/fgene.2018.00347.

References
1.
Ye B, Ge Y, Perens G, Hong L, Xu H, Fishbein M . Canonical Wnt/β-catenin signaling in epicardial fibrosis of failed pediatric heart allografts with diastolic dysfunction. Cardiovasc Pathol. 2012; 22(1):54-7. PMC: 3427707. DOI: 10.1016/j.carpath.2012.03.004. View

2.
Walter K, Min J, Huang J, Crooks L, Memari Y, McCarthy S . The UK10K project identifies rare variants in health and disease. Nature. 2015; 526(7571):82-90. PMC: 4773891. DOI: 10.1038/nature14962. View

3.
Furlotte N, Eskin E . Efficient multiple-trait association and estimation of genetic correlation using the matrix-variate linear mixed model. Genetics. 2015; 200(1):59-68. PMC: 4423381. DOI: 10.1534/genetics.114.171447. View

4.
Vuckovic D, Gasparini P, Soranzo N, Iotchkova V . MultiMeta: an R package for meta-analyzing multi-phenotype genome-wide association studies. Bioinformatics. 2015; 31(16):2754-6. PMC: 4528637. DOI: 10.1093/bioinformatics/btv222. View

5.
Kim J, Bai Y, Pan W . An Adaptive Association Test for Multiple Phenotypes with GWAS Summary Statistics. Genet Epidemiol. 2015; 39(8):651-63. PMC: 4715495. DOI: 10.1002/gepi.21931. View