» Articles » PMID: 18466460

Genome-wide Sparse Canonical Correlation of Gene Expression with Genotypes

Overview
Journal BMC Proc
Publisher Biomed Central
Specialty Biology
Date 2008 May 10
PMID 18466460
Citations 23
Authors
Affiliations
Soon will be listed here.
Abstract

There is a growing interest in studying natural variation in human gene expression. Studies mapping genetic determinants of expression profiles are often carried out considering the expression of one gene at a time, an approach that is computationally intensive and may be prone to high false-discovery rate because the number of genes under consideration often exceeds tens of thousands. We present an exploratory method for investigating such data and apply it to the data provided as Problem 1 of Genetic Analysis Workshop 15 (GAW15). In multivariate analysis, canonical correlation analysis is a common way to inspect the relationship between two sets of variables based on their correlation. It determines linear combinations of all variables from each data set such that the correlation between the two linear combinations is maximized. However, due to the large number of genes, linear combinations involving all single-nucleotide polymorphism (SNP) loci and gene expression phenotypes lack biological plausibility and interpretability. We introduce sparse canonical correlation analysis, which examines the relationships of many genetic loci and gene expression phenotypes by providing sparse linear combinations that include only a small subset of loci and gene expression phenotypes. These correlated sets of variables are sufficiently small for biological interpretability and further investigation. Applying this method to the GAW15 Problem 1 data, we identified groups of 41 loci and 150 gene expressions with the highest between-group correlation of 43%.

Citing Articles

Structure-adaptive canonical correlation analysis for microbiome multi-omics data.

Deng L, Tang Y, Zhang X, Chen J Front Genet. 2024; 15:1489694.

PMID: 39655222 PMC: 11626081. DOI: 10.3389/fgene.2024.1489694.


Stable biomarker discovery in multi-omics data via canonical correlation analysis.

Pusa T, Rousu J PLoS One. 2024; 19(9):e0309921.

PMID: 39250478 PMC: 11383239. DOI: 10.1371/journal.pone.0309921.


Uncover spatially informed variations for single-cell spatial transcriptomics with STew.

Guo N, Vargas J, Reynoso S, Fritz D, Krishna R, Wang C Bioinform Adv. 2024; 4(1):vbae064.

PMID: 38827413 PMC: 11142628. DOI: 10.1093/bioadv/vbae064.


Multi-Omics Data Fusion for Cancer Molecular Subtyping Using Sparse Canonical Correlation Analysis.

Qi L, Wang W, Wu T, Zhu L, He L, Wang X Front Genet. 2021; 12:607817.

PMID: 34367231 PMC: 8341864. DOI: 10.3389/fgene.2021.607817.


Set-Wise Differential Interaction Between Copy Number Alterations and Gene Expressions of Lower-Grade Glioma Reveals Prognosis-Associated Pathways.

Cho S Entropy (Basel). 2020; 22(12).

PMID: 33353229 PMC: 7765960. DOI: 10.3390/e22121434.


References
1.
Morley M, Molony C, Weber T, Devlin J, Ewens K, Spielman R . Genetic analysis of genome-wide variation in human gene expression. Nature. 2004; 430(7001):743-7. PMC: 2966974. DOI: 10.1038/nature02797. View

2.
Cheung V, Spielman R, Ewens K, Weber T, Morley M, Burdick J . Mapping determinants of human gene expression by regional and genome-wide association. Nature. 2005; 437(7063):1365-9. PMC: 3005311. DOI: 10.1038/nature04244. View

3.
Lantieri F, Rydbeck H, Griseri P, Ceccherini I, Devoto M . Incorporating prior biological information in linkage studies increases power and limits multiple testing. BMC Proc. 2008; 1 Suppl 1:S89. PMC: 2367562. DOI: 10.1186/1753-6561-1-s1-s89. View

4.
Tritchler D, Liu Y, Fallah S . A test of linkage for complex discrete and continuous traits in nuclear families. Biometrics. 2003; 59(2):382-92. DOI: 10.1111/1541-0420.00045. View

5.
Wang Y, Fang Y, Wang S . Clustering and principal-components approach based on heritability for mapping multiple gene expressions. BMC Proc. 2008; 1 Suppl 1:S121. PMC: 2367519. DOI: 10.1186/1753-6561-1-s1-s121. View