» Articles » PMID: 23323831

GSVA: Gene Set Variation Analysis for Microarray and RNA-seq Data

Overview
Publisher Biomed Central
Specialty Biology
Date 2013 Jan 18
PMID 23323831
Citations 6672
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Gene set enrichment (GSE) analysis is a popular framework for condensing information from gene expression profiles into a pathway or signature summary. The strengths of this approach over single gene analysis include noise and dimension reduction, as well as greater biological interpretability. As molecular profiling experiments move beyond simple case-control studies, robust and flexible GSE methodologies are needed that can model pathway activity within highly heterogeneous data sets.

Results: To address this challenge, we introduce Gene Set Variation Analysis (GSVA), a GSE method that estimates variation of pathway activity over a sample population in an unsupervised manner. We demonstrate the robustness of GSVA in a comparison with current state of the art sample-wise enrichment methods. Further, we provide examples of its utility in differential pathway activity and survival analysis. Lastly, we show how GSVA works analogously with data from both microarray and RNA-seq experiments.

Conclusions: GSVA provides increased power to detect subtle pathway activity changes over a sample population in comparison to corresponding methods. While GSE methods are generally regarded as end points of a bioinformatic analysis, GSVA constitutes a starting point to build pathway-centric models of biology. Moreover, GSVA contributes to the current need of GSE methods for RNA-seq data. GSVA is an open source software package for R which forms part of the Bioconductor project and can be downloaded at http://www.bioconductor.org.

Citing Articles

Lung Cancer Biomarker Database (LCBD): a comprehensive and curated repository of lung cancer biomarkers.

Li Y, Tong Z, Yang Y, Wang Y, Wen L, Li Y BMC Cancer. 2025; 25(1):478.

PMID: 40089661 DOI: 10.1186/s12885-025-13883-w.


Combination therapy with Chicoric acid and PD-1/PD-L1 blockade improves the immunotherapy response in patient-derived ovarian cancer xenograft model.

Lan H, Zhu J, Hou H, Zhang C, Huo X, Zhang Y Cell Commun Signal. 2025; 23(1):137.

PMID: 40087780 DOI: 10.1186/s12964-025-02146-7.


NLRP4 unlocks an NK/macrophages-centered ecosystem to suppress non-small cell lung cancer.

Meng Z, Li J, Wang H, Cao Z, Lu W, Niu X Biomark Res. 2025; 13(1):44.

PMID: 40087771 DOI: 10.1186/s40364-025-00756-4.


Combined transcriptomic and proteomic analyses reveal relevant myelin features in mice with ischemic stroke.

Qian Q, Lyu H, Wang W, Wang Q, Li D, Liu X Funct Integr Genomics. 2025; 25(1):64.

PMID: 40085348 DOI: 10.1007/s10142-025-01573-6.


Identifying potential biomarkers and molecular mechanisms related to arachidonic acid metabolism in vitiligo.

Li X, Yang L, Zhu L, Sun J, Xu C, Sun L Front Mol Biosci. 2025; 12:1536477.

PMID: 40078960 PMC: 11896865. DOI: 10.3389/fmolb.2025.1536477.


References
1.
Dorum G, Snipen L, Solheim M, Saebo S . Rotation testing in gene set enrichment analysis for small direct comparison experiments. Stat Appl Genet Mol Biol. 2009; 8:Article34. DOI: 10.2202/1544-6115.1418. View

2.
Kim S, Volsky D . PAGE: parametric analysis of gene set enrichment. BMC Bioinformatics. 2005; 6:144. PMC: 1183189. DOI: 10.1186/1471-2105-6-144. View

3.
Tenenbaum J, Walker M, Utz P, Butte A . Expression-based Pathway Signature Analysis (EPSA): mining publicly available microarray data for insight into human disease. BMC Med Genomics. 2008; 1:51. PMC: 2588448. DOI: 10.1186/1755-8794-1-51. View

4.
Levine D, Haynor D, Castle J, Stepaniants S, Pellegrini M, Mao M . Pathway and gene-set activation measurement from mRNA expression data: the tissue distribution of human pathways. Genome Biol. 2006; 7(10):R93. PMC: 1794557. DOI: 10.1186/gb-2006-7-10-r93. View

5.
Jung K, Becker B, Brunner E, Beissbarth T . Comparison of global tests for functional gene sets in two-group designs and selection of potentially effect-causing genes. Bioinformatics. 2011; 27(10):1377-83. DOI: 10.1093/bioinformatics/btr152. View