» Articles » PMID: 23921631

Quantitative Set Analysis for Gene Expression: a Method to Quantify Gene Set Differential Expression Including Gene-gene Correlations

Overview
Specialty Biochemistry
Date 2013 Aug 8
PMID 23921631
Citations 125
Authors
Affiliations
Soon will be listed here.
Abstract

Enrichment analysis of gene sets is a popular approach that provides a functional interpretation of genome-wide expression data. Existing tests are affected by inter-gene correlations, resulting in a high Type I error. The most widely used test, Gene Set Enrichment Analysis, relies on computationally intensive permutations of sample labels to generate a null distribution that preserves gene-gene correlations. A more recent approach, CAMERA, attempts to correct for these correlations by estimating a variance inflation factor directly from the data. Although these methods generate P-values for detecting gene set activity, they are unable to produce confidence intervals or allow for post hoc comparisons. We have developed a new computational framework for Quantitative Set Analysis of Gene Expression (QuSAGE). QuSAGE accounts for inter-gene correlations, improves the estimation of the variance inflation factor and, rather than evaluating the deviation from a null hypothesis with a P-value, it quantifies gene-set activity with a complete probability density function. From this probability density function, P-values and confidence intervals can be extracted and post hoc analysis can be carried out while maintaining statistical traceability. Compared with Gene Set Enrichment Analysis and CAMERA, QuSAGE exhibits better sensitivity and specificity on real data profiling the response to interferon therapy (in chronic Hepatitis C virus patients) and Influenza A virus infection. QuSAGE is available as an R package, which includes the core functions for the method as well as functions to plot and visualize the results.

Citing Articles

Mitochondrial dysfunction drives a neuronal exhaustion phenotype in methylmalonic aciduria.

Denley M, Straub M, Marcionelli G, Gura M, Penton D, Delvendahl I Commun Biol. 2025; 8(1):410.

PMID: 40069408 PMC: 11897345. DOI: 10.1038/s42003-025-07828-z.


Decoding dengue's neurological assault: insights from single-cell CNS analysis in an immunocompromised mouse model.

Qiu M, Zhao L, Li X, Fan Y, Liu M, Hua D J Neuroinflammation. 2025; 22(1):62.

PMID: 40038739 PMC: 11877810. DOI: 10.1186/s12974-025-03383-w.


Aberrant expression of collagen type X in solid tumor stroma is associated with EMT, immunosuppressive and pro-metastatic pathways, bone marrow stromal cell signatures, and poor survival prognosis.

Famili-Youth E, Famili-Youth A, Yang D, Siddique A, Wu E, Liu W BMC Cancer. 2025; 25(1):247.

PMID: 39939916 PMC: 11823173. DOI: 10.1186/s12885-025-13641-y.


Infiltrating lipid-rich macrophage subpopulations identified as a regulator of increasing prostate size in human benign prostatic hyperplasia.

Lanman N, Meco E, Fitchev P, Kolliegbo A, Broman M, Filipovich Y Front Immunol. 2025; 15:1494476.

PMID: 39867899 PMC: 11757139. DOI: 10.3389/fimmu.2024.1494476.


Single-cell RNA sequencing and spatial transcriptomics of esophageal squamous cell carcinoma with lymph node metastases.

Guo W, Zhou B, Dou L, Guo L, Li Y, Qin J Exp Mol Med. 2024; 57(1):59-71.

PMID: 39741182 PMC: 11799171. DOI: 10.1038/s12276-024-01369-x.


References
1.
Schoggins J, Wilson S, Panis M, Murphy M, Jones C, Bieniasz P . A diverse range of gene products are effectors of the type I interferon antiviral response. Nature. 2011; 472(7344):481-5. PMC: 3409588. DOI: 10.1038/nature09907. View

2.
Fall N, Barnes M, Thornton S, Luyrink L, Olson J, Ilowite N . Gene expression profiling of peripheral blood from patients with untreated new-onset systemic juvenile idiopathic arthritis reveals molecular heterogeneity that may predict macrophage activation syndrome. Arthritis Rheum. 2007; 56(11):3793-804. DOI: 10.1002/art.22981. View

3.
Irizarry R, Wang C, Zhou Y, Speed T . Gene set enrichment analysis made simple. Stat Methods Med Res. 2010; 18(6):565-75. PMC: 3134237. DOI: 10.1177/0962280209351908. View

4.
Yaari G, Uduman M, Kleinstein S . Quantifying selection in high-throughput Immunoglobulin sequencing data sets. Nucleic Acids Res. 2012; 40(17):e134. PMC: 3458526. DOI: 10.1093/nar/gks457. View

5.
Huang Y, Zaas A, Rao A, Dobigeon N, Woolf P, Veldman T . Temporal dynamics of host molecular responses differentiate symptomatic and asymptomatic influenza a infection. PLoS Genet. 2011; 7(8):e1002234. PMC: 3161909. DOI: 10.1371/journal.pgen.1002234. View