» Articles » PMID: 16873460

Analysis of Sample Set Enrichment Scores: Assaying the Enrichment of Sets of Genes for Individual Samples in Genome-wide Expression Profiles

Overview
Journal Bioinformatics
Specialty Biology
Date 2006 Jul 29
PMID 16873460
Citations 37
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Gene expression profiling experiments in cell lines and animal models characterized by specific genetic or molecular perturbations have yielded sets of genes annotated by the perturbation. These gene sets can serve as a reference base for interrogating other expression datasets. For example, a new dataset in which a specific pathway gene set appears to be enriched, in terms of multiple genes in that set evidencing expression changes, can then be annotated by that reference pathway. We introduce in this paper a formal statistical method to measure the enrichment of each sample in an expression dataset. This allows us to assay the natural variation of pathway activity in observed gene expression data sets from clinical cancer and other studies.

Results: Validation of the method and illustrations of biological insights gleaned are demonstrated on cell line data, mouse models, and cancer-related datasets. Using oncogenic pathway signatures, we show that gene sets built from a model system are indeed enriched in the model system. We employ ASSESS for the use of molecular classification by pathways. This provides an accurate classifier that can be interpreted at the level of pathways instead of individual genes. Finally, ASSESS can be used for cross-platform expression models where data on the same type of cancer are integrated over different platforms into a space of enrichment scores.

Availability: Versions are available in Octave and Java (with a graphical user interface). Software can be downloaded at http://people.genome.duke.edu/assess.

Citing Articles

Sensitive detection of synthetic response to cancer immunotherapy driven by gene paralog pairs.

Dong C, Zhang F, He E, Ren P, Verma N, Zhu X bioRxiv. 2024; .

PMID: 39005443 PMC: 11245041. DOI: 10.1101/2024.07.02.601809.


Survival analysis of pathway activity as a prognostic determinant in breast cancer.

Jeuken G, Tobin N, Kall L PLoS Comput Biol. 2022; 18(3):e1010020.

PMID: 35344554 PMC: 8989354. DOI: 10.1371/journal.pcbi.1010020.


Comparison of pathway and gene-level models for cancer prognosis prediction.

Zheng X, Amos C, Frost H BMC Bioinformatics. 2020; 21(1):76.

PMID: 32111152 PMC: 7048092. DOI: 10.1186/s12859-020-3423-z.


Graph- and rule-based learning algorithms: a comprehensive review of their applications for cancer type classification and prognosis using genomic data.

Mallik S, Zhao Z Brief Bioinform. 2019; 21(2):368-394.

PMID: 30649169 PMC: 7373185. DOI: 10.1093/bib/bby120.


Network-based machine learning and graph theory algorithms for precision oncology.

Zhang W, Chien J, Yong J, Kuang R NPJ Precis Oncol. 2018; 1(1):25.

PMID: 29872707 PMC: 5871915. DOI: 10.1038/s41698-017-0029-7.