» Articles » PMID: 16174746

Discovering Statistically Significant Pathways in Expression Profiling Studies

Overview
Specialty Science
Date 2005 Sep 22
PMID 16174746
Citations 309
Authors
Affiliations
Soon will be listed here.
Abstract

Accurate and rapid identification of perturbed pathways through the analysis of genome-wide expression profiles facilitates the generation of biological hypotheses. We propose a statistical framework for determining whether a specified group of genes for a pathway has a coordinated association with a phenotype of interest. Several issues on proper hypothesis-testing procedures are clarified. In particular, it is shown that the differences in the correlation structure of each set of genes can lead to a biased comparison among gene sets unless a normalization procedure is applied. We propose statistical tests for two important but different aspects of association for each group of genes. This approach has more statistical power than currently available methods and can result in the discovery of statistically significant pathways that are not detected by other methods. This method is applied to data sets involving diabetes, inflammatory myopathies, and Alzheimer's disease, using gene sets we compiled from various public databases. In the case of inflammatory myopathies, we have correctly identified the known cytotoxic T lymphocyte-mediated autoimmunity in inclusion body myositis. Furthermore, we predicted the presence of dendritic cells in inclusion body myositis and of an IFN-alpha/beta response in dermatomyositis, neither of which was previously described. These predictions have been subsequently corroborated by immunohistochemistry.

Citing Articles

Pathway-based analyses of gene expression profiles at low doses of ionizing radiation.

Luo X, Niyakan S, Johnstone P, McCorkle S, Park G, Lopez-Marrero V Front Bioinform. 2024; 4:1280971.

PMID: 38812660 PMC: 11135168. DOI: 10.3389/fbinf.2024.1280971.


Robustness evaluations of pathway activity inference methods on gene expression data.

Hui T, Kasim S, Abdul Aziz I, Fudzee M, Haron N, Sutikno T BMC Bioinformatics. 2024; 25(1):23.

PMID: 38216898 PMC: 10785356. DOI: 10.1186/s12859-024-05632-w.


Pathway analysis through mutual information.

Jeuken G, Kall L Bioinformatics. 2024; 40(1).

PMID: 38195928 PMC: 10783954. DOI: 10.1093/bioinformatics/btad776.


Explainable protein function annotation using local structure embeddings.

Derry A, Altman R bioRxiv. 2023; .

PMID: 37905033 PMC: 10614799. DOI: 10.1101/2023.10.13.562298.


Dissecting Pathway Disturbances Using Network Topology and Multi-platform Genomics Data.

Zhang Y, Linder M, Shojaie A, Ouyang Z, Shen R, Baggerly K Stat Biosci. 2023; 10(1):86-106.

PMID: 37388623 PMC: 10309155. DOI: 10.1007/s12561-017-9193-0.


References
1.
LaFerla F . Calcium dyshomeostasis and intracellular signalling in Alzheimer's disease. Nat Rev Neurosci. 2002; 3(11):862-72. DOI: 10.1038/nrn960. View

2.
Pavlidis P, Li Q, Noble W . The effect of replication on gene expression microarray experiments. Bioinformatics. 2003; 19(13):1620-7. DOI: 10.1093/bioinformatics/btg227. View

3.
Dalakas M, Hohlfeld R . Polymyositis and dermatomyositis. Lancet. 2003; 362(9388):971-82. DOI: 10.1016/S0140-6736(03)14368-1. View

4.
Damian D, Gorfine M . Statistical concerns about the GSEA procedure. Nat Genet. 2004; 36(7):663. DOI: 10.1038/ng0704-663a. View

5.
Zhong S, Li C, Wong W . ChipInfo: Software for extracting gene annotation and gene ontology information for microarray analysis. Nucleic Acids Res. 2003; 31(13):3483-6. PMC: 169004. DOI: 10.1093/nar/gkg598. View