
Enrichment on Steps, Not Genes, Improves Inference of Differentially Expressed Pathways

Overview
Specialty Biology
Date 2024 Mar 25
PMID 38527066
Abstract

Enrichment analysis is frequently used in combination with differential expression data to investigate potential commonalities amongst lists of genes and generate hypotheses for further experiments. However, current enrichment analysis approaches on pathways ignore the functional relationships between genes in a pathway, particularly OR logic that occurs when a set of proteins can each individually perform the same step in a pathway. As a result, these approaches miss pathways with large or multiple sets because of an inflation of pathway size (when measured as the total gene count) relative to the number of steps. We address this problem by enriching on step-enabling entities in pathways. We treat sets of protein-coding genes as single entities, and we also weight sets to account for the number of genes in them using the multivariate Fisher's noncentral hypergeometric distribution. We then show three examples of pathways that are recovered with this method and find that the results have significant proportions of pathways not found in gene list enrichment analysis.
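The size-inflation effect described above can be illustrated with a small sketch. It is not the authors' implementation: it uses the ordinary (central) hypergeometric test with each OR set collapsed to one entity, and it omits the weighting by set size that the abstract attributes to the multivariate Fisher's noncentral hypergeometric distribution. The pathway layout, background sizes, hit counts, and the helper name `hypergeom_tail` are all hypothetical values chosen for illustration.

```python
from math import comb


def hypergeom_tail(k, N, K, n):
    """P(X >= k) when drawing n items from N, of which K are 'in pathway'."""
    upper = min(K, n)
    numerator = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return numerator / comb(N, n)


# Hypothetical pathway: 3 steps.
#   step 1 can be performed by any one of 10 paralogous genes (an OR set)
#   steps 2 and 3 are each performed by a single gene
# Gene-level pathway size is therefore 12, step-level size is 3.

# Hypothetical experiment: one gene from the OR set plus the genes for
# steps 2 and 3 are differentially expressed.

# Gene-level enrichment: 3 of 12 pathway genes hit,
# 1000 differentially expressed genes out of a 20000-gene background.
gene_p = hypergeom_tail(k=3, N=20000, K=12, n=1000)

# Step-level enrichment: all 3 step-enabling entities hit,
# 900 hit entities out of an assumed 15000-entity background.
step_p = hypergeom_tail(k=3, N=15000, K=3, n=900)

print(f"gene-level p-value: {gene_p:.4g}")
print(f"step-level p-value: {step_p:.4g}")
```

Under these assumed numbers the step-level test gives a smaller p-value than the gene-level test, because the ten interchangeable genes no longer inflate the pathway size. A weighted treatment of sets, as the abstract describes, would adjust for the fact that a larger set is more likely to contain a hit by chance.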

Citing Articles

WormBase 2024: status and transitioning to Alliance infrastructure.

Sternberg P, Van Auken K, Wang Q, Wright A, Yook K, Zarowiecki M. Genetics. 2024; 227(1).

PMID: 38573366 PMC: 11075546. DOI: 10.1093/genetics/iyae050.
