
De Novo Identification of Maximally Deregulated Subnetworks Based on Multi-omics Data with DeRegNet

Overview
Publisher Biomed Central
Specialty Biology
Date 2022 Apr 20
PMID 35439941
Abstract

Background: Although a growing amount of (multi-)omics data is available, extracting knowledge from these datasets remains difficult. Classical enrichment-style analyses require predefined pathways or gene sets, which are tested for significant deregulation to assess whether a pathway is functionally involved in the biological process under study. De novo identification of such pathways can reduce the bias inherent in predefined pathways or gene sets. At the same time, defining these pathways and identifying them efficiently de novo from large biological networks is a challenging problem.

Results: We present a novel algorithm, DeRegNet, for the identification of maximally deregulated subnetworks on directed graphs based on deregulation scores derived from (multi-)omics data. DeRegNet can be interpreted as maximum likelihood estimation under a certain probabilistic model for de novo subgraph identification. We use fractional integer programming to solve the resulting combinatorial optimization problem. We show that the approach outperforms related algorithms on simulated data with known ground truths. On a publicly available liver cancer dataset, we show that DeRegNet can identify biologically meaningful subgraphs suitable for patient stratification. DeRegNet can also be used to find explicitly multi-omics subgraphs, which we demonstrate by presenting subgraphs with consistent methylation-transcription patterns. DeRegNet is freely available as open-source software.
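
To give a feel for what "maximally deregulated subnetwork" can mean, the following is a minimal toy sketch, not the DeRegNet implementation. It assumes that the objective is the average deregulation score of the selected nodes, that a valid subgraph has a root from which every other selected node is reachable along directed edges inside the subgraph, and that the subgraph size is bounded. All of these are illustrative assumptions, as are the function names, scores, and edges. It also swaps fractional integer programming for plain brute-force enumeration, which is only feasible on tiny graphs.

```python
from itertools import combinations


def reachable_within(root, nodes, edges):
    """Return the members of `nodes` reachable from `root` via edges inside `nodes`."""
    seen = {root}
    stack = [root]
    while stack:
        u = stack.pop()
        for v in edges.get(u, ()):
            if v in nodes and v not in seen:
                seen.add(v)
                stack.append(v)
    return seen


def best_subgraph(scores, edges, min_size, max_size):
    """Brute-force search for the rooted subgraph with the highest average score.

    Hypothetical helper for illustration only: it enumerates every node subset
    within the size bounds, so it scales exponentially with the graph size.
    """
    best_nodes, best_avg = None, float("-inf")
    for k in range(min_size, max_size + 1):
        for subset in combinations(sorted(scores), k):
            nodes = set(subset)
            # Assumed validity rule: some node in the subset reaches all the others.
            if not any(reachable_within(r, nodes, edges) == nodes for r in nodes):
                continue
            avg = sum(scores[v] for v in nodes) / k
            if avg > best_avg:
                best_nodes, best_avg = nodes, avg
    return best_nodes, best_avg


# Made-up deregulation scores and directed edges (not taken from the paper).
scores = {"A": 0.9, "B": 0.1, "C": 0.8, "D": 0.7, "E": 0.0}
edges = {"A": ["B", "C"], "B": ["D"], "C": ["D"], "D": ["E"]}

nodes, avg = best_subgraph(scores, edges, min_size=3, max_size=4)
print(sorted(nodes), round(avg, 3))
```

Because the objective is a ratio (total score divided by subgraph size), adding a weakly deregulated node lowers the value even when it keeps the subgraph connected. In this toy graph the search therefore prefers the path through C over the path through B.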

Conclusion: The proposed algorithmic framework and its available implementation can serve as a valuable heuristic hypothesis-generation tool that contextualizes omics data within biomolecular networks.

Citing Articles

Current and future directions in network biology.

Zitnik M, Li M, Wells A, Glass K, Morselli Gysi D, Krishnan A Bioinform Adv. 2024; 4(1):vbae099.

PMID: 39143982 PMC: 11321866. DOI: 10.1093/bioadv/vbae099.


MPAC: a computational framework for inferring cancer pathway activities from multi-omic data.

Liu P, Page D, Ahlquist P, Ong I, Gitter A bioRxiv. 2024.

PMID: 38948762 PMC: 11212914. DOI: 10.1101/2024.06.15.599113.


SUBATOMIC: a SUbgraph BAsed mulTi-OMIcs clustering framework to analyze integrated multi-edge networks.

Loers J, Vermeirssen V BMC Bioinformatics. 2022; 23(1):363.

PMID: 36064320 PMC: 9442970. DOI: 10.1186/s12859-022-04908-3.


Causal reasoning over knowledge graphs leveraging drug-perturbed and disease-specific transcriptomic signatures for drug discovery.

Domingo-Fernandez D, Gadiya Y, Patel A, Mubeen S, Rivas-Barragan D, Diana C PLoS Comput Biol. 2022; 18(2):e1009909.

PMID: 35213534 PMC: 8906585. DOI: 10.1371/journal.pcbi.1009909.
