Gene Regulatory Network Inference As Relaxed Graph Matching

Overview
Date 2021 Oct 28
PMID 34707916
Citations 12
Abstract

Bipartite network inference is a ubiquitous problem across disciplines. One important example in the field of molecular biology is gene regulatory network inference. Gene regulatory networks are an instrumental tool for discovering the molecular mechanisms that drive diverse diseases, including cancer. However, typically only noisy observations of the projections of these regulatory networks are assayed. To better estimate regulatory networks from their noisy projections, we formulate a non-convex but analytically tractable optimization problem called OTTER. This problem can be interpreted as relaxed graph matching between the two projections of the bipartite network. OTTER's solutions can be derived explicitly and inspire a spectral algorithm, for which we provide network recovery guarantees. We also provide an alternative approach based on gradient descent that is more robust to noise than the spectral algorithm. Interestingly, this gradient descent approach resembles the message passing equations of an established gene regulatory network inference method, PANDA. Using three cancer-related data sets, we show that OTTER outperforms state-of-the-art inference methods in predicting transcription factor binding to gene regulatory regions. To encourage new graph matching applications to this problem, we have made all networks and validation data publicly available.
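
The abstract does not state the objective itself, so the following is only a minimal sketch of what "relaxed graph matching between the two projections" solved by gradient descent could look like. It assumes a bipartite weight matrix W (transcription factors by genes), a symmetric observed TF-TF projection P, and a symmetric observed gene-gene projection C, and it minimizes a weighted sum of the squared mismatches between W Wᵀ and P and between Wᵀ W and C, plus a ridge term. The names `otter_like_objective`, `otter_like_gradient`, `fit`, and the parameters `lam`, `gamma`, `step`, and `n_iter` are illustrative assumptions, not the authors' API or their published settings.

```python
import numpy as np


def otter_like_objective(W, P, C, lam=0.5, gamma=0.0):
    """Assumed objective: match both projections of W to P and C (both symmetric)."""
    r_tf = W @ W.T - P      # mismatch in the TF-TF projection
    r_gene = W.T @ W - C    # mismatch in the gene-gene projection
    return ((1 - lam) / 4 * np.sum(r_tf ** 2)
            + lam / 4 * np.sum(r_gene ** 2)
            + gamma / 2 * np.sum(W ** 2))


def otter_like_gradient(W, P, C, lam=0.5, gamma=0.0):
    """Gradient of the assumed objective with respect to W (P and C symmetric)."""
    return ((1 - lam) * (W @ W.T - P) @ W
            + lam * W @ (W.T @ W - C)
            + gamma * W)


def fit(P, C, W0, lam=0.5, gamma=0.0, step=1e-3, n_iter=500):
    """Plain fixed-step gradient descent starting from an initial guess W0."""
    W = W0.copy()
    for _ in range(n_iter):
        W = W - step * otter_like_gradient(W, P, C, lam, gamma)
    return W


# Toy demonstration on synthetic data (3 hypothetical TFs, 8 hypothetical genes).
rng = np.random.default_rng(0)
W_true = rng.uniform(0, 1, size=(3, 8))
P_obs = W_true @ W_true.T   # noise-free TF-TF projection
C_obs = W_true.T @ W_true   # noise-free gene-gene projection
W_init = W_true + 0.1 * rng.standard_normal(W_true.shape)  # perturbed prior guess

W_est = fit(P_obs, C_obs, W_init)
print("objective at start:", otter_like_objective(W_init, P_obs, C_obs))
print("objective at end:  ", otter_like_objective(W_est, P_obs, C_obs))
```

Starting from a prior guess such as `W_init` matters in a sketch like this because the two projections alone do not pin down W uniquely: any rotation of W that preserves both W Wᵀ and Wᵀ W fits equally well.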

Citing Articles

Predicting TF-Target Gene Association Using a Heterogeneous Network and Enhanced Negative Sampling.

Le T, Dang X. Bioinform Biol Insights. 2025; 19:11779322251316130.

PMID: 40012937 PMC: 11863233. DOI: 10.1177/11779322251316130.


HGATLink: single-cell gene regulatory network inference via the fusion of heterogeneous graph attention networks and transformer.

Sun Y, Gao J. BMC Bioinformatics. 2025; 26(1):49.

PMID: 39934680 PMC: 11817978. DOI: 10.1186/s12859-025-06071-x.


Cascade Size Distributions: Why They Matter and How to Compute Them Efficiently.

Burkholz R, Quackenbush J. Proc AAAI Conf Artif Intell. 2024; 35(8):6840-6849.

PMID: 39687583 PMC: 11649314. DOI: 10.1609/aaai.v35i8.16844.


Reproducible processing of TCGA regulatory networks.

Fanfani V, Shutta K, Mandros P, Fischer J, Saha E, Micheletti S. bioRxiv. 2024.

PMID: 39574772 PMC: 11580957. DOI: 10.1101/2024.11.05.622163.


Bayesian inference of sample-specific coexpression networks.

Saha E, Fanfani V, Mandros P, Ben Guebila M, Fischer J, Shutta K. Genome Res. 2024; 34(9):1397-1410.

PMID: 39134413 PMC: 11529861. DOI: 10.1101/gr.279117.124.

