
OutPredict: Multiple Datasets Can Improve Prediction of Expression and Inference of Causality

Overview
Journal Sci Rep
Specialty Science
Date 2020 Apr 24
PMID 32321967
Citations 10
Abstract

The ability to accurately predict the causal relationships from transcription factors to genes would greatly enhance our understanding of transcriptional dynamics. This could lead to applications in which one or more transcription factors could be manipulated to effect a change in genes leading to the enhancement of some desired trait. Here we present a method called OutPredict that constructs a model for each gene based on time series (and other) data and predicts that gene's expression at a previously unseen subsequent time point. The model also infers causal relationships based on the most important transcription factors for each gene model, some of which have been validated by previous physical experiments. The method benefits from known network edges and steady-state data to enhance predictive accuracy. Our results across B. subtilis, Arabidopsis, E. coli, Drosophila and the DREAM4 simulated in silico dataset show improved predictive accuracy ranging from 40% to 60% over other state-of-the-art methods. We find that gene expression models can benefit from the addition of steady-state data to predict expression values of time series. Finally, we validate, based on limited available data, that the influential edges we infer correspond to known relationships significantly more than expected by chance or by state-of-the-art methods.
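The general idea in the abstract (one model per gene, prediction of the next time point, and ranking of transcription factors by their importance in that model) can be illustrated with a small sketch. The abstract does not say which learner OutPredict uses, so the code below substitutes ordinary least squares as a stand-in; it is not the authors' implementation. The names `TF_A`, `TF_B`, the function names, and all numbers are hypothetical, and the toy series is synthetic.

```python
from typing import Dict, List, Sequence


def solve_linear_system(a: List[List[float]], b: List[float]) -> List[float]:
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular system: predictor series are collinear")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_gene_model(tf_series: Dict[str, Sequence[float]],
                   gene_series: Sequence[float]) -> Dict[str, float]:
    """Fit gene[t+1] ~ intercept + sum_k w_k * TF_k[t] by least squares."""
    names = list(tf_series)
    steps = len(gene_series) - 1
    rows = [[1.0] + [tf_series[k][t] for k in names] for t in range(steps)]
    targets = [gene_series[t + 1] for t in range(steps)]
    p = len(names) + 1
    # Normal equations: (X^T X) w = X^T y
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(p)] for i in range(p)]
    xty = [sum(r[i] * y for r, y in zip(rows, targets)) for i in range(p)]
    return dict(zip(["intercept"] + names, solve_linear_system(xtx, xty)))


def predict_next(model: Dict[str, float], tf_now: Dict[str, float]) -> float:
    """Predict the gene's expression at the next time point."""
    return model["intercept"] + sum(
        w * tf_now[k] for k, w in model.items() if k != "intercept")


def rank_regulators(model: Dict[str, float]) -> List[str]:
    """Rank TFs by absolute weight (a crude, scale-dependent importance)."""
    weights = {k: abs(w) for k, w in model.items() if k != "intercept"}
    return sorted(weights, key=weights.get, reverse=True)


# Synthetic toy data (assumption): the gene follows TF_A strongly, TF_B weakly.
tfs = {"TF_A": [1.0, 2.0, 0.5, 3.0, 1.5, 2.5],
       "TF_B": [0.2, 1.0, 0.7, 0.1, 0.9, 0.4]}
gene = [1.0] + [0.5 + 2.0 * a - 0.3 * b
                for a, b in zip(tfs["TF_A"][:5], tfs["TF_B"][:5])]

model = fit_gene_model(tfs, gene)
ranking = rank_regulators(model)
# Held-out step: predict from the last observed TF values.
prediction = predict_next(model, {"TF_A": 2.5, "TF_B": 0.4})
```

Here the last TF measurements play the role of the "previously unseen" time point, and `ranking` stands in for the candidate causal edges into this gene. A learner that captures nonlinear effects, or that weights known network edges, would replace `fit_gene_model` without changing the overall shape of the workflow.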

Citing Articles

Rewiring gene circuitry for plant improvement.

Borowsky A, Bailey-Serres J Nat Genet. 2024; 56(8):1574-1582.

PMID: 39075207 DOI: 10.1038/s41588-024-01806-7.


Bipartite networks represent causality better than simple networks: evidence, algorithms, and applications.

Shen B, Coruzzi G, Shasha D Front Genet. 2024; 15:1371607.

PMID: 38798697 PMC: 11120958. DOI: 10.3389/fgene.2024.1371607.


Nitrogen sensing and regulatory networks: it's about time and space.

Shanks C, Rothkegel K, Brooks M, Cheng C, Alvarez J, Ruffel S Plant Cell. 2024; 36(5):1482-1503.

PMID: 38366121 PMC: 11062454. DOI: 10.1093/plcell/koae038.


Building High-Confidence Gene Regulatory Networks by Integrating Validated TF-Target Gene Interactions Using ConnecTF.

Huang J, Katari M, Juang C, Coruzzi G, Brooks M Methods Mol Biol. 2023; 2698:195-220.

PMID: 37682477 DOI: 10.1007/978-1-0716-3354-0_13.


EnsInfer: a simple ensemble approach to network inference outperforms any single method.

Shen B, Coruzzi G, Shasha D BMC Bioinformatics. 2023; 24(1):114.

PMID: 36964499 PMC: 10037858. DOI: 10.1186/s12859-023-05231-1.

