OutPredict: Multiple Datasets Can Improve Prediction of Expression and Inference of Causality
The ability to accurately predict the causal relationships from transcription factors to genes would greatly enhance our understanding of transcriptional dynamics. This could lead to applications in which one or more transcription factors are manipulated to effect a change in genes, leading to the enhancement of some desired trait. Here we present a method called OutPredict that constructs a model for each gene based on time series (and other) data and predicts that gene's expression at a previously unseen subsequent time point. The model also infers causal relationships based on the most important transcription factors for each gene model, some of which have been validated by previous physical experiments. The method benefits from known network edges and steady-state data to enhance predictive accuracy. Our results across B. subtilis, Arabidopsis, E. coli, Drosophila and the DREAM4 simulated in silico dataset show improvements in predictive accuracy ranging from 40% to 60% over other state-of-the-art methods. We find that gene expression models can benefit from the addition of steady-state data to predict expression values of time series. Finally, we validate, based on limited available data, that the influential edges we infer correspond to known relationships significantly more often than expected by chance or than those inferred by state-of-the-art methods.
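To make the per-gene setup concrete, the following is a minimal sketch of the general idea only: one model per target gene, trained to map transcription factor expression at time t to the gene's expression at time t+1, evaluated on a held-out final time point, with candidate regulators ranked by their weight in the model. It is not the OutPredict implementation. The ridge regression learner, the synthetic data, and all names (`fit_gene_model`, `predict_next`, `rank_tfs`, `TF0` to `TF2`) are assumptions introduced for illustration, and the sketch does not use steady-state data or known network edges.

```python
import numpy as np

def fit_gene_model(tf_expr, gene_expr, ridge=1e-3):
    """Fit gene[t+1] ~ tf[t] for one target gene (ridge regression stand-in).

    tf_expr: array of shape (T, n_tfs), TF expression over T time points.
    gene_expr: array of shape (T,), target gene expression over the same points.
    Returns a weight vector whose last entry is the intercept.
    """
    X = tf_expr[:-1]                                 # predictors at time t
    y = gene_expr[1:]                                # response at time t+1
    X1 = np.hstack([X, np.ones((X.shape[0], 1))])    # append intercept column
    penalty = ridge * np.eye(X1.shape[1])
    penalty[-1, -1] = 0.0                            # leave the intercept unpenalized
    return np.linalg.solve(X1.T @ X1 + penalty, X1.T @ y)

def predict_next(weights, tf_now):
    """Predict the gene's expression at the next time point."""
    return float(np.dot(weights[:-1], tf_now) + weights[-1])

def rank_tfs(weights, tf_names):
    """Rank TFs by absolute weight (assumes TFs are on comparable scales)."""
    order = np.argsort(-np.abs(weights[:-1]))
    return [tf_names[i] for i in order]

# Synthetic example (hypothetical data): the gene is driven by TF0 and TF2.
rng = np.random.default_rng(0)
tf_names = ["TF0", "TF1", "TF2"]
tf = rng.normal(size=(30, 3))
gene = np.zeros(30)
gene[1:] = 2.0 * tf[:-1, 0] - 0.5 * tf[:-1, 2] + 0.01 * rng.normal(size=29)

# Hold out the last time point: train on all earlier points only.
weights = fit_gene_model(tf[:-1], gene[:-1])
pred = predict_next(weights, tf[-2])   # predicts gene[-1], unseen in training
ranking = rank_tfs(weights, tf_names)
```

Comparing `pred` with `gene[-1]` gives the out-of-sample error for this gene, and the head of `ranking` gives the candidate causal regulators. Any learner that exposes per-feature importance could replace the ridge step without changing the overall structure.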