» Articles » PMID: 38238840

Structure-primed Embedding on the Transcription Factor Manifold Enables Transparent Model Architectures for Gene Regulatory Network and Latent Activity Inference

Overview
Journal Genome Biol
Specialties Biology
Genetics
Date 2024 Jan 18
PMID 38238840
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Modeling of gene regulatory networks (GRNs) is limited due to a lack of direct measurements of genome-wide transcription factor activity (TFA) making it difficult to separate covariance and regulatory interactions. Inference of regulatory interactions and TFA requires aggregation of complementary evidence. Estimating TFA explicitly is problematic as it disconnects GRN inference and TFA estimation and is unable to account for, for example, contextual transcription factor-transcription factor interactions, and other higher order features. Deep-learning offers a potential solution, as it can model complex interactions and higher-order latent features, although does not provide interpretable models and latent features.

Results: We propose a novel autoencoder-based framework, StrUcture Primed Inference of Regulation using latent Factor ACTivity (SupirFactor) for modeling, and a metric, explained relative variance (ERV), for interpretation of GRNs. We evaluate SupirFactor with ERV in a wide set of contexts. Compared to current state-of-the-art GRN inference methods, SupirFactor performs favorably. We evaluate latent feature activity as an estimate of TFA and biological function in S. cerevisiae as well as in peripheral blood mononuclear cells (PBMC).

Conclusion: Here we present a framework for structure-primed inference and interpretation of GRNs, SupirFactor, demonstrating interpretability using ERV in multiple biological and experimental settings. SupirFactor enables TFA estimation and pathway analysis using latent factor activity, demonstrated here on two large-scale single-cell datasets, modeling S. cerevisiae and PBMC. We find that the SupirFactor model facilitates biological analysis acquiring novel functional and regulatory insight.

Citing Articles

GeneSPIDER2: large scale GRN simulation and benchmarking with perturbed single-cell data.

Garbulowski M, Hillerton T, Morgan D, Secilmis D, Sonnhammer L, Tjarnberg A NAR Genom Bioinform. 2024; 6(3):lqae121.

PMID: 39296931 PMC: 11409065. DOI: 10.1093/nargab/lqae121.


PMF-GRN: a variational inference approach to single-cell gene regulatory network inference using probabilistic matrix factorization.

Skok Gibbs C, Mahmood O, Bonneau R, Cho K Genome Biol. 2024; 25(1):88.

PMID: 38589899 PMC: 11003171. DOI: 10.1186/s13059-024-03226-6.


Reliable interpretability of biology-inspired deep neural networks.

Esser-Skala W, Fortelny N NPJ Syst Biol Appl. 2023; 9(1):50.

PMID: 37816807 PMC: 10564878. DOI: 10.1038/s41540-023-00310-8.


Simultaneous estimation of gene regulatory network structure and RNA kinetics from single cell gene expression.

Jackson C, Beheler-Amass M, Tjarnberg A, Suresh I, Hickey A, Bonneau R bioRxiv. 2023; .

PMID: 37790443 PMC: 10542544. DOI: 10.1101/2023.09.21.558277.

References
1.
Arrieta-Ortiz M, Hafemeister C, Bate A, Chu T, Greenfield A, Shuster B . An experimentally supported model of the Bacillus subtilis global transcriptional regulatory network. Mol Syst Biol. 2015; 11(11):839. PMC: 4670728. DOI: 10.15252/msb.20156236. View

2.
Wolf F, Angerer P, Theis F . SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018; 19(1):15. PMC: 5802054. DOI: 10.1186/s13059-017-1382-0. View

3.
Novakovsky G, Dexter N, Libbrecht M, Wasserman W, Mostafavi S . Obtaining genetics insights from deep learning via explainable artificial intelligence. Nat Rev Genet. 2022; 24(2):125-137. DOI: 10.1038/s41576-022-00532-2. View

4.
Buenrostro J, Giresi P, Zaba L, Chang H, Greenleaf W . Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods. 2013; 10(12):1213-8. PMC: 3959825. DOI: 10.1038/nmeth.2688. View

5.
Tchourine K, Vogel C, Bonneau R . Condition-Specific Modeling of Biophysical Parameters Advances Inference of Regulatory Networks. Cell Rep. 2018; 23(2):376-388. PMC: 5987223. DOI: 10.1016/j.celrep.2018.03.048. View