» Articles » PMID: 38630807

CoVar: A Generalizable Machine Learning Approach to Identify the Coordinated Regulators Driving Variational Gene Expression

Overview
Specialty Biology
Date 2024 Apr 17
PMID 38630807
Authors
Affiliations
Soon will be listed here.
Abstract

Network inference is used to model transcriptional, signaling, and metabolic interactions among genes, proteins, and metabolites that identify biological pathways influencing disease pathogenesis. Advances in machine learning (ML)-based inference models exhibit the predictive capabilities of capturing latent patterns in genomic data. Such models are emerging as an alternative to the statistical models identifying causative factors driving complex diseases. We present CoVar, an ML-based framework that builds upon the properties of existing inference models, to find the central genes driving perturbed gene expression across biological states. Unlike differentially expressed genes (DEGs) that capture changes in individual gene expression across conditions, CoVar focuses on identifying variational genes that undergo changes in their expression network interaction profiles, providing insights into changes in the regulatory dynamics, such as in disease pathogenesis. Subsequently, it finds core genes from among the nearest neighbors of these variational genes, which are central to the variational activity and influence the coordinated regulatory processes underlying the observed changes in gene expression. Through the analysis of simulated as well as yeast expression data perturbed by the deletion of the mitochondrial genome, we show that CoVar captures the intrinsic variationality and modularity in the expression data, identifying key driver genes not found through existing differential analysis methodologies.

References
1.
Varela J, Praekelt U, Meacock P, Planta R, Mager W . The Saccharomyces cerevisiae HSP12 gene is activated by the high-osmolarity glycerol pathway and negatively regulated by protein kinase A. Mol Cell Biol. 1995; 15(11):6232-45. PMC: 230875. DOI: 10.1128/MCB.15.11.6232. View

2.
Zhou X, Cai X . Inference of differential gene regulatory networks based on gene expression and genetic perturbation data. Bioinformatics. 2019; 36(1):197-204. PMC: 6956787. DOI: 10.1093/bioinformatics/btz529. View

3.
Churko J, Mantalas G, Snyder M, Wu J . Overview of high throughput sequencing technologies to elucidate molecular pathways in cardiovascular diseases. Circ Res. 2013; 112(12):1613-23. PMC: 3831009. DOI: 10.1161/CIRCRESAHA.113.300939. View

4.
Ohrvik H, Nose Y, Wood L, Kim B, Gleber S, Ralle M . Ctr2 regulates biogenesis of a cleaved form of mammalian Ctr1 metal transporter lacking the copper- and cisplatin-binding ecto-domain. Proc Natl Acad Sci U S A. 2013; 110(46):E4279-88. PMC: 3831961. DOI: 10.1073/pnas.1311749110. View

5.
Amaral L, Scala A, Barthelemy M, Stanley H . Classes of small-world networks. Proc Natl Acad Sci U S A. 2000; 97(21):11149-52. PMC: 17168. DOI: 10.1073/pnas.200327197. View