» Articles » PMID: 29361062

RGBM: Regularized Gradient Boosting Machines for Identification of the Transcriptional Regulators of Discrete Glioma Subtypes

Overview
Specialty Biochemistry
Date 2018 Jan 24
PMID 29361062
Citations 20
Authors
Affiliations
Soon will be listed here.
Abstract

We propose a generic framework for gene regulatory network (GRN) inference approached as a feature selection problem. GRNs obtained using Machine Learning techniques are often dense, whereas real GRNs are rather sparse. We use a Tikonov regularization inspired optimal L-curve criterion that utilizes the edge weight distribution for a given target gene to determine the optimal set of TFs associated with it. Our proposed framework allows to incorporate a mechanistic active biding network based on cis-regulatory motif analysis. We evaluate our regularization framework in conjunction with two non-linear ML techniques, namely gradient boosting machines (GBM) and random-forests (GENIE), resulting in a regularized feature selection based method specifically called RGBM and RGENIE respectively. RGBM has been used to identify the main transcription factors that are causally involved as master regulators of the gene expression signature activated in the FGFR3-TACC3-positive glioblastoma. Here, we illustrate that RGBM identifies the main regulators of the molecular subtypes of brain tumors. Our analysis reveals the identity and corresponding biological activities of the master regulators characterizing the difference between G-CIMP-high and G-CIMP-low subtypes and between PA-like and LGm6-GBM, thus providing a clue to the yet undetermined nature of the transcriptional events among these subtypes.

Citing Articles

VISH-Pred: an ensemble of fine-tuned ESM models for protein toxicity prediction.

Mall R, Singh A, Patel C, Guirimand G, Castiglione F Brief Bioinform. 2024; 25(4).

PMID: 38842509 PMC: 11154842. DOI: 10.1093/bib/bbae270.


DeepFGRN: inference of gene regulatory network with regulation type based on directed graph embedding.

Gao Z, Su Y, Xia J, Cao R, Ding Y, Zheng C Brief Bioinform. 2024; 25(3).

PMID: 38581416 PMC: 10998536. DOI: 10.1093/bib/bbae143.


A machine learning approach to differentiate wide QRS tachycardia: distinguishing ventricular tachycardia from supraventricular tachycardia.

Li Z, Zhao W, Mao Y, Bo D, Chen Q, Kojodjojo P J Interv Card Electrophysiol. 2024; 67(6):1391-1398.

PMID: 38246906 DOI: 10.1007/s10840-024-01743-9.


Pancancer network analysis reveals key master regulators for cancer invasiveness.

Jethalia M, Jani S, Ceccarelli M, Mall R J Transl Med. 2023; 21(1):558.

PMID: 37599366 PMC: 10440887. DOI: 10.1186/s12967-023-04435-6.


Pancancer transcriptomic profiling identifies key PANoptosis markers as therapeutic targets for oncology.

Mall R, Bynigeri R, Karki R, Malireddi R, Sharma B, Kanneganti T NAR Cancer. 2022; 4(4):zcac033.

PMID: 36329783 PMC: 9623737. DOI: 10.1093/narcan/zcac033.


References
1.
Johnson W, Li C, Rabinovic A . Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2006; 8(1):118-27. DOI: 10.1093/biostatistics/kxj037. View

2.
Schaffter T, Marbach D, Floreano D . GeneNetWeaver: in silico benchmark generation and performance profiling of network inference methods. Bioinformatics. 2011; 27(16):2263-70. DOI: 10.1093/bioinformatics/btr373. View

3.
Han H, Shim H, Shin D, Shim J, Ko Y, Shin J . TRRUST: a reference database of human transcriptional regulatory interactions. Sci Rep. 2015; 5:11432. PMC: 4464350. DOI: 10.1038/srep11432. View

4.
Ceccarelli M, Barthel F, Malta T, Sabedot T, Salama S, Murray B . Molecular Profiling Reveals Biologically Discrete Subsets and Pathways of Progression in Diffuse Glioma. Cell. 2016; 164(3):550-63. PMC: 4754110. DOI: 10.1016/j.cell.2015.12.028. View

5.
Frattini V, Pagnotta S, Tala , Fan J, Russo M, Lee S . A metabolic function of FGFR3-TACC3 gene fusions in cancer. Nature. 2018; 553(7687):222-227. PMC: 5771419. DOI: 10.1038/nature25171. View