
Bayesian Variable Selection with Graphical Structure Learning: Applications in Integrative Genomics

Overview
Journal PLoS One
Date 2018 Jul 31
PMID 30059495
Citations 7
Abstract

Significant advances in biotechnology have allowed for simultaneous measurement of molecular data across multiple genomic, epigenomic and transcriptomic levels from a single tumor/patient sample. This has motivated systematic data-driven approaches to integrate multi-dimensional structured datasets, since cancer development and progression are driven by numerous co-ordinated molecular alterations and the interactions between them. We propose a novel multi-scale Bayesian approach that combines integrative graphical structure learning from multiple sources of data with a variable selection framework to determine the key genomic drivers of cancer progression. The integrative structure learning is first accomplished through novel joint graphical models for heterogeneous (mixed scale) data, allowing for flexible and interpretable incorporation of existing prior knowledge. This subsequently informs a variable selection step to identify groups of co-ordinated molecular features within and across platforms associated with clinical outcomes of cancer progression, while making appropriate adjustments for multicollinearity and multiplicities. We evaluate our methods through rigorous simulations to establish superiority over existing methods that do not take the network and/or prior information into account. Our methods are motivated by and applied to a glioblastoma multiforme (GBM) dataset from The Cancer Genome Atlas to predict patient survival times by integrating gene expression, copy number and methylation data. We find a high concordance between our selected prognostic gene network modules and known associations with GBM. In addition, our model discovers several novel cross-platform network interactions (both cis and trans acting) between gene expression, copy number variation associated gene dosing and epigenetic regulation through promoter methylation, some with known implications in the etiology of GBM.
Our framework provides a useful tool for biomedical researchers, since clinical prediction using multi-platform genomic information is an important step towards personalized treatment of many cancers.
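
The two-stage idea described in the abstract (learn a network over molecular features first, then select groups of connected features associated with the outcome) can be illustrated with a deliberately simplified, non-Bayesian sketch. It is not the authors' model: thresholded Pearson correlations stand in for the joint graphical model, connected components stand in for network modules, and a mean absolute correlation with the outcome stands in for the variable selection step. All feature names, thresholds, and data below are hypothetical and simulated.

```python
import math
import random
from itertools import combinations


def pearson(x, y):
    """Sample Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def learn_graph(features, threshold=0.6):
    """Stage 1 stand-in: link features whose |correlation| exceeds a threshold."""
    edges = {name: set() for name in features}
    for a, b in combinations(features, 2):
        if abs(pearson(features[a], features[b])) > threshold:
            edges[a].add(b)
            edges[b].add(a)
    return edges


def find_modules(edges):
    """Group features into modules (connected components of the graph)."""
    seen, modules = set(), []
    for start in edges:
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(edges[node] - component)
        seen |= component
        modules.append(sorted(component))
    return modules


def select_modules(features, outcome, modules, cutoff=0.5):
    """Stage 2 stand-in: keep modules whose members track the outcome on average."""
    selected = []
    for module in modules:
        score = sum(abs(pearson(features[f], outcome)) for f in module) / len(module)
        if score > cutoff:
            selected.append(module)
    return selected


# Simulated data (hypothetical): one latent driver shared across three platforms,
# plus two unrelated expression features.
rng = random.Random(0)
n = 200
driver = [rng.gauss(0, 1) for _ in range(n)]
features = {
    "expr_A": [z + rng.gauss(0, 0.3) for z in driver],
    "cnv_A": [z + rng.gauss(0, 0.3) for z in driver],
    "meth_A": [-z + rng.gauss(0, 0.3) for z in driver],  # methylation anti-correlated
    "expr_B": [rng.gauss(0, 1) for _ in range(n)],
    "expr_C": [rng.gauss(0, 1) for _ in range(n)],
}
outcome = [2 * z + rng.gauss(0, 0.3) for z in driver]  # stand-in for a clinical outcome

graph = learn_graph(features)
modules = find_modules(graph)
selected = select_modules(features, outcome, modules)
print(selected)
```

In this toy setting the three features that share the latent driver form a single cross-platform module and are selected together, while the unrelated features remain isolated and are dropped. A real analysis along the lines of the abstract would replace both stages with the Bayesian graphical and selection models and would use a survival outcome rather than a continuous one.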

Citing Articles

Tutorial on survival modeling with applications to omics data.

Zhao Z, Zobolas J, Zucknick M, Aittokallio T. Bioinformatics. 2024; 40(3).

PMID: 38445722 PMC: 10973942. DOI: 10.1093/bioinformatics/btae132.

Role of multiresolution vulnerability indices in COVID-19 spread in India: a Bayesian model-based analysis.

Bhattacharyya R, Burman A, Singh K, Banerjee S, Maity S, Auddy A. BMJ Open. 2022; 12(11):e056292.

PMID: 36396323 PMC: 9676421. DOI: 10.1136/bmjopen-2021-056292.

Rank-based Bayesian variable selection for genome-wide transcriptomic analyses.

Eliseussen E, Fleischer T, Vitelli V. Stat Med. 2022; 41(23):4532-4553.

PMID: 35844145 PMC: 9796757. DOI: 10.1002/sim.9524.

Graph-guided Bayesian SVM with Adaptive Structured Shrinkage Prior for High-dimensional Data.

Sun W, Chang C, Long Q. Proc IEEE Int Conf Big Data. 2022; 2021:4472-4479.

PMID: 35187547 PMC: 8855458. DOI: 10.1109/bigdata52589.2021.9671712.

Knowledge-Guided Statistical Learning Methods for Analysis of High-Dimensional -Omics Data in Precision Oncology.

Zhao Y, Chang C, Long Q. JCO Precis Oncol. 2022; 3.

PMID: 35100722 PMC: 9797232. DOI: 10.1200/PO.19.00018.

