» Articles » PMID: 26357333

Integrative Data Analysis of Multi-Platform Cancer Data with a Multimodal Deep Learning Approach

Overview
Specialty Biology
Date 2015 Sep 11
PMID 26357333
Citations 64
Authors
Affiliations
Soon will be listed here.
Abstract

Identification of cancer subtypes plays an important role in revealing useful insights into disease pathogenesis and advancing personalized therapy. The recent development of high-throughput sequencing technologies has enabled the rapid collection of multi-platform genomic data (e.g., gene expression, miRNA expression, and DNA methylation) for the same set of tumor samples. Although numerous integrative clustering approaches have been developed to analyze cancer data, few of them are particularly designed to exploit both deep intrinsic statistical properties of each input modality and complex cross-modality correlations among multi-platform input data. In this paper, we propose a new machine learning model, called multimodal deep belief network (DBN), to cluster cancer patients from multi-platform observation data. In our integrative clustering framework, relationships among inherent features of each single modality are first encoded into multiple layers of hidden variables, and then a joint latent model is employed to fuse common features derived from multiple input modalities. A practical learning algorithm, called contrastive divergence (CD), is applied to infer the parameters of our multimodal DBN model in an unsupervised manner. Tests on two available cancer datasets show that our integrative data analysis approach can effectively extract a unified representation of latent features to capture both intra- and cross-modality correlations, and identify meaningful disease subtypes from multi-platform cancer data. In addition, our approach can identify key genes and miRNAs that may play distinct roles in the pathogenesis of different cancer subtypes. Among those key miRNAs, we found that the expression level of miR-29a is highly correlated with survival time in ovarian cancer patients. These results indicate that our multimodal DBN based data analysis approach may have practical applications in cancer pathogenesis studies and provide useful guidelines for personalized cancer therapy.

Citing Articles

Guardrails for the use of generalist AI in cancer care.

Gilbert S, Kather J Nat Rev Cancer. 2024; 24(6):357-358.

PMID: 38627556 DOI: 10.1038/s41568-024-00685-8.


Deep Learning for Genomics: From Early Neural Nets to Modern Large Language Models.

Yue T, Wang Y, Zhang L, Gu C, Xue H, Wang W Int J Mol Sci. 2023; 24(21).

PMID: 37958843 PMC: 10649223. DOI: 10.3390/ijms242115858.


MOCSS: Multi-omics data clustering and cancer subtyping via shared and specific representation learning.

Chen Y, Wen Y, Xie C, Chen X, He S, Bo X iScience. 2023; 26(8):107378.

PMID: 37559907 PMC: 10407241. DOI: 10.1016/j.isci.2023.107378.


Spatial mapping of the DNA adducts in cancer.

Krieger K, Mann E, Lee K, Bolterstein E, Jebakumar D, Ittmann M DNA Repair (Amst). 2023; 128:103529.

PMID: 37390674 PMC: 10330576. DOI: 10.1016/j.dnarep.2023.103529.


Prediction of drug sensitivity based on multi-omics data using deep learning and similarity network fusion approaches.

Liu X, Mei X Front Bioeng Biotechnol. 2023; 11:1156372.

PMID: 37139048 PMC: 10150883. DOI: 10.3389/fbioe.2023.1156372.