» Articles » PMID: 26653205

MoCluster: Identifying Joint Patterns Across Multiple Omics Data Sets

Overview
Journal J Proteome Res
Specialty Biochemistry
Date 2015 Dec 15
PMID 26653205
Citations 54
Authors
Affiliations
Soon will be listed here.
Abstract

Increasingly, multiple omics approaches are being applied to understand the complexity of biological systems. Yet, computational approaches that enable the efficient integration of such data are not well developed. Here, we describe a novel algorithm, termed moCluster, which discovers joint patterns among multiple omics data. The method first employs a multiblock multivariate analysis to define a set of latent variables representing joint patterns across input data sets, which is further passed to an ordinary clustering algorithm in order to discover joint clusters. Using simulated data, we show that moCluster's performance is not compromised by issues present in iCluster/iCluster+ (notably, the nondeterministic solution) and that it operates 100× to 1000× faster than iCluster/iCluster+. We used moCluster to cluster proteomic and transcriptomic data from the NCI-60 cell line panel. The resulting cluster model revealed different phenotypes across cellular subtypes, such as doubling time and drug response. Applying moCluster to methylation, mRNA, and protein data from a large study on colorectal cancer patients identified four molecular subtypes, including one characterized by microsatellite instability and high expression of genes/proteins involved in immunity, such as PDL1, a target of multiple drugs currently in development. The other three subtypes have not been discovered before using single data sets, which clearly illustrates the molecular complexity of oncogenesis and the need for holistic, multidata analysis strategies.

Citing Articles

MOGAN for LUAD Subtype Classification by Integrating Three Omics Data Types.

He H, Wang L, Ma M Cancer Innov. 2025; 4(2):e160.

PMID: 40026873 PMC: 11868734. DOI: 10.1002/cai2.160.


Synthetic augmentation of cancer cell line multi-omic datasets using unsupervised deep learning.

Cai Z, Apolinario S, Baiao A, Pacini C, Sousa M, Vinga S Nat Commun. 2024; 15(1):10390.

PMID: 39614072 PMC: 11607321. DOI: 10.1038/s41467-024-54771-4.


A metagene based similarity network fusion approach for multi-omics data integration identified novel subtypes in renal cell carcinoma.

Jia C, Wang T, Cui D, Tian Y, Liu G, Xu Z Brief Bioinform. 2024; 25(6).

PMID: 39562162 PMC: 11576078. DOI: 10.1093/bib/bbae606.


DeePathNet: A Transformer-Based Deep Learning Model Integrating Multiomic Data with Cancer Pathways.

Cai Z, Poulos R, Aref A, Robinson P, Reddel R, Zhong Q Cancer Res Commun. 2024; 4(12):3151-3164.

PMID: 39530738 PMC: 11652962. DOI: 10.1158/2767-9764.CRC-24-0285.


IPFMC: an iterative pathway fusion approach for enhanced multi-omics clustering in cancer research.

Zhang H, Liu S, Li B, Zhou X Brief Bioinform. 2024; 25(6).

PMID: 39470306 PMC: 11514061. DOI: 10.1093/bib/bbae541.