» Articles » PMID: 31243065

MOGSA: Integrative Single Sample Gene-set Analysis of Multiple Omics Data

Overview
Date 2019 Jun 28
PMID 31243065
Citations 42
Authors
Affiliations
Soon will be listed here.
Abstract

Gene-set analysis (GSA) summarizes individual molecular measurements to more interpretable pathways or gene-sets and has become an indispensable step in the interpretation of large-scale omics data. However, GSA methods are limited to the analysis of single omics data. Here, we introduce a new computation method termed multi-omics gene-set analysis (MOGSA), a multivariate single sample gene-set analysis method that integrates multiple experimental and molecular data types measured over the same set of samples. The method learns a low dimensional representation of most variant correlated features (genes, proteins, etc.) across multiple omics data sets, transforms the features onto the same scale and calculates an integrated gene-set score from the most informative features in each data type. MOGSA does not require filtering data to the intersection of features (gene IDs), therefore, all molecular features, including those that lack annotation may be included in the analysis. Using simulated data, we demonstrate that integrating multiple diverse sources of molecular data increases the power to discover subtle changes in gene-sets and may reduce the impact of unreliable information in any single data type. Using real experimental data, we demonstrate three use-cases of MOGSA. First, we show how to remove a source of noise (technical or biological) in integrative MOGSA of NCI60 transcriptome and proteome data. Second, we apply MOGSA to discover similarities and differences in mRNA, protein and phosphorylation profiles of a small study of stem cell lines and assess the influence of each data type or feature on the total gene-set score. Finally, we apply MOGSA to cluster analysis and show that three molecular subtypes are robustly discovered when copy number variation and mRNA data of 308 bladder cancers from The Cancer Genome Atlas are integrated using MOGSA. MOGSA is available in the Bioconductor R package "mogsa."

Citing Articles

nipalsMCIA: flexible multi-block dimensionality reduction in R via nonlinear iterative partial least squares.

Mattessich M, Reyna J, Aron E, Ay F, Kilmer M, Kleinstein S Bioinformatics. 2025; 41(1).

PMID: 39799512 PMC: 11783316. DOI: 10.1093/bioinformatics/btaf015.


From Omics to Multi-Omics: A Review of Advantages and Tradeoffs.

Hayes C, Nakahara H, Ono A, Tsuge M, Oka S Genes (Basel). 2025; 15(12.

PMID: 39766818 PMC: 11675490. DOI: 10.3390/genes15121551.


Enhancing immune response and survival in hepatocellular carcinoma with novel oncolytic Jurona virus and immune checkpoint blockade.

Tesfay M, Zhang Y, Ferdous K, Taylor M, Cios A, Shelton R Mol Ther Oncol. 2025; 32(4):200913.

PMID: 39758249 PMC: 11697550. DOI: 10.1016/j.omton.2024.200913.


Methods for multi-omic data integration in cancer research.

Hernandez-Lemus E, Ochoa S Front Genet. 2024; 15:1425456.

PMID: 39364009 PMC: 11446849. DOI: 10.3389/fgene.2024.1425456.


nipalsMCIA: Flexible Multi-Block Dimensionality Reduction in R via Non-linear Iterative Partial Least Squares.

Mattessich M, Reyna J, Aron E, Ay F, Kilmer M, Kleinstein S bioRxiv. 2024; .

PMID: 38915554 PMC: 11195050. DOI: 10.1101/2024.06.07.597819.


References
1.
Rappoport N, Shamir R . Multi-omic and multi-view clustering algorithms: review and cancer benchmark. Nucleic Acids Res. 2018; 46(20):10546-10562. PMC: 6237755. DOI: 10.1093/nar/gky889. View

2.
Biton A, Bernard-Pierrot I, Lou Y, Krucker C, Chapeaublanc E, Rubio-Perez C . Independent component analysis uncovers the landscape of the bladder tumor transcriptome and reveals insights into luminal and basal subtypes. Cell Rep. 2014; 9(4):1235-45. DOI: 10.1016/j.celrep.2014.10.035. View

3.
Holter N, Mitra M, Maritan A, Cieplak M, Banavar J, Fedoroff N . Fundamental patterns underlying gene expression profiles: simplicity from complexity. Proc Natl Acad Sci U S A. 2000; 97(15):8409-14. PMC: 26961. DOI: 10.1073/pnas.150242097. View

4.
Knowles M, Hurst C . Molecular biology of bladder cancer: new insights into pathogenesis and clinical diversity. Nat Rev Cancer. 2014; 15(1):25-41. DOI: 10.1038/nrc3817. View

5.
Argelaguet R, Velten B, Arnol D, Dietrich S, Zenz T, Marioni J . Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets. Mol Syst Biol. 2018; 14(6):e8124. PMC: 6010767. DOI: 10.15252/msb.20178124. View