» Articles » PMID: 33147797

Integrative Analysis of Multi-Omics Data Based on Blockwise Sparse Principal Components

Overview
Journal Int J Mol Sci
Publisher MDPI
Date 2020 Nov 5
PMID 33147797
Citations 6
Authors
Affiliations
Soon will be listed here.
Abstract

The recent development of high-throughput technology has allowed us to accumulate vast amounts of multi-omics data. Because even single omics data have a large number of variables, integrated analysis of multi-omics data suffers from problems such as computational instability and variable redundancy. Most multi-omics data analyses apply single supervised analysis, repeatedly, for dimensional reduction and variable selection. However, these approaches cannot avoid the problems of redundancy and collinearity of variables. In this study, we propose a novel approach using blockwise component analysis. This would solve the limitations of current methods by applying variable clustering and sparse principal component (sPC) analysis. Our approach consists of two stages. The first stage identifies homogeneous variable blocks, and then extracts sPCs, for each omics dataset. The second stage merges sPCs from each omics dataset, and then constructs a prediction model. We also propose a graphical method showing the results of sparse PCA and model fitting, simultaneously. We applied the proposed methodology to glioblastoma multiforme data from The Cancer Genome Atlas. The comparison with other existing approaches showed that our proposed methodology is more easily interpretable than other approaches, and has comparable predictive power, with a much smaller number of variables.

Citing Articles

Transforming Clinical Research: The Power of High-Throughput Omics Integration.

Vitorino R Proteomes. 2024; 12(3).

PMID: 39311198 PMC: 11417901. DOI: 10.3390/proteomes12030025.


Review of Personalized Medicine and Pharmacogenomics of Anti-Cancer Compounds and Natural Products.

Zhou Y, Peng S, Wang H, Cai X, Wang Q Genes (Basel). 2024; 15(4).

PMID: 38674402 PMC: 11049652. DOI: 10.3390/genes15040468.


From Data to Cure: A Comprehensive Exploration of Multi-omics Data Analysis for Targeted Therapies.

Mukherjee A, Abraham S, Singh A, Balaji S, Mukunthan K Mol Biotechnol. 2024; .

PMID: 38565775 DOI: 10.1007/s12033-024-01133-6.


MOBILE pipeline enables identification of context-specific networks and regulatory mechanisms.

Erdem C, Gross S, Heiser L, Birtwistle M Nat Commun. 2023; 14(1):3991.

PMID: 37414767 PMC: 10326020. DOI: 10.1038/s41467-023-39729-2.


MP-LASSO chart: a multi-level polar chart for visualizing group LASSO analysis of genomic data.

Song M, Lee M, Park T, Park M Genomics Inform. 2023; 20(4):e48.

PMID: 36617655 PMC: 9847381. DOI: 10.5808/gi.22075.


References
1.
Alonso-Gutierrez J, Kim E, Batth T, Cho N, Hu Q, Chan L . Principal component analysis of proteomics (PCAP) as a tool to direct metabolic engineering. Metab Eng. 2015; 28:123-133. DOI: 10.1016/j.ymben.2014.11.011. View

2.
Wang Y, Zhang X, Miao F, Cao Y, Xue J, Cao Q . Clinical significance of leukocyte-associated immunoglobulin-like receptor-1 expression in human cervical cancer. Exp Ther Med. 2017; 12(6):3699-3705. PMC: 5228450. DOI: 10.3892/etm.2016.3842. View

3.
Li Z, Safo S, Long Q . Incorporating biological information in sparse principal component analysis with application to genomic data. BMC Bioinformatics. 2017; 18(1):332. PMC: 5504598. DOI: 10.1186/s12859-017-1740-7. View

4.
Zhao K, Cui X, Wang Q, Fang C, Tan Y, Wang Y . RUNX1 contributes to the mesenchymal subtype of glioblastoma in a TGFβ pathway-dependent manner. Cell Death Dis. 2019; 10(12):877. PMC: 6872557. DOI: 10.1038/s41419-019-2108-x. View

5.
Karczewski K, Snyder M . Integrative omics for health and disease. Nat Rev Genet. 2018; 19(5):299-310. PMC: 5990367. DOI: 10.1038/nrg.2018.4. View