» Articles » PMID: 36119152

SJIVE: Supervised Joint and Individual Variation Explained

Overview
Date 2022 Sep 19
PMID 36119152
Authors
Affiliations
Soon will be listed here.
Abstract

Analyzing multi-source data, which are multiple views of data on the same subjects, has become increasingly common in molecular biomedical research. Recent methods have sought to uncover underlying structure and relationships within and/or between the data sources, and other methods have sought to build a predictive model for an outcome using all sources. However, existing methods that do both are presently limited because they either (1) only consider data structure shared by all datasets while ignoring structures unique to each source, or (2) they extract underlying structures first without consideration to the outcome. The proposed method, supervised joint and individual variation explained (sJIVE), can simultaneously (1) identify shared (joint) and source-specific (individual) underlying structure and (2) build a linear prediction model for an outcome using these structures. These two components are weighted to compromise between explaining variation in the multi-source data and in the outcome. Simulations show sJIVE to outperform existing methods when large amounts of noise are present in the multi-source data. An application to data from the COPDGene study explores gene expression and proteomic patterns associated with lung function.

Citing Articles

The need for a cancer exposome atlas: a scoping review.

Young A, Mullins C, Sehgal N, Vermeulen R, Kolijn P, Vlaanderen J JNCI Cancer Spectr. 2024; 9(1).

PMID: 39700422 PMC: 11729703. DOI: 10.1093/jncics/pkae122.


Joint and Individual Component Regression.

Wang P, Wang H, Li Q, Shen D, Liu Y J Comput Graph Stat. 2024; 33(3):763-773.

PMID: 39526223 PMC: 11545161. DOI: 10.1080/10618600.2023.2284227.


Joint modeling of an outcome variable and integrated omics datasets using GLM-PO2PLS.

Gu Z, Uh H, Houwing-Duistermaat J, El Bouhaddani S J Appl Stat. 2024; 51(13):2627-2651.

PMID: 39290359 PMC: 11404385. DOI: 10.1080/02664763.2024.2313458.


Bootstrap Evaluation of Association Matrices (BEAM) for Integrating Multiple Omics Profiles with Multiple Outcomes.

Seffernick A, Cao X, Cheng C, Yang W, Autry R, Yang J bioRxiv. 2024; .

PMID: 39131398 PMC: 11312528. DOI: 10.1101/2024.07.31.605805.


DeepIDA-GRU: a deep learning pipeline for integrative discriminant analysis of cross-sectional and longitudinal multiview data with applications to inflammatory bowel disease classification.

Jain S, Safo S Brief Bioinform. 2024; 25(4).

PMID: 39007595 PMC: 11771283. DOI: 10.1093/bib/bbae339.


References
1.
Parker M, Chase R, Lamb A, Reyes A, Saferali A, Yun J . RNA sequencing identifies novel non-coding RNA and exon-specific effects associated with cigarette smoking. BMC Med Genomics. 2017; 10(1):58. PMC: 6225866. DOI: 10.1186/s12920-017-0295-9. View

2.
Chekouo T, Safo S . Bayesian integrative analysis and prediction with application to atherosclerosis cardiovascular disease. Biostatistics. 2021; 24(1):124-139. PMC: 9960952. DOI: 10.1093/biostatistics/kxab016. View

3.
Zhao Y, Klein A, Castellanos F, Milham M . Brain age prediction: Cortical and subcortical shape covariation in the developing human brain. Neuroimage. 2019; 202:116149. PMC: 6819257. DOI: 10.1016/j.neuroimage.2019.116149. View

4.
OConnell M, Lock E . R.JIVE for exploration of multi-source molecular data. Bioinformatics. 2016; 32(18):2877-9. PMC: 6090891. DOI: 10.1093/bioinformatics/btw324. View

5.
Guo Y, Ding X, Liu C, Xue J . Sufficient Canonical Correlation Analysis. IEEE Trans Image Process. 2016; 25(6):2610-2619. DOI: 10.1109/TIP.2016.2551374. View