SJIVE: Supervised Joint and Individual Variation Explained
Overview
Affiliations
Analyzing multi-source data, which are multiple views of data on the same subjects, has become increasingly common in molecular biomedical research. Recent methods have sought to uncover underlying structure and relationships within and/or between the data sources, and other methods have sought to build a predictive model for an outcome using all sources. However, existing methods that do both are presently limited because they either (1) only consider data structure shared by all datasets while ignoring structures unique to each source, or (2) they extract underlying structures first without consideration to the outcome. The proposed method, supervised joint and individual variation explained (sJIVE), can simultaneously (1) identify shared (joint) and source-specific (individual) underlying structure and (2) build a linear prediction model for an outcome using these structures. These two components are weighted to compromise between explaining variation in the multi-source data and in the outcome. Simulations show sJIVE to outperform existing methods when large amounts of noise are present in the multi-source data. An application to data from the COPDGene study explores gene expression and proteomic patterns associated with lung function.
The need for a cancer exposome atlas: a scoping review.
Young A, Mullins C, Sehgal N, Vermeulen R, Kolijn P, Vlaanderen J JNCI Cancer Spectr. 2024; 9(1).
PMID: 39700422 PMC: 11729703. DOI: 10.1093/jncics/pkae122.
Joint and Individual Component Regression.
Wang P, Wang H, Li Q, Shen D, Liu Y J Comput Graph Stat. 2024; 33(3):763-773.
PMID: 39526223 PMC: 11545161. DOI: 10.1080/10618600.2023.2284227.
Joint modeling of an outcome variable and integrated omics datasets using GLM-PO2PLS.
Gu Z, Uh H, Houwing-Duistermaat J, El Bouhaddani S J Appl Stat. 2024; 51(13):2627-2651.
PMID: 39290359 PMC: 11404385. DOI: 10.1080/02664763.2024.2313458.
Seffernick A, Cao X, Cheng C, Yang W, Autry R, Yang J bioRxiv. 2024; .
PMID: 39131398 PMC: 11312528. DOI: 10.1101/2024.07.31.605805.
Jain S, Safo S Brief Bioinform. 2024; 25(4).
PMID: 39007595 PMC: 11771283. DOI: 10.1093/bib/bbae339.