PathIntegrate: Multivariate Modelling Approaches for Pathway-based Multi-omics Data Integration
Overview
Authors
Affiliations
As terabytes of multi-omics data are being generated, there is an ever-increasing need for methods facilitating the integration and interpretation of such data. Current multi-omics integration methods typically output lists, clusters, or subnetworks of molecules related to an outcome. Even with expert domain knowledge, discerning the biological processes involved is a time-consuming activity. Here we propose PathIntegrate, a method for integrating multi-omics datasets based on pathways, designed to exploit knowledge of biological systems and thus provide interpretable models for such studies. PathIntegrate employs single-sample pathway analysis to transform multi-omics datasets from the molecular to the pathway-level, and applies a predictive single-view or multi-view model to integrate the data. Model outputs include multi-omics pathways ranked by their contribution to the outcome prediction, the contribution of each omics layer, and the importance of each molecule in a pathway. Using semi-synthetic data we demonstrate the benefit of grouping molecules into pathways to detect signals in low signal-to-noise scenarios, as well as the ability of PathIntegrate to precisely identify important pathways at low effect sizes. Finally, using COPD and COVID-19 data we showcase how PathIntegrate enables convenient integration and interpretation of complex high-dimensional multi-omics datasets. PathIntegrate is available as an open-source Python package.
Omics Approaches in Understanding Insecticide Resistance in Mosquito Vectors.
Bharadwaj N, Sharma R, Subramanian M, Ragini G, Nagarajan S, Rahi M Int J Mol Sci. 2025; 26(5).
PMID: 40076478 PMC: 11899280. DOI: 10.3390/ijms26051854.
Hiie L, Kolde A, Pervjakova N, Reigo A, Abner E, Vosa U Sci Rep. 2025; 15(1):8470.
PMID: 40069276 PMC: 11897224. DOI: 10.1038/s41598-025-92840-w.
Sobhan M, Islam M, Mondal A bioRxiv. 2025; .
PMID: 39975150 PMC: 11838222. DOI: 10.1101/2025.01.24.634827.
Akand M, Jatsenko T, Muilwijk T, Gevaert T, Joniau S, Van der Aa F Front Oncol. 2024; 14:1424293.
PMID: 39497708 PMC: 11532112. DOI: 10.3389/fonc.2024.1424293.
Synthetic data generation methods in healthcare: A review on open-source tools and methods.
Pezoulas V, Zaridis D, Mylona E, Androutsos C, Apostolidis K, Tachos N Comput Struct Biotechnol J. 2024; 23:2892-2910.
PMID: 39108677 PMC: 11301073. DOI: 10.1016/j.csbj.2024.07.005.