
DeepKEGG: a Multi-omics Data Integration Framework with Biological Insights for Cancer Recurrence Prediction and Biomarker Discovery

Overview
Journal Brief Bioinform
Specialty Biology
Date 2024 Apr 28
PMID 38678587
Abstract

Deep learning-based multi-omics data integration methods can reveal the mechanisms of cancer development, discover cancer biomarkers and identify pathogenic targets. However, current methods ignore the potential correlations between samples when integrating multi-omics data. In addition, providing accurate biological explanations remains difficult because of the complexity of deep learning models. There is therefore an urgent need for a deep learning-based multi-omics integration method that both explores the potential correlations between samples and provides model interpretability. Herein, we propose a novel interpretable multi-omics data integration method (DeepKEGG) for cancer recurrence prediction and biomarker discovery. In DeepKEGG, a biological hierarchical module restricts the connections between neuron nodes to the known biological relationships between genes/miRNAs and pathways, which makes the connections local and the model interpretable. In addition, a pathway self-attention module explores the correlation between different samples and generates a latent pathway feature representation that improves the prediction performance of the model. Lastly, an attribution-based feature importance calculation method is used to discover biomarkers related to cancer recurrence and to provide a biological interpretation of the model. Experimental results demonstrate that DeepKEGG outperforms other state-of-the-art methods in 5-fold cross-validation. Furthermore, case studies indicate that DeepKEGG is an effective tool for biomarker discovery. The code is available at https://github.com/lanbiolab/DeepKEGG.
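The two architectural ideas named in the abstract, pathway-restricted local connections and self-attention across samples, can be illustrated with a small NumPy sketch. This is not the authors' implementation (see the repository linked above for that). The function names, the tanh activation, the toy membership mask and the attention without learned query/key/value projections are all assumptions made for illustration only.

```python
import numpy as np

def pathway_layer(x, weights, mask):
    """Gene-to-pathway layer whose connections follow a membership mask.

    x:       (n_samples, n_genes) expression matrix
    weights: (n_genes, n_pathways) trainable weights
    mask:    (n_genes, n_pathways) binary, 1 where the gene belongs to the pathway
    The tanh activation is an illustrative assumption.
    """
    # Multiplying by the mask zeroes every gene-pathway weight that has
    # no biological support, so each pathway node sees only its own genes.
    return np.tanh(x @ (weights * mask))

def sample_self_attention(h):
    """Scaled dot-product self-attention computed across samples.

    h: (n_samples, n_pathways) pathway activations.
    Simplification: no learned query/key/value projections are used.
    """
    scores = h @ h.T / np.sqrt(h.shape[1])
    scores -= scores.max(axis=1, keepdims=True)  # numerical stability
    attn = np.exp(scores)
    attn /= attn.sum(axis=1, keepdims=True)      # each row sums to 1
    return attn @ h, attn

rng = np.random.default_rng(0)
n_samples, n_genes, n_pathways = 6, 8, 3
x = rng.normal(size=(n_samples, n_genes))

# Hypothetical membership matrix, not real KEGG annotations.
mask = np.zeros((n_genes, n_pathways))
mask[0:3, 0] = 1
mask[2:6, 1] = 1
mask[5:8, 2] = 1
weights = rng.normal(size=(n_genes, n_pathways))

h = pathway_layer(x, weights, mask)   # pathway-level features per sample
z, attn = sample_self_attention(h)    # features mixed across samples
```

Because each pathway node receives input only from its member genes, the weight on a gene can be read in the context of a specific pathway, which is the kind of structure that attribution-based importance scores can then exploit.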

Citing Articles

Building an intelligent diabetes Q&A system with knowledge graphs and large language models.

Qin Z, Wu D, Zang Z, Chen X, Zhang H, Wong C Front Public Health. 2025; 13:1540946.

PMID: 40051508 PMC: 11884245. DOI: 10.3389/fpubh.2025.1540946.


PathX-CNN: An Enhanced Explainable Convolutional Neural Network for Survival Prediction and Pathway Analysis in Glioblastoma.

Sobhan M, Islam M, Mondal A bioRxiv. 2025; .

PMID: 39975150 PMC: 11838222. DOI: 10.1101/2025.01.24.634827.


scMoMtF: An interpretable multitask learning framework for single-cell multi-omics data analysis.

Lan W, Ling T, Chen Q, Zheng R, Li M, Pan Y PLoS Comput Biol. 2024; 20(12):e1012679.

PMID: 39693287 PMC: 11654984. DOI: 10.1371/journal.pcbi.1012679.


Deciphering the molecular heterogeneity of intermediate- and (very-)high-risk non-muscle-invasive bladder cancer using multi-layered studies.

Akand M, Jatsenko T, Muilwijk T, Gevaert T, Joniau S, Van der Aa F Front Oncol. 2024; 14:1424293.

PMID: 39497708 PMC: 11532112. DOI: 10.3389/fonc.2024.1424293.
