» Articles » PMID: 39764438

Integration of Gene Expression and DNA Methylation Data Using MLA-GNN for Liver Cancer Biomarker Mining

Overview
Journal Front Genet
Date 2025 Jan 7
PMID 39764438
Authors
Affiliations
Soon will be listed here.
Abstract

The early symptoms of hepatocellular carcinoma patients are often subtle and easily overlooked. By the time patients exhibit noticeable symptoms, the disease has typically progressed to middle or late stages, missing optimal treatment opportunities. Therefore, discovering biomarkers is essential for elucidating their functions for the early diagnosis and prevention. In practical research, challenges such as high-dimensional features, low sample size, and the complexity of gene interactions impact the reliability of biomarker discovery and disease diagnosis when using single-omics approaches. To address these challenges, we thus propose, Multi-level attention graph neural network (MLA-GNN) model for analyzing integrated multi-omics data related to liver cancer. The proposed protocol are using feature selection strategy by removing the noise and redundant information from gene expression and DNA methylation data. Additionally, it employs the Cartesian product method to integrate multi-omics datasets. The study also analyzes gene interactions using WGCNA and identifies potential genes through the MLA-GNN model, offering innovative approaches to resolve these issues. Furthermore, this paper identifies FOXL2 as a promising liver cancer marker through gene ontology and survival analysis. Validation using box plots showed that the expression of the gene FOXL2 was higher in patients with hepatocellular carcinoma than in normal individuals. The drug sensitivity correlation and molecular docking results of FOXL2 with the liver cancer-targeting agent lenvatinib emphasized its potential role in hepatocellular carcinoma treatment and highlighted the importance of FOXL2 in hepatocellular carcinoma treatment.

References
1.
Wang T, Shao W, Huang Z, Tang H, Zhang J, Ding Z . MOGONET integrates multi-omics data using graph convolutional networks allowing patient classification and biomarker identification. Nat Commun. 2021; 12(1):3445. PMC: 8187432. DOI: 10.1038/s41467-021-23774-w. View

2.
Tang J, Zhou S . A New Approach for Feature Selection from Microarray Data Based on Mutual Information. IEEE/ACM Trans Comput Biol Bioinform. 2016; 13(6):1004-1015. DOI: 10.1109/TCBB.2016.2515582. View

3.
Huang S, Cai N, Pacheco P, Narrandes S, Wang Y, Xu W . Applications of Support Vector Machine (SVM) Learning in Cancer Genomics. Cancer Genomics Proteomics. 2017; 15(1):41-51. PMC: 5822181. DOI: 10.21873/cgp.20063. View

4.
Pei G, Chen L, Zhang W . WGCNA Application to Proteomic and Metabolomic Data Analysis. Methods Enzymol. 2017; 585:135-158. DOI: 10.1016/bs.mie.2016.09.016. View

5.
Dennis Jr G, Sherman B, Hosack D, Yang J, Gao W, Lane H . DAVID: Database for Annotation, Visualization, and Integrated Discovery. Genome Biol. 2003; 4(5):P3. View