» Articles » PMID: 39410583

Enhancing Non-Small Cell Lung Cancer Survival Prediction Through Multi-Omics Integration Using Graph Attention Network

Overview
Specialty Radiology
Date 2024 Oct 16
PMID 39410583
Authors
Affiliations
Soon will be listed here.
Abstract

: Cancer survival prediction is vital in improving patients' prospects and recommending therapies. Understanding the molecular behavior of cancer can be enhanced through the integration of multi-omics data, including mRNA, miRNA, and DNA methylation data. In light of these multi-omics data, we proposed a graph attention network (GAT) model in this study to predict the survival of non-small cell lung cancer (NSCLC). : The different omics data were obtained from The Cancer Genome Atlas (TCGA) and preprocessed and combined into a single dataset using the sample ID. We used the chi-square test to select the most significant features to be used in our model. We used the synthetic minority oversampling technique (SMOTE) to balance the dataset and the concordance index (C-index) to measure the performance of our model on different combinations of omics data. : Our model demonstrated superior performance, with the highest value of the C-index obtained when we used both mRNA and miRNA data. This demonstrates that the multi-omics approach could be effective in predicting survival. Further pathway analysis conducted with KEGG showed that our GAT model provided high weights to the features that are associated with the viral entry pathways, such as the Epstein-Barr virus and Influenza A pathways, which are involved in lung cancer development. From our findings, it can be observed that the proposed GAT model leads to a significantly improved prediction of survival by exploiting the strengths of multiple omics datasets and the findings from the enriched pathways. Our GAT model outperforms other state-of-the-art methods that are used for NSCLC prediction. : In this study, we developed a new model for the survival prediction of NSCLC using the GAT based on multi-omics data. Our model showed outstanding predictive values, and the KEGG analysis of the selected significant features showed that they were implicated in pivotal biological processes underlying pathways such as Influenza A and the Epstein-Barr virus infection, which are linked to lung cancer progression.

Citing Articles

Enhancing Cancerous Gene Selection and Classification for High-Dimensional Microarray Data Using a Novel Hybrid Filter and Differential Evolutionary Feature Selection.

Hashmi A, Ali W, Abulfaraj A, Binzagr F, Alkayal E Cancers (Basel). 2024; 16(23).

PMID: 39682102 PMC: 11639765. DOI: 10.3390/cancers16233913.

References
1.
Zhang D, Lu B, Liang B, Li B, Wang Z, Gu M . Interpretable deep learning survival predictive tool for small cell lung cancer. Front Oncol. 2023; 13:1162181. PMC: 10196231. DOI: 10.3389/fonc.2023.1162181. View

2.
Ritchie M, Phipson B, Wu D, Hu Y, Law C, Shi W . limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015; 43(7):e47. PMC: 4402510. DOI: 10.1093/nar/gkv007. View

3.
Austin P, Steyerberg E . Interpreting the concordance statistic of a logistic regression model: relation to the variance and odds ratio of a continuous explanatory variable. BMC Med Res Methodol. 2012; 12:82. PMC: 3528632. DOI: 10.1186/1471-2288-12-82. View

4.
Ellen J, Jacob E, Nikolaou N, Markuzon N . Autoencoder-based multimodal prediction of non-small cell lung cancer survival. Sci Rep. 2023; 13(1):15761. PMC: 10517020. DOI: 10.1038/s41598-023-42365-x. View

5.
Weng C, Chen L, Lin C, Chen H, Lee H, Ling T . Association between the risk of lung cancer and influenza: A population-based nested case-control study. Int J Infect Dis. 2019; 88:8-13. DOI: 10.1016/j.ijid.2019.07.030. View