» Articles » PMID: 39097651

A Model-free and Distribution-free Multi-omics Integration Approach for Detecting Novel Lung Adenocarcinoma Genes

Overview
Journal Sci Rep
Specialty Science
Date 2024 Aug 3
PMID 39097651
Authors
Affiliations
Soon will be listed here.
Abstract

Detection of important genes affecting lung adenocarcinoma (LUAD) is critical to finding effective therapeutic targets for this highly lethal cancer. However, many existing approaches have focused on single outcomes or phenotypic associations, which may not be as thorough as investigating molecular transcript levels within cells. In this article, we apply a novel multivariate rank-distance correlation-based gene selection procedure (MrDcGene) to LUAD multi-omics data downloaded from The Cancer Genome Atlas (TCGA). MrDcGene provides additional opportunities for detecting novel susceptibility genes as it leverages information from multiple platforms, while efficiently handling challenges such as high dimensionality, low signal-to-noise ratio, unknown distributions, and non-linear structures, etc. Notably, the MrDcGene method is able to detect two different scenarios, i.e., strong association strength with a few gene expressions and weak association strength with several gene expressions. After thoroughly exploring the association between gene expression (GE) and multiple other platforms, including reverse phase protein array (RPPA), miRNA, copy number variation (CNV) and DNA methylation (ME), we detect several novel genes that may play an important role in LUAD (ZNF133, CCDC159, YWHAZ, HNRNPR. ITPR2, PTHLH, and WIPI2). In addition, we quantitatively validate several other susceptibility genes that were reported in the literature using different methods and studies. The accuracy of the MrDcGene approach is theoretically assured and empirically demonstrated by the simulation studies.

References
1.
Takahashi S, Asada K, Takasawa K, Shimoyama R, Sakai A, Bolatkan A . Predicting Deep Learning Based Multi-Omics Parallel Integration Survival Subtypes in Lung Cancer Using Reverse Phase Protein Array Data. Biomolecules. 2020; 10(10). PMC: 7603376. DOI: 10.3390/biom10101460. View

2.
Sherafatian M, Arjmand F . Decision tree-based classifiers for lung cancer diagnosis and subtyping using TCGA miRNA expression data. Oncol Lett. 2019; 18(2):2125-2131. PMC: 6607095. DOI: 10.3892/ol.2019.10462. View

3.
Shi Y, Au J, Thongprasert S, Srinivasan S, Tsai C, Khoa M . A prospective, molecular epidemiology study of EGFR mutations in Asian patients with advanced non-small-cell lung cancer of adenocarcinoma histology (PIONEER). J Thorac Oncol. 2014; 9(2):154-62. PMC: 4132036. DOI: 10.1097/JTO.0000000000000033. View

4.
Zhao Q, Shi X, Xie Y, Huang J, Shia B, Ma S . Combining multidimensional genomic measurements for predicting cancer prognosis: observations from TCGA. Brief Bioinform. 2014; 16(2):291-303. PMC: 4375393. DOI: 10.1093/bib/bbu003. View

5.
Wu C, Zhou F, Ren J, Li X, Jiang Y, Ma S . A Selective Review of Multi-Level Omics Data Integration Using Variable Selection. High Throughput. 2019; 8(1). PMC: 6473252. DOI: 10.3390/ht8010004. View