
Kernel Fusion Method for Detecting Cancer Subtypes Via Selecting Relevant Expression Data

Overview
Journal Front Genet
Date 2020 Nov 2
PMID 33133130
Citations 9
Abstract

Cancer is now recognized as a heterogeneous disease composed of many different subtypes. Early diagnosis of cancer subtypes is an important topic in cancer research, which can be of tremendous help to patients after treatment. In this paper, we first extract a novel dataset from The Cancer Genome Atlas (TCGA) that contains gene expression, miRNA expression, and isoform expression for five cancers. Next, to reduce the effect of noise among the 60,483 genes, we select a small number of genes using LASSO, which draws on the gene expression and survival time of patients. Then, we construct one similarity kernel for each type of expression data using the Chebyshev distance. We use similarity kernel fusion (SKF) to fuse the three similarity matrices built from gene, isoform (Iso), and miRNA expression, and finally cluster the fused similarity matrix with spectral clustering. In the experimental results, our method achieves a better p-value in the Cox model than other methods on 10 cancer datasets from the Jiang Dataset and the Novel Dataset. We draw survival curves for the different cancers and find that some genes play a key role in cancer. For breast cancer, we find that HSPA2A, RNASE1, CLIC6, and IFITM1 are highly expressed in some specific groups. For lung cancer, we find that C4BPA, SESN3, and IRS1 are highly expressed in some specific groups. The code and all supporting data files are available from https://github.com/guofei-tju/Uncovering-Cancer-Subtypes-via-LASSO.
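The kernel-construction and fusion steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the Gaussian-style mapping from Chebyshev distance to similarity, the choice of sigma, the plain averaging used in place of the actual SKF procedure, the function names, and the toy values are all assumptions made for illustration. LASSO gene selection and spectral clustering are left out.

```python
import math


def chebyshev(u, v):
    """Chebyshev (L-infinity) distance: largest absolute coordinate difference."""
    return max(abs(a - b) for a, b in zip(u, v))


def similarity_kernel(samples):
    """Patient-by-patient similarity matrix for one expression data type.

    Assumption: distances are mapped to similarities with exp(-d^2 / (2*sigma^2)),
    where sigma is the mean off-diagonal distance (falls back to 1.0 if that is 0).
    """
    n = len(samples)
    dist = [[chebyshev(samples[i], samples[j]) for j in range(n)] for i in range(n)]
    off_diag = [dist[i][j] for i in range(n) for j in range(n) if i != j]
    mean_dist = sum(off_diag) / len(off_diag) if off_diag else 0.0
    sigma = mean_dist or 1.0
    return [[math.exp(-dist[i][j] ** 2 / (2 * sigma ** 2)) for j in range(n)]
            for i in range(n)]


def row_normalize(kernel):
    """Scale each row so that it sums to 1."""
    return [[value / sum(row) for value in row] for row in kernel]


def fuse(kernels):
    """Simplified stand-in for SKF (assumption): average the row-normalized
    kernels, then symmetrize the result."""
    n = len(kernels[0])
    normed = [row_normalize(k) for k in kernels]
    avg = [[sum(k[i][j] for k in normed) / len(normed) for j in range(n)]
           for i in range(n)]
    return [[(avg[i][j] + avg[j][i]) / 2 for j in range(n)] for i in range(n)]


# Hypothetical toy data: 4 patients, one feature matrix per data type.
# Patients 0 and 1 resemble each other, as do patients 2 and 3.
gene = [[1.0, 2.0], [1.1, 2.1], [5.0, 6.0], [5.2, 6.1]]
isoform = [[0.5], [0.6], [3.0], [3.1]]
mirna = [[2.0, 0.1, 1.0], [2.1, 0.0, 1.1], [7.0, 4.0, 6.0], [7.2, 4.1, 6.1]]

fused = fuse([similarity_kernel(x) for x in (gene, isoform, mirna)])
```

In a full pipeline, the fused matrix would be passed to a spectral clustering routine as a precomputed affinity matrix, and the resulting groups compared by survival analysis.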

Citing Articles

Characterization of stem cell landscape and identification of stemness-relevant prognostic gene signature to aid immunotherapy in breast cancer.

Yang X, Yang X, Tang H, Chen X, Wang J, Zhao H. Discov Oncol. 2025; 16(1):9.

PMID: 39755992 PMC: 11700959. DOI: 10.1007/s12672-025-01742-w.


ForestSubtype: a cancer subtype identifying approach based on high-dimensional genomic data and a parallel random forest.

Luo J, Feng Y, Wu X, Li R, Shi J, Chang W. BMC Bioinformatics. 2023; 24(1):289.

PMID: 37468832 PMC: 10354904. DOI: 10.1186/s12859-023-05412-y.


Exploration of prognostic genes and risk signature in breast cancer patients based on RNA binding proteins associated with ferroptosis.

Chen X, Yang C, Wang W, He X, Sun H, Lyu W. Front Genet. 2023; 14:1025163.

PMID: 36911389 PMC: 9998954. DOI: 10.3389/fgene.2023.1025163.


A Cell Differentiation Trajectory-Related Signature for Predicting the Prognosis of Lung Adenocarcinoma.

Yang F, Zhao Y, Huang X, Zhang J, Zhang T. Genet Res (Camb). 2022; 2022:3483498.

PMID: 36072012 PMC: 9398881. DOI: 10.1155/2022/3483498.


Identification of novel tumor microenvironment-associated genes in gastric cancer based on single-cell RNA-sequencing datasets.

Wei X, Liu J, Hong Z, Chen X, Wang K, Cai J. Front Genet. 2022; 13:896064.

PMID: 36046240 PMC: 9421061. DOI: 10.3389/fgene.2022.896064.

