» Articles » PMID: 32850695

Pan-Cancer Classification Based on Self-Normalizing Neural Networks and Feature Selection

Overview
Date 2020 Aug 28
PMID 32850695
Citations 9
Authors
Affiliations
Soon will be listed here.
Abstract

Cancer is a one of the severest diseases and cancer classification plays an important role in cancer diagnosis and treatment. Some different cancers even have similar molecular features such as DNA copy number variant. Pan-cancer classification is still non-trivial at molecular level. Herein, we propose a computational method to classify cancer types by using the self-normalizing neural network (SNN) for analyzing pan-cancer copy number variation data. Since the dimension of the copy number variation features is high, the Monte Carlo feature selection method was used to rank these features. Then a classifier was built by SNN and feature selection method to select features. Three thousand six hundred ninety-four features were chosen for the prediction model, which yields the accuracy value is 0.798 and macro F1 is 0.789. We compared our model to random forest method. Results show the accuracy and macro F1 obtained by our classifier are higher than those obtained by random forest classifier, indicating the good predictive power of our method in distinguishing four different cancer types. This method is also extendable to pan-cancer classification for other molecular features.

Citing Articles

Unveiling epigenetic regulatory elements associated with breast cancer development.

Jardanowska-Kotuniak M, Draminski M, Wlasnowolski M, Lapinski M, Sengupta K, Agarwal A bioRxiv. 2024; .

PMID: 39605637 PMC: 11601335. DOI: 10.1101/2024.11.12.623187.


Identification of circadian rhythm-related gene classification patterns and immune infiltration analysis in heart failure based on machine learning.

Wang X, Rao J, Zhang L, Liu X, Zhang Y Heliyon. 2024; 10(6):e27049.

PMID: 38509983 PMC: 10950509. DOI: 10.1016/j.heliyon.2024.e27049.


Identification of mG regulator-mediated RNA methylation modification patterns and related immune microenvironment regulation characteristics in heart failure.

Ma C, Tu D, Xu Q, Wu Y, Song X, Guo Z Clin Epigenetics. 2023; 15(1):22.

PMID: 36782329 PMC: 9926673. DOI: 10.1186/s13148-023-01439-3.


Fusing compressed deep ConvNets with a self-normalizing residual block and alpha dropout for a cost-efficient classification and diagnosis of gastrointestinal tract diseases.

Montalbo F MethodsX. 2022; 9:101925.

PMID: 36420314 PMC: 9677079. DOI: 10.1016/j.mex.2022.101925.


Multiclass Cancer Prediction Based on Copy Number Variation Using Deep Learning.

Attique H, Shah S, Jabeen S, Khan F, Khan A, ElAffendi M Comput Intell Neurosci. 2022; 2022:4742986.

PMID: 35720914 PMC: 9203194. DOI: 10.1155/2022/4742986.


References
1.
Greenman C, Stephens P, Smith R, Dalgliesh G, Hunter C, Bignell G . Patterns of somatic mutation in human cancer genomes. Nature. 2007; 446(7132):153-8. PMC: 2712719. DOI: 10.1038/nature05610. View

2.
Grossman R, Heath A, Ferretti V, Varmus H, Lowy D, Kibbe W . Toward a Shared Vision for Cancer Genomic Data. N Engl J Med. 2016; 375(12):1109-12. PMC: 6309165. DOI: 10.1056/NEJMp1607591. View

3.
LeCun Y, Bengio Y, Hinton G . Deep learning. Nature. 2015; 521(7553):436-44. DOI: 10.1038/nature14539. View

4.
Draminski M, Rada-Iglesias A, Enroth S, Wadelius C, Koronacki J, Komorowski J . Monte Carlo feature selection for supervised classification. Bioinformatics. 2007; 24(1):110-7. DOI: 10.1093/bioinformatics/btm486. View

5.
Ding X, Tsang S, Ng S, Xue H . Application of Machine Learning to Development of Copy Number Variation-based Prediction of Cancer Risk. Genomics Insights. 2015; 7:1-11. PMC: 4504076. DOI: 10.4137/GEI.S15002. View