» Articles » PMID: 33154423

Unsupervised and Supervised Learning with Neural Network for Human Transcriptome Analysis and Cancer Diagnosis

Overview
Journal Sci Rep
Specialty Science
Date 2020 Nov 6
PMID 33154423
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Deep learning analysis of images and text unfolds new horizons in medicine. However, analysis of transcriptomic data, the cause of biological and pathological changes, is hampered by structural complexity distinctive from images and text. Here we conduct unsupervised training on more than 20,000 human normal and tumor transcriptomic data and show that the resulting Deep-Autoencoder, DeepT2Vec, has successfully extracted informative features and embedded transcriptomes into 30-dimensional Transcriptomic Feature Vectors (TFVs). We demonstrate that the TFVs could recapitulate expression patterns and be used to track tissue origins. Trained on these extracted features only, a supervised classifier, DeepC, can effectively distinguish tumors from normal samples with an accuracy of 90% for Pan-Cancer and reach an average 94% for specific cancers. Training on a connected network, the accuracy is further increased to 96% for Pan-Cancer. Together, our study shows that deep learning with autoencoder is suitable for transcriptomic analysis, and DeepT2Vec could be successfully applied to distinguish cancers, normal tissues, and other potential traits with limited samples.

Citing Articles

Infusing structural assumptions into dimensionality reduction for single-cell RNA sequencing data to identify small gene sets.

Hackenberg M, Brunn N, Vogel T, Binder H Commun Biol. 2025; 8(1):414.

PMID: 40069486 PMC: 11897155. DOI: 10.1038/s42003-025-07872-9.


Towards the Next Generation of Data-Driven Therapeutics Using Spatially Resolved Single-Cell Technologies and Generative AI.

Rodov A, Baniadam H, Zeiser R, Amit I, Yosef N, Wertheimer T Eur J Immunol. 2025; 55(2):e202451234.

PMID: 39964048 PMC: 11834372. DOI: 10.1002/eji.202451234.


Artificial Intelligence and Neuroscience: Transformative Synergies in Brain Research and Clinical Applications.

Onciul R, Tataru C, Dumitru A, Crivoi C, Serban M, Covache-Busuioc R J Clin Med. 2025; 14(2).

PMID: 39860555 PMC: 11766073. DOI: 10.3390/jcm14020550.


Augmenting small biomedical datasets using generative AI methods based on self-organizing neural networks.

Ultsch A, Lotsch J Brief Bioinform. 2024; 26(1).

PMID: 39658207 PMC: 11631343. DOI: 10.1093/bib/bbae640.


Shifting the Cancer Screening Paradigm: The Rising Potential of Blood-Based Multi-Cancer Early Detection Tests.

Brito-Rocha T, Constancio V, Henrique R, Jeronimo C Cells. 2023; 12(6).

PMID: 36980276 PMC: 10047029. DOI: 10.3390/cells12060935.


References
1.
Edgar R, Domrachev M, Lash A . Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2001; 30(1):207-10. PMC: 99122. DOI: 10.1093/nar/30.1.207. View

2.
Furey T, Cristianini N, Duffy N, Bednarski D, Schummer M, Haussler D . Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics. 2000; 16(10):906-14. DOI: 10.1093/bioinformatics/16.10.906. View

3.
Esteva A, Kuprel B, Novoa R, Ko J, Swetter S, Blau H . Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017; 542(7639):115-118. PMC: 8382232. DOI: 10.1038/nature21056. View

4.
Khan J, Wei J, Ringner M, Saal L, Ladanyi M, Westermann F . Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks. Nat Med. 2001; 7(6):673-9. PMC: 1282521. DOI: 10.1038/89044. View

5.
Wang Z, Monteiro C, Jagodnik K, Fernandez N, Gundersen G, Rouillard A . Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd. Nat Commun. 2016; 7:12846. PMC: 5052684. DOI: 10.1038/ncomms12846. View