» Articles » PMID: 14578198

Accurate Molecular Classification of Human Cancers Based on Gene Expression Using a Simple Classifier with a Pathological Tree-based Framework

Abstract

Recent studies suggest accurate prediction of tissue of origin for human cancers can be achieved by applying sophisticated statistical learning procedures to gene expression data obtained from DNA microarrays. We have pursued the hypothesis that a more straightforward and equally accurate strategy for classifying human tumors is to use a simple algorithm that considers gene expression levels within a tree-based framework that encodes limited information about pathology and tissue ontogeny. By considering gene expression data within this framework, we found only a small number of genes were required to achieve a relatively high accuracy level in tumor classification. Using as few as 45 genes we were able to classify 157 of 190 human malignant tumors correctly, which is comparable to previous results obtained with sophisticated classifiers using thousands of genes. Our simple classifier accurately predicted the origin of metastatic tumors even when the classifier was trained using only primary tumors, and the classifier produced accurate predictions when trained and tested on expression data from different labs, and from different microarray platforms. Our findings suggest that accurate and robust cancer diagnosis from gene expression profiles can be achieved by mimicking the classification strategies routinely used by surgical pathologists.

Citing Articles

Dehydrogenase/reductase SDR family member 2 silencing sensitizes an oxaliplatin‑resistant cell line to oxaliplatin by inhibiting excision repair cross‑complementing group 1 protein expression.

Li J, Jiang G, Zhao L, Yang F, Yuan W, Wang H Oncol Rep. 2019; 42(5):1725-1734.

PMID: 31436301 PMC: 6775812. DOI: 10.3892/or.2019.7291.


The Wide World of Molecular Profiling for Tumor Classification.

Hanash S Clin Chem. 2017; 64(4):743-744.

PMID: 29097505 PMC: 6425933. DOI: 10.1373/clinchem.2017.273466.


RNA-Seq accurately identifies cancer biomarker signatures to distinguish tissue of origin.

Wei I, Shi Y, Jiang H, Kumar-Sinha C, Chinnaiyan A Neoplasia. 2014; 16(11):918-27.

PMID: 25425966 PMC: 4240918. DOI: 10.1016/j.neo.2014.09.007.


Gene expression profiling in patients with carcinoma of unknown primary site: from translational research to standard of care.

Hainsworth J, Greco F Virchows Arch. 2014; 464(4):393-402.

PMID: 24487792 DOI: 10.1007/s00428-014-1545-2.


A microarray-based gene expression analysis to identify diagnostic biomarkers for unknown primary cancer.

Kurahashi I, Fujita Y, Arao T, Kurata T, Koh Y, Sakai K PLoS One. 2013; 8(5):e63249.

PMID: 23671674 PMC: 3650062. DOI: 10.1371/journal.pone.0063249.


References
1.
Giuffrida D, Gharib H . Anaplastic thyroid carcinoma: current diagnosis and treatment. Ann Oncol. 2000; 11(9):1083-9. DOI: 10.1023/a:1008322002520. View

2.
Yeang C, Ramaswamy S, Tamayo P, Mukherjee S, Rifkin R, Angelo M . Molecular classification of multiple tumor types. Bioinformatics. 2001; 17 Suppl 1:S316-22. DOI: 10.1093/bioinformatics/17.suppl_1.s316. View

3.
Rickman D, Bobek M, Misek D, Kuick R, Blaivas M, Kurnit D . Distinctive molecular profiles of high-grade and low-grade gliomas based on oligonucleotide microarray analysis. Cancer Res. 2001; 61(18):6885-91. View

4.
Giordano T, Shedden K, Schwartz D, Kuick R, Taylor J, Lee N . Organ-specific molecular classification of primary lung, colon, and ovarian adenocarcinomas using gene expression profiles. Am J Pathol. 2001; 159(4):1231-8. PMC: 1850519. DOI: 10.1016/S0002-9440(10)62509-6. View

5.
Su A, Welsh J, Sapinoso L, Kern S, Dimitrov P, Lapp H . Molecular classification of human carcinomas by use of gene expression signatures. Cancer Res. 2001; 61(20):7388-93. View