Multi-class BCGA-ELM Based Classifier That Identifies Biomarkers Associated with Hallmarks of Cancer

Overview
Publisher: BioMed Central
Specialty: Biology
Date: 2015 May 20
PMID: 25986937
Citations: 2
Abstract

Background: Traditional cancer treatments have centered on cytotoxic drugs and general-purpose chemotherapy that may not be tailored to specific cancers. Identifying molecular markers related to different types of cancer might lead to the discovery of drugs that are patient- and disease-specific. This study uses microarray gene expression cancer data to identify biomarkers that are indicative of different types of cancer. Our aim is to provide a multi-class cancer classifier that can simultaneously differentiate between cancers and identify type-specific biomarkers by combining the Binary Coded Genetic Algorithm (BCGA) with a neural-network-based Extreme Learning Machine (ELM) algorithm.

Results: BCGA and ELM are combined to select a subset of the genes present in the Global Cancer Mapping (GCM) data set. This set of candidate genes contains over 52 biomarkers that, according to the literature, are related to multiple cancers. They include APOA1, VEGFC, YWHAZ, B2M, EIF2S1, CCR9 and many other genes that have been associated with the hallmarks of cancer. BCGA-ELM is tested on several cancer data sets and the results are compared to other classification methods. BCGA-ELM matches or exceeds the other algorithms in terms of accuracy. We were also able to show that over 50% of the genes selected by BCGA-ELM on the GCM data are cancer-related biomarkers.

Conclusions: We were able to simultaneously differentiate between 14 different types of cancer using only 92 genes, achieving a multi-class classification accuracy of 95.4%, which is between 21.6% and 38% higher than other results in the literature for multi-class cancer classification. Our findings suggest that computational algorithms such as BCGA-ELM can facilitate biomarker-driven integrated cancer research that can lead to a detailed understanding of the complexities of cancer.
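
The abstract does not describe how BCGA and ELM are coupled, so the following is only a minimal sketch of one common way to wrap a genetic algorithm around an ELM for feature (gene) selection, not the authors' implementation. Everything in it is an assumption made for illustration: the function names (`elm_fit`, `elm_predict`, `fitness`, `bcga_select`), the tanh hidden layer, the use of validation accuracy as fitness, tournament selection, single-point crossover, the hyperparameter values, and the synthetic data standing in for microarray measurements.

```python
import numpy as np

def elm_fit(X, y, n_classes, n_hidden, rng):
    """Train a single-hidden-layer ELM: random hidden weights, least-squares output weights."""
    W = rng.standard_normal((X.shape[1], n_hidden))  # random input weights, never trained
    b = rng.standard_normal(n_hidden)                # random hidden biases
    H = np.tanh(X @ W + b)                           # hidden-layer activations
    T = np.eye(n_classes)[y]                         # one-hot class targets
    beta = np.linalg.pinv(H) @ T                     # output weights via pseudo-inverse
    return W, b, beta

def elm_predict(model, X):
    W, b, beta = model
    return np.argmax(np.tanh(X @ W + b) @ beta, axis=1)

def fitness(mask, X_tr, y_tr, X_va, y_va, n_classes, n_hidden, seed):
    """Fitness of a binary chromosome = validation accuracy of an ELM on the selected columns."""
    if not mask.any():                               # an empty gene subset cannot classify
        return 0.0
    rng = np.random.default_rng(seed)
    model = elm_fit(X_tr[:, mask], y_tr, n_classes, n_hidden, rng)
    return float(np.mean(elm_predict(model, X_va[:, mask]) == y_va))

def bcga_select(X_tr, y_tr, X_va, y_va, n_classes,
                pop_size=20, n_gen=15, n_hidden=20, p_mut=0.02, seed=0):
    """Binary-coded GA: each chromosome is a boolean mask over the feature columns."""
    rng = np.random.default_rng(seed)
    n_genes = X_tr.shape[1]
    pop = rng.random((pop_size, n_genes)) < 0.2      # start with roughly 20% of genes switched on
    best_mask, best_fit = pop[0].copy(), -1.0
    for _ in range(n_gen):
        fits = np.array([fitness(m, X_tr, y_tr, X_va, y_va, n_classes, n_hidden, seed)
                         for m in pop])
        i = int(np.argmax(fits))
        if fits[i] > best_fit:
            best_fit, best_mask = float(fits[i]), pop[i].copy()
        # Binary tournament selection: the fitter of two random individuals becomes a parent.
        a = rng.integers(pop_size, size=pop_size)
        c = rng.integers(pop_size, size=pop_size)
        parents = pop[np.where(fits[a] >= fits[c], a, c)]
        # Single-point crossover on consecutive parent pairs.
        children = parents.copy()
        for j in range(0, pop_size - 1, 2):
            cut = rng.integers(1, n_genes)
            children[j, cut:] = parents[j + 1, cut:]
            children[j + 1, cut:] = parents[j, cut:]
        # Bit-flip mutation, then elitism: the best mask seen so far always survives.
        children ^= rng.random(children.shape) < p_mut
        children[0] = best_mask
        pop = children
    return best_mask, best_fit

# Synthetic stand-in for expression data: 120 samples, 40 "genes", 3 classes.
data_rng = np.random.default_rng(1)
y = data_rng.integers(3, size=120)
X = data_rng.standard_normal((120, 40))
X[:, :3] += 2.0 * np.eye(3)[y]                       # only the first three columns carry class signal
mask, acc = bcga_select(X[:80], y[:80], X[80:], y[80:], n_classes=3)
print(int(mask.sum()), "features kept, validation accuracy", round(acc, 3))
```

In this sketch the selected subset and its accuracy depend on the random seeds and on the single train/validation split; a real study would more likely score each chromosome with cross-validation and report accuracy on held-out test data.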

Citing Articles

Development and Validation of the Predictive Model for Esophageal Squamous Cell Carcinoma Differentiation Degree.

Wang Y, Yang Y, Sun J, Wang L, Song X, Zhao X. Front Genet. 2020; 11:595638.

PMID: 33193745 PMC: 7645151. DOI: 10.3389/fgene.2020.595638.


Random Subspace Aggregation for Cancer Prediction with Gene Expression Profiles.

Yang L, Liu Z, Yuan X, Wei J, Zhang J. Biomed Res Int. 2016; 2016:4596326.

PMID: 27999797 PMC: 5143691. DOI: 10.1155/2016/4596326.
