» Articles » PMID: 29535919

Improving Classification of Cancer and Mining Biomarkers from Gene Expression Profiles Using Hybrid Optimization Algorithms and Fuzzy Support Vector Machine

Overview
Date 2018 Mar 15
PMID 29535919
Citations 3
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Gene expression data are characteristically high dimensional with a small sample size in contrast to the feature size and variability inherent in biological processes that contribute to difficulties in analysis. Selection of highly discriminative features decreases the computational cost and complexity of the classifier and improves its reliability for prediction of a new class of samples.

Methods: The present study used hybrid particle swarm optimization and genetic algorithms for gene selection and a fuzzy support vector machine (SVM) as the classifier. Fuzzy logic is used to infer the importance of each sample in the training phase and decrease the outlier sensitivity of the system to increase the ability to generalize the classifier. A decision-tree algorithm was applied to the most frequent genes to develop a set of rules for each type of cancer. This improved the abilities of the algorithm by finding the best parameters for the classifier during the training phase without the need for trial-and-error by the user. The proposed approach was tested on four benchmark gene expression profiles.

Results: Good results have been demonstrated for the proposed algorithm. The classification accuracy for leukemia data is 100%, for colon cancer is 96.67% and for breast cancer is 98%. The results show that the best kernel used in training the SVM classifier is the radial basis function.

Conclusions: The experimental results show that the proposed algorithm can decrease the dimensionality of the dataset, determine the most informative gene subset, and improve classification accuracy using the optimal parameters of the classifier with no user interface.

Citing Articles

Monkey king evolution (MKE)-GA-SVM model for subtype classification of breast cancer.

Sarkar S, Mali K Digit Health. 2024; 10:20552076241297002.

PMID: 39659402 PMC: 11629432. DOI: 10.1177/20552076241297002.


Cancer Diagnosis through Contour Visualization of Gene Expression Leveraging Deep Learning Techniques.

Venkatesan V, Kuppusamy Murugesan K, Chandrasekaran K, Ramakrishna M, Khan S, Almusharraf A Diagnostics (Basel). 2023; 13(22).

PMID: 37998588 PMC: 10670706. DOI: 10.3390/diagnostics13223452.


Cardiac tissue engineering: state-of-the-art methods and outlook.

Nguyen A, Marsh P, Schmiess-Heine L, Burke P, Lee A, Lee J J Biol Eng. 2019; 13:57.

PMID: 31297148 PMC: 6599291. DOI: 10.1186/s13036-019-0185-0.

References
1.
Chu F, Wang L . Applications of support vector machines to cancer classification with microarray data. Int J Neural Syst. 2005; 15(6):475-84. DOI: 10.1142/S0129065705000396. View

2.
Alon U, Barkai N, Notterman D, Gish K, Ybarra S, Mack D . Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc Natl Acad Sci U S A. 1999; 96(12):6745-50. PMC: 21986. DOI: 10.1073/pnas.96.12.6745. View

3.
Shen Q, Shi W, Kong W, Ye B . A combination of modified particle swarm optimization algorithm and support vector machine for gene selection and tumor classification. Talanta. 2008; 71(4):1679-83. DOI: 10.1016/j.talanta.2006.07.047. View

4.
Shen Q, Mei Z, Ye B . Simultaneous genes and training samples selection by modified particle swarm optimization for gene expression data classification. Comput Biol Med. 2009; 39(7):646-9. DOI: 10.1016/j.compbiomed.2009.04.008. View

5.
Schena M, Shalon D, Davis R, Brown P . Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science. 1995; 270(5235):467-70. DOI: 10.1126/science.270.5235.467. View