» Articles » PMID: 29481013

Informative Gene Selection for Cancer Classification with Microarray Data Using a Metaheuristic Framework

Overview
Specialty Oncology
Date 2018 Feb 27
PMID 29481013
Citations 1
Authors
Affiliations
Soon will be listed here.
Abstract

Objective: Cancer diagnosis is one of the most vital emerging clinical applications of microarray data. Due to the high dimensionality, gene selection is an important step for improving expression data classification performance. There is therefore a need for effective methods to select informative genes for prediction and diagnosis of cancer. The main objective of this research was to derive a heuristic approach to select highly informative genes. Methods: A metaheuristic approach with a Genetic Algorithm with Levy Flight (GA-LV) was applied for classification of cancer genes in microarrays. The experimental results were analyzed with five major cancer gene expression benchmark datasets. Result: GA-LV proved superior to GA and statistical approaches, with 100% accuracy for the dataset for Leukemia, Lung and Lymphoma. For Prostate and Colon datasets the GA-LV was 99.5% and 99.2% accurate, respectively. Conclusion: The experimental results show that the proposed approach is suitable for effective gene selection with all benchmark datasets, removing irrelevant and redundant genes to improve classification accuracy.

Citing Articles

Breast cancer prediction based on gene expression data using interpretable machine learning techniques.

Kallah-Dagadu G, Mohammed M, Nasejje J, Mchunu N, Twabi H, Batidzirai J Sci Rep. 2025; 15(1):7594.

PMID: 40038307 PMC: 11880515. DOI: 10.1038/s41598-025-85323-5.


A Dual Level Analysis with Evolutionary Computing and Swarm Models for Classification of Leukemia.

Prabhakar S, Ryu S, Jeong I, Won D Biomed Res Int. 2022; 2022:2052061.

PMID: 35663047 PMC: 9162867. DOI: 10.1155/2022/2052061.

References
1.
Xiong M, Li W, Zhao J, Jin L, Boerwinkle E . Feature (gene) selection in gene expression-based tumor classification. Mol Genet Metab. 2001; 73(3):239-47. DOI: 10.1006/mgme.2001.3193. View

2.
Duan K, Rajapakse J, Wang H, Azuaje F . Multiple SVM-RFE for gene selection in cancer classification with expression data. IEEE Trans Nanobioscience. 2005; 4(3):228-34. DOI: 10.1109/tnb.2005.853657. View

3.
Bilban M, Buehler L, Head S, Desoye G, Quaranta V . Normalizing DNA microarray data. Curr Issues Mol Biol. 2002; 4(2):57-64. View

4.
Chen L, Qu X, Cao M, Zhou Y, Li W, Liang B . Identification of breast cancer patients based on human signaling network motifs. Sci Rep. 2013; 3:3368. PMC: 3842546. DOI: 10.1038/srep03368. View

5.
Li Y, Kang K, Krahn J, Croutwater N, Lee K, Umbach D . A comprehensive genomic pan-cancer classification using The Cancer Genome Atlas gene expression data. BMC Genomics. 2017; 18(1):508. PMC: 5496318. DOI: 10.1186/s12864-017-3906-0. View