Enhancing Robust and Stable Feature Selection Through the Integration of Ranking Methods and Wrapper Techniques in Genetic Data Classification

Overview
Date 2025 Feb 3
PMID 39900763
Abstract

High-dimensional data enlarges the feature space, which increases computational complexity and reduces generalization performance. Microarray data classification, used for tasks such as diagnosing cancer and other diseases, is especially high-dimensional because of the volume of genetic and biological information each sample carries. Dimension reduction is therefore essential for these data sets. The main goal of this chapter is to provide a method for dimension reduction and classification of genetic data sets. The proposed approach comprises multiple stages. First, several feature ranking methods are combined to improve the robustness and stability of the feature selection process. A hybrid ranking method, which incorporates gene interactions, is then integrated with a wrapper method. Finally, a support vector machine (SVM) performs the classification; class imbalance in the training data is corrected before the data are passed to the SVM classifier. In experiments on five microarray data sets, the proposed approach achieves robust feature selection, with the reported robustness metric ranging from 0.70 to 0.88, and classification accuracy between 91% and 96%.
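The staged pipeline described above can be sketched roughly as follows. This is a minimal illustration, not the authors' method: the abstract names neither the ranking methods, the wrapper, the imbalance remedy, nor the robustness metric. The sketch assumes ANOVA F-scores and mutual information as the two rankers, mean-rank aggregation, recursive feature elimination (RFE) around a linear SVM as the wrapper, `class_weight="balanced"` as the imbalance remedy, and Jaccard overlap as a stability measure. Both rankers are univariate, so gene interactions are not modeled here. The function names (`aggregate_ranks`, `select_features`, `jaccard`) and the synthetic data are hypothetical.

```python
import numpy as np
from scipy.stats import rankdata
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFE, f_classif, mutual_info_classif
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC


def aggregate_ranks(X, y, seed=0):
    """Average the ranks from two filter scores; a lower mean rank is better."""
    f_scores, _ = f_classif(X, y)  # ANOVA F-statistic per feature
    mi_scores = mutual_info_classif(X, y, random_state=seed)
    # rankdata ranks in ascending order, so negate scores to give rank 1 to the best.
    return np.mean([rankdata(-f_scores), rankdata(-mi_scores)], axis=0)


def select_features(X, y, n_filter=40, n_final=10):
    """Keep the top-ranked candidates, then refine them with a wrapper (RFE)."""
    candidates = np.argsort(aggregate_ranks(X, y))[:n_filter]
    wrapper = RFE(SVC(kernel="linear", class_weight="balanced"),
                  n_features_to_select=n_final)
    wrapper.fit(X[:, candidates], y)
    return candidates[wrapper.support_]  # indices into the original feature space


def jaccard(a, b):
    """Overlap between two feature sets: |A and B| / |A or B|."""
    a, b = set(a), set(b)
    return len(a & b) / len(a | b)


# Synthetic stand-in for a microarray set: few samples, many features, imbalanced classes.
X, y = make_classification(n_samples=120, n_features=300, n_informative=10,
                           n_redundant=0, weights=[0.75, 0.25], random_state=0)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0)

selected = select_features(X_train, y_train)

# Assumed imbalance remedy: weight errors by inverse class frequency.
clf = SVC(kernel="linear", class_weight="balanced")
clf.fit(X_train[:, selected], y_train)
accuracy = clf.score(X_test[:, selected], y_test)

# Stability check: overlap of the sets selected on two disjoint halves of the training data.
X_a, X_b, y_a, y_b = train_test_split(
    X_train, y_train, test_size=0.5, stratify=y_train, random_state=1)
stability = jaccard(select_features(X_a, y_a), select_features(X_b, y_b))

print(f"selected features: {sorted(selected.tolist())}")
print(f"test accuracy: {accuracy:.2f}, stability (Jaccard): {stability:.2f}")
```

Selecting features on the training split only keeps the test accuracy free of selection bias. The numbers this sketch prints come from synthetic data and are not comparable to the figures reported in the abstract.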
