Enhancing Robust and Stable Feature Selection Through the Integration of Ranking Methods and Wrapper Techniques in Genetic Data Classification
Overview
Affiliations
High-dimensional data expands the spatial dimension, leading to increased computational complexity and reduced generalization performance. Microarray data classification, such as diagnosing diseases like cancer, involves complex dimensions due to their genetic and biological information. To address this issue, dimension reduction is essential for these data sets. The main goal of this chapter is to provide a method for dimension reduction and classification of genetic data sets. The proposed approach comprises multiple stages. Initially, various feature ranking methods are combined to improve the robustness and stability of the feature selection process. A hybrid ranking method, which incorporates gene interactions, is integrated with a wrapper method. Subsequently, a support vector machine (SVM) is employed for classification. To address class imbalance in the training data, a solution is implemented before feeding the data into the SVM classifier. The experimental outcomes of the proposed approach, tested on five microarray databases, indicate robust feature selection with a metric ranging from 0.70 to 0.88. Additionally, the classification accuracy falls within the range of 91-96%.