» Articles » PMID: 12603017

Improved Gene Selection for Classification of Microarrays

Overview
Publisher World Scientific
Specialty Biology
Date 2003 Feb 27
PMID 12603017
Citations 32
Authors
Affiliations
Soon will be listed here.
Abstract

In this paper we derive a method for evaluating and improving techniques for selecting informative genes from microarray data. Genes of interest are typically selected by ranking genes according to a test-statistic and then choosing the top k genes. A problem with this approach is that many of these genes are highly correlated. For classification purposes it would be ideal to have distinct but still highly informative genes. We propose three different pre-filter methods--two based on clustering and one based on correlation--to retrieve groups of similar genes. For these groups we apply a test-statistic to finally select genes of interest. We show that this filtered set of genes can be used to significantly improve existing classifiers.

Citing Articles

A review of machine learning methods for cancer characterization from microbiome data.

Teixeira M, Silva F, Ferreira R, Pereira T, Figueiredo C, Oliveira H NPJ Precis Oncol. 2024; 8(1):123.

PMID: 38816569 PMC: 11139966. DOI: 10.1038/s41698-024-00617-7.


Enhancement of Classifier Performance with Adam and RanAdam Hyper-Parameter Tuning for Lung Cancer Detection from Microarray Data-In Pursuit of Precision.

M S K, Rajaguru H, Nair A Bioengineering (Basel). 2024; 11(4).

PMID: 38671736 PMC: 11047746. DOI: 10.3390/bioengineering11040314.


Predicting the pathogenicity of bacterial genomes using widely spread protein families.

Naor-Hoffmann S, Svetlitsky D, Sal-Man N, Orenstein Y, Ziv-Ukelson M BMC Bioinformatics. 2022; 23(1):253.

PMID: 35751023 PMC: 9233384. DOI: 10.1186/s12859-022-04777-w.


A framework model using multifilter feature selection to enhance colon cancer classification.

Al-Rajab M, Lu J, Xu Q PLoS One. 2021; 16(4):e0249094.

PMID: 33861766 PMC: 8691854. DOI: 10.1371/journal.pone.0249094.


Bayesian Hyper-LASSO Classification for Feature Selection with Application to Endometrial Cancer RNA-seq Data.

Jiang L, Greenwood C, Yao W, Li L Sci Rep. 2020; 10(1):9747.

PMID: 32546735 PMC: 7297975. DOI: 10.1038/s41598-020-66466-z.