» Articles » PMID: 17702762

Filtering Genes to Improve Sensitivity in Oligonucleotide Microarray Data Analysis

Overview
Specialty Biochemistry
Date 2007 Aug 19
PMID 17702762
Citations 24
Authors
Affiliations
Soon will be listed here.
Abstract

Many recent microarrays hold an enormous number of probe sets, thus raising many practical and theoretical problems in controlling the false discovery rate (FDR). Biologically, it is likely that most probe sets are associated with un-expressed genes, so the measured values are simply noise due to non-specific binding; also many probe sets are associated with non-differentially-expressed (non-DE) genes. In an analysis to find DE genes, these probe sets contribute to the false discoveries, so it is desirable to filter out these probe sets prior to analysis. In the methodology proposed here, we first fit a robust linear model for probe-level Affymetrix data that accounts for probe and array effects. We then develop a novel procedure called FLUSH (Filtering Likely Uninformative Sets of Hybridizations), which excludes probe sets that have statistically small array-effects or large residual variance. This filtering procedure was evaluated on a publicly available data set from a controlled spiked-in experiment, as well as on a real experimental data set of a mouse model for retinal degeneration. In both cases, FLUSH filtering improves the sensitivity in the detection of DE genes compared to analyses using unfiltered, presence-filtered, intensity-filtered and variance-filtered data. A freely-available package called FLUSH implements the procedures and graphical displays described in the article.

Citing Articles

A model-based clustering via mixture of hierarchical models with covariate adjustment for detecting differentially expressed genes from paired design.

Zhang Y, Liu W, Qiu W BMC Bioinformatics. 2023; 24(1):423.

PMID: 37940858 PMC: 10633962. DOI: 10.1186/s12859-023-05556-x.


Data-driven analysis and druggability assessment methods to accelerate the identification of novel cancer targets.

Beis G, Serafeim A, Papasotiriou I Comput Struct Biotechnol J. 2022; 21:46-57.

PMID: 36514341 PMC: 9732000. DOI: 10.1016/j.csbj.2022.11.042.


A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression.

Rohr M, Beardsley J, Nakkina S, Zhu X, Aljabban J, Hadley D Sci Data. 2021; 8(1):214.

PMID: 34381057 PMC: 8358057. DOI: 10.1038/s41597-021-00998-5.


Normalization Methods for the Analysis of Unbalanced Transcriptome Data: A Review.

Liu X, Li N, Liu S, Wang J, Zhang N, Zheng X Front Bioeng Biotechnol. 2020; 7:358.

PMID: 32039167 PMC: 6988798. DOI: 10.3389/fbioe.2019.00358.


Whole transcriptional analysis identifies markers of B, T and plasma cell signaling pathways in the mesenteric adipose tissue associated with Crohn's disease.

da Silva F, Pascoal L, Dotti I, Ayrizono M, Aguilar D, Rodrigues B J Transl Med. 2020; 18(1):44.

PMID: 32000799 PMC: 6993458. DOI: 10.1186/s12967-020-02220-3.


References
1.
Aston C, Jiang L, Sokolov B . Transcriptional profiling reveals evidence for signaling and oligodendroglial abnormalities in the temporal cortex from patients with major depressive disorder. Mol Psychiatry. 2004; 10(3):309-22. DOI: 10.1038/sj.mp.4001565. View

2.
Zhou J, Rappaport E, Tobias J, Young T . Differential gene expression in mouse sclera during ocular development. Invest Ophthalmol Vis Sci. 2006; 47(5):1794-802. DOI: 10.1167/iovs.05-0759. View

3.
Furukawa T, Morrow E, Cepko C . Crx, a novel otx-like homeobox gene, shows photoreceptor-specific expression and regulates photoreceptor differentiation. Cell. 1997; 91(4):531-41. DOI: 10.1016/s0092-8674(00)80439-0. View

4.
Blackshaw S, Harpavat S, Trimarchi J, Cai L, Huang H, Kuo W . Genomic analysis of mouse retinal development. PLoS Biol. 2004; 2(9):E247. PMC: 439783. DOI: 10.1371/journal.pbio.0020247. View

5.
Irizarry R, Bolstad B, Collin F, Cope L, Hobbs B, Speed T . Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 2003; 31(4):e15. PMC: 150247. DOI: 10.1093/nar/gng015. View