
Identification of Significant Features in DNA Microarray Data

Overview
Date 2013 Nov 19
PMID 24244802
Citations 2
Abstract

DNA microarrays are a relatively new technology that can simultaneously measure the expression level of thousands of genes. They have become an important tool for a wide variety of biological experiments. One of the most common goals of DNA microarray experiments is to identify genes associated with biological processes of interest. Conventional statistical tests often produce poor results when applied to microarray data owing to small sample sizes, noisy data, and correlation among the expression levels of the genes. Thus, novel statistical methods are needed to identify significant genes in DNA microarray experiments. This article discusses the challenges inherent in DNA microarray analysis and describes a series of statistical techniques that can be used to overcome these challenges. The problem of multiple hypothesis testing and its relation to microarray studies are also considered, along with several possible solutions.
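To illustrate the multiple hypothesis testing problem mentioned above: if, say, 10,000 genes are each tested at a significance level of 0.05 and none is truly differentially expressed, roughly 500 would still be flagged by chance. The abstract does not say which corrections the article covers, so the sketch below is only an assumption about two widely used ones, the Bonferroni correction and the Benjamini-Hochberg procedure. The function names and the example p-values are hypothetical and are not taken from the article.

```python
def bonferroni(p_values, alpha=0.05):
    """Flag tests whose p-value is at most alpha / m (controls family-wise error)."""
    m = len(p_values)
    return [p <= alpha / m for p in p_values]


def benjamini_hochberg(p_values, alpha=0.05):
    """Flag tests by the Benjamini-Hochberg step-up rule (controls false discovery rate)."""
    m = len(p_values)
    # Indices of the tests, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k such that p_(k) <= (k / m) * alpha.
    cutoff = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff = rank
    # Reject every hypothesis ranked at or below that cutoff.
    rejected = [False] * m
    for idx in order[:cutoff]:
        rejected[idx] = True
    return rejected


# Hypothetical p-values for ten genes (illustrative only, not real data).
p_values = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]

uncorrected = [p <= 0.05 for p in p_values]
print(sum(uncorrected))                   # 5 genes pass without correction
print(sum(bonferroni(p_values)))          # 1 gene passes Bonferroni
print(sum(benjamini_hochberg(p_values)))  # 2 genes pass Benjamini-Hochberg
```

On these made-up values, the uncorrected test flags five genes, Bonferroni keeps one, and Benjamini-Hochberg keeps two, which reflects the usual trade-off: stricter control of false positives costs power, a serious concern when sample sizes are small.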

Citing Articles

An enhanced topologically significant directed random walk in cancer classification using gene expression datasets.

Seah C, Kasim S, Fudzee M, Law Tze Ping J, Mohamad M, Saedudin R Saudi J Biol Sci. 2018; 24(8):1828-1841.

PMID: 29551932 PMC: 5851940. DOI: 10.1016/j.sjbs.2017.11.024.


A regression-based differential expression detection algorithm for microarray studies with ultra-low sample size.

Vasiliu D, Clamons S, McDonough M, Rabe B, Saha M PLoS One. 2015; 10(3):e0118198.

PMID: 25738861 PMC: 4349782. DOI: 10.1371/journal.pone.0118198.
