Empirical Bayes Microarray ANOVA and Grouping Cell Lines by Equal Expression Levels
Overview
Molecular Biology
Public Health
Affiliations
In the exploding field of gene expression techniques such as DNA microarrays, there are still few general probabilistic methods for analysis of variance. Linear models and ANOVA are heavily used tools in many other disciplines of scientific research. The usual F-statistic is unsatisfactory for microarray data, which explore many thousand genes in parallel, with few replicates. We present three potential one-way ANOVA statistics in a parametric statistical framework. The aim is to separate genes that are differently regulated across several treatment conditions from those with equal regulation. The statistics have different features and are evaluated using both real and simulated data. Our statistic B1 generally shows the best performance, and is extended for use in an algorithm that groups cell lines by equal expression levels for each gene. An extension is also outlined for more general ANOVA tests including several factors. The methods presented are implemented in the freely available statistical language R. They are available at http://www.math.uu.se/staff/pages/?uname=ingrid.
Kupfer P, Huber R, Weber M, Vlaic S, Haupl T, Koczan D BMC Med Genomics. 2014; 7:40.
PMID: 24989895 PMC: 4099018. DOI: 10.1186/1755-8794-7-40.
Metabolomics of ApcMin/+ mice genetically susceptible to intestinal cancer.
Dazard J, Sandlers Y, Doerner S, Berger N, Brunengraber H BMC Syst Biol. 2014; 8:72.
PMID: 24954394 PMC: 4099115. DOI: 10.1186/1752-0509-8-72.
Metabolomic Analysis of Liver Tissue from the VX2 Rabbit Model of Secondary Liver Tumors.
Ibarra R, Dazard J, Sandlers Y, Rehman F, Abbas R, Kombu R HPB Surg. 2014; 2014:310372.
PMID: 24723740 PMC: 3958765. DOI: 10.1155/2014/310372.
Kupfer P, Guthke R, Pohlers D, Huber R, Koczan D, Kinne R BMC Med Genomics. 2012; 5:23.
PMID: 22682473 PMC: 3528008. DOI: 10.1186/1755-8794-5-23.
baySeq: empirical Bayesian methods for identifying differential expression in sequence count data.
Hardcastle T, Kelly K BMC Bioinformatics. 2010; 11:422.
PMID: 20698981 PMC: 2928208. DOI: 10.1186/1471-2105-11-422.