A Bayesian Mixture Model for Metaanalysis of Microarray Studies
Overview
Molecular Biology
Authors
Affiliations
The increased availability of microarray data has been calling for statistical methods to integrate findings across studies. A common goal of microarray analysis is to determine differentially expressed genes between two conditions, such as treatment vs control. A recent Bayesian metaanalysis model used a prior distribution for the mean log-expression ratios that was a mixture of two normal distributions. This model centered the prior distribution of differential expression at zero, and separated genes into two groups only: expressed and nonexpressed. Here, we introduce a Bayesian three-component truncated normal mixture prior model that more flexibly assigns prior distributions to the differentially expressed genes and produces three groups of genes: up and downregulated, and nonexpressed. We found in simulations of two and five studies that the three-component model outperformed the two-component model using three comparison measures. When analyzing biological data of Bacillus subtilis, we found that the three-component model discovered more genes and omitted fewer genes for the same levels of posterior probability of differential expression than the two-component model, and discovered more genes for fixed thresholds of Bayesian false discovery. We assumed that the data sets were produced from the same microarray platform and were prescaled.
On integrating multi-experiment microarray data.
Tsiliki G, Vlachakis D, Kossida S Philos Trans A Math Phys Eng Sci. 2014; 372(2016):20130136.
PMID: 24751870 PMC: 3996576. DOI: 10.1098/rsta.2013.0136.
Tsoi L, Qin T, Slate E, Zheng W BMC Bioinformatics. 2011; 12:438.
PMID: 22078224 PMC: 3251006. DOI: 10.1186/1471-2105-12-438.
Schmidberger M, Lennert S, Mansmann U Bioinform Biol Insights. 2011; 5:13-39.
PMID: 21423405 PMC: 3045047. DOI: 10.4137/BBI.S5537.
A Bayesian model for cross-study differential gene expression.
Scharpf R, Tjelmeland H, Parmigiani G, Nobel A J Am Stat Assoc. 2010; 104(488):1295-1310.
PMID: 21127725 PMC: 2994029. DOI: 10.1198/jasa.2009.ap07611.
Candidate pathways and genes for prostate cancer: a meta-analysis of gene expression data.
Gorlov I, Byun J, Gorlova O, Aparicio A, Efstathiou E, Logothetis C BMC Med Genomics. 2009; 2:48.
PMID: 19653896 PMC: 2731785. DOI: 10.1186/1755-8794-2-48.