Bayesian Modeling of Differential Gene Expression
Overview
We present a Bayesian hierarchical model for detecting differentially expressed genes that includes simultaneous estimation of array effects, and show how to use the output to choose lists of genes for further investigation. We give empirical evidence that expression-level-dependent array effects are needed, and explore different nonlinear functions as part of our model-based approach to normalization. The model includes gene-specific variances but imposes some necessary shrinkage through a hierarchical structure. Model criticism via posterior predictive checks is discussed. Modeling the array effects (normalization) simultaneously with differential expression gives fewer false positive results. To choose a list of genes, we propose combining various criteria (for instance, fold change and overall expression) into a single indicator variable for each gene. The posterior distribution of these variables is used to pick the list of genes, thereby taking into account uncertainty in parameter estimates. In an application to mouse knockout data, Gene Ontology annotations over- and underrepresented among the genes on the chosen list are consistent with biological expectations.
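The gene-selection step described above can be illustrated with a minimal sketch. It assumes posterior draws of a per-gene log fold change and overall expression level are already available (for example, from MCMC output). The function names, thresholds, probability cutoff, and simulated draws are all hypothetical and are not taken from the paper; the sketch only shows how several criteria can be combined into one indicator per gene and how its posterior probability can be used to pick a list.

```python
import numpy as np


def posterior_indicator_prob(delta_draws, alpha_draws,
                             fc_threshold=1.0, expr_threshold=4.0):
    """Posterior probability that each gene meets both criteria.

    delta_draws, alpha_draws: arrays of shape (n_draws, n_genes) holding
    posterior draws of log fold change and overall expression.
    The thresholds are illustrative assumptions, not values from the paper.
    """
    # Indicator is 1 for a draw when both criteria hold for that gene.
    indicator = (np.abs(delta_draws) > fc_threshold) & (alpha_draws > expr_threshold)
    # Averaging the indicator over draws estimates its posterior mean.
    return indicator.mean(axis=0)


def select_genes(probs, cutoff=0.8):
    """Return indices of genes whose posterior probability reaches the cutoff."""
    return np.flatnonzero(probs >= cutoff)


# Simulated stand-in for posterior draws (hypothetical data, 5 genes).
rng = np.random.default_rng(0)
n_draws, n_genes = 2000, 5
center_delta = np.array([0.0, 0.2, 1.5, -2.0, 3.0])  # log fold change
center_alpha = np.array([6.0, 6.0, 6.0, 6.0, 2.0])   # overall expression
delta_draws = center_delta + 0.2 * rng.standard_normal((n_draws, n_genes))
alpha_draws = center_alpha + 0.2 * rng.standard_normal((n_draws, n_genes))

probs = posterior_indicator_prob(delta_draws, alpha_draws)
chosen = select_genes(probs)
print(np.round(probs, 3), chosen)
```

In this toy setup the last gene has a large fold change but low overall expression, so it is excluded even though a fold-change rule alone would pick it. Because the indicator is averaged over posterior draws, uncertainty in both quantities feeds into the selection rather than only point estimates.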