Improved Harmonic Mean Estimator for Phylogenetic Model Evidence
Overview
Molecular Biology
Authors
Affiliations
Bayesian phylogenetic methods are generating noticeable enthusiasm in the field of molecular systematics. Many phylogenetic models are often at stake, and different approaches are used to compare them within a Bayesian framework. The Bayes factor, defined as the ratio of the marginal likelihoods of two competing models, plays a key role in Bayesian model selection. We focus on an alternative estimator of the marginal likelihood whose computation is still a challenging problem. Several computational solutions have been proposed, none of which can be considered outperforming the others simultaneously in terms of simplicity of implementation, computational burden and precision of the estimates. Practitioners and researchers, often led by available software, have privileged so far the simplicity of the harmonic mean (HM) estimator. However, it is known that the resulting estimates of the Bayesian evidence in favor of one model are biased and often inaccurate, up to having an infinite variance so that the reliability of the corresponding conclusions is doubtful. We consider possible improvements of the generalized harmonic mean (GHM) idea that recycle Markov Chain Monte Carlo (MCMC) simulations from the posterior, share the computational simplicity of the original HM estimator, but, unlike it, overcome the infinite variance issue. We show reliability and comparative performance of the improved harmonic mean estimators comparing them to approximation techniques relying on improved variants of the thermodynamic integration.
Marginal Likelihoods in Phylogenetics: A Review of Methods and Applications.
Oaks J, Cobb K, Minin V, Leache A Syst Biol. 2019; 68(5):681-697.
PMID: 30668834 PMC: 6701458. DOI: 10.1093/sysbio/syz003.
Genealogical Working Distributions for Bayesian Model Testing with Phylogenetic Uncertainty.
Baele G, Lemey P, Suchard M Syst Biol. 2015; 65(2):250-64.
PMID: 26526428 PMC: 5009437. DOI: 10.1093/sysbio/syv083.
Posterior predictive Bayesian phylogenetic model selection.
Lewis P, Xie W, Chen M, Fan Y, Kuo L Syst Biol. 2013; 63(3):309-21.
PMID: 24193892 PMC: 3985471. DOI: 10.1093/sysbio/syt068.