Genealogical Working Distributions for Bayesian Model Testing with Phylogenetic Uncertainty
Overview
Authors
Affiliations
Marginal likelihood estimates to compare models using Bayes factors frequently accompany Bayesian phylogenetic inference. Approaches to estimate marginal likelihoods have garnered increased attention over the past decade. In particular, the introduction of path sampling (PS) and stepping-stone sampling (SS) into Bayesian phylogenetics has tremendously improved the accuracy of model selection. These sampling techniques are now used to evaluate complex evolutionary and population genetic models on empirical data sets, but considerable computational demands hamper their widespread adoption. Further, when very diffuse, but proper priors are specified for model parameters, numerical issues complicate the exploration of the priors, a necessary step in marginal likelihood estimation using PS or SS. To avoid such instabilities, generalized SS (GSS) has recently been proposed, introducing the concept of "working distributions" to facilitate--or shorten--the integration process that underlies marginal likelihood estimation. However, the need to fix the tree topology currently limits GSS in a coalescent-based framework. Here, we extend GSS by relaxing the fixed underlying tree topology assumption. To this purpose, we introduce a "working" distribution on the space of genealogies, which enables estimating marginal likelihoods while accommodating phylogenetic uncertainty. We propose two different "working" distributions that help GSS to outperform PS and SS in terms of accuracy when comparing demographic and evolutionary models applied to synthetic data and real-world examples. Further, we show that the use of very diffuse priors can lead to a considerable overestimation in marginal likelihood when using PS and SS, while still retrieving the correct marginal likelihood using both GSS approaches. The methods used in this article are available in BEAST, a powerful user-friendly software package to perform Bayesian evolutionary analyses.
Molecular epidemiology, evolution, and transmission dynamics of raccoon rabies virus in Connecticut.
Veytsel G, Desiato J, Chung H, Tan S, Risatti G, Helal Z Virus Evol. 2025; 11(1):veae114.
PMID: 39802825 PMC: 11711587. DOI: 10.1093/ve/veae114.
Phylogeography and Evolutionary Dynamics of Tobacco Curly Shoot Virus.
Long X, Zhang S, Shen J, Du Z, Gao F Viruses. 2025; 16(12.
PMID: 39772160 PMC: 11680240. DOI: 10.3390/v16121850.
Tay J, Kocher A, Duchene S PLoS Comput Biol. 2024; 20(11):e1012371.
PMID: 39504312 PMC: 11573219. DOI: 10.1371/journal.pcbi.1012371.
Holtz A, Van Weyenbergh J, Hong S, Cuypers L, OToole A, Dudas G BMC Infect Dis. 2024; 24(1):1139.
PMID: 39390446 PMC: 11468156. DOI: 10.1186/s12879-024-09967-w.
Molecular epidemiology of recurrent zoonotic transmission of mpox virus in West Africa.
Djuicy D, Omah I, Parker E, Tomkins-Tinch C, Otieno J, Yifomnjou M medRxiv. 2024; .
PMID: 38947021 PMC: 11213044. DOI: 10.1101/2024.06.18.24309115.