» Articles » PMID: 26526428

Genealogical Working Distributions for Bayesian Model Testing with Phylogenetic Uncertainty

Overview
Journal Syst Biol
Specialty Biology
Date 2015 Nov 4
PMID 26526428
Citations 63
Authors
Affiliations
Soon will be listed here.
Abstract

Marginal likelihood estimates to compare models using Bayes factors frequently accompany Bayesian phylogenetic inference. Approaches to estimate marginal likelihoods have garnered increased attention over the past decade. In particular, the introduction of path sampling (PS) and stepping-stone sampling (SS) into Bayesian phylogenetics has tremendously improved the accuracy of model selection. These sampling techniques are now used to evaluate complex evolutionary and population genetic models on empirical data sets, but considerable computational demands hamper their widespread adoption. Further, when very diffuse, but proper priors are specified for model parameters, numerical issues complicate the exploration of the priors, a necessary step in marginal likelihood estimation using PS or SS. To avoid such instabilities, generalized SS (GSS) has recently been proposed, introducing the concept of "working distributions" to facilitate--or shorten--the integration process that underlies marginal likelihood estimation. However, the need to fix the tree topology currently limits GSS in a coalescent-based framework. Here, we extend GSS by relaxing the fixed underlying tree topology assumption. To this purpose, we introduce a "working" distribution on the space of genealogies, which enables estimating marginal likelihoods while accommodating phylogenetic uncertainty. We propose two different "working" distributions that help GSS to outperform PS and SS in terms of accuracy when comparing demographic and evolutionary models applied to synthetic data and real-world examples. Further, we show that the use of very diffuse priors can lead to a considerable overestimation in marginal likelihood when using PS and SS, while still retrieving the correct marginal likelihood using both GSS approaches. The methods used in this article are available in BEAST, a powerful user-friendly software package to perform Bayesian evolutionary analyses.

Citing Articles

Molecular epidemiology, evolution, and transmission dynamics of raccoon rabies virus in Connecticut.

Veytsel G, Desiato J, Chung H, Tan S, Risatti G, Helal Z Virus Evol. 2025; 11(1):veae114.

PMID: 39802825 PMC: 11711587. DOI: 10.1093/ve/veae114.


Phylogeography and Evolutionary Dynamics of Tobacco Curly Shoot Virus.

Long X, Zhang S, Shen J, Du Z, Gao F Viruses. 2025; 16(12.

PMID: 39772160 PMC: 11680240. DOI: 10.3390/v16121850.


Assessing the effect of model specification and prior sensitivity on Bayesian tests of temporal signal.

Tay J, Kocher A, Duchene S PLoS Comput Biol. 2024; 20(11):e1012371.

PMID: 39504312 PMC: 11573219. DOI: 10.1371/journal.pcbi.1012371.


Emergence of the B.1.214.2 SARS-CoV-2 lineage with an Omicron-like spike insertion and a unique upper airway immune signature.

Holtz A, Van Weyenbergh J, Hong S, Cuypers L, OToole A, Dudas G BMC Infect Dis. 2024; 24(1):1139.

PMID: 39390446 PMC: 11468156. DOI: 10.1186/s12879-024-09967-w.


Molecular epidemiology of recurrent zoonotic transmission of mpox virus in West Africa.

Djuicy D, Omah I, Parker E, Tomkins-Tinch C, Otieno J, Yifomnjou M medRxiv. 2024; .

PMID: 38947021 PMC: 11213044. DOI: 10.1101/2024.06.18.24309115.


References
1.
Lartillot N, Philippe H . Computing Bayes factors using thermodynamic integration. Syst Biol. 2006; 55(2):195-207. DOI: 10.1080/10635150500433722. View

2.
Lepage T, Bryant D, Philippe H, Lartillot N . A general comparison of relaxed molecular clock models. Mol Biol Evol. 2007; 24(12):2669-80. DOI: 10.1093/molbev/msm193. View

3.
Gill M, Lemey P, Faria N, Rambaut A, Shapiro B, Suchard M . Improving Bayesian population dynamics inference: a coalescent-based model for multiple loci. Mol Biol Evol. 2012; 30(3):713-24. PMC: 3563973. DOI: 10.1093/molbev/mss265. View

4.
Worobey M, Gemmel M, Teuwen D, Haselkorn T, Kunstman K, Bunce M . Direct evidence of extensive diversity of HIV-1 in Kinshasa by 1960. Nature. 2008; 455(7213):661-4. PMC: 3682493. DOI: 10.1038/nature07390. View

5.
Sinsheimer J, Lake J, Little R . Bayesian hypothesis testing of four-taxon topologies using molecular sequence data. Biometrics. 1996; 52(1):193-210. View