» Articles » PMID: 33406072

Sampling Bias and Model Choice in Continuous Phylogeography: Getting Lost on a Random Walk

Overview
Specialty Biology
Date 2021 Jan 6
PMID 33406072
Citations 39
Authors
Affiliations
Soon will be listed here.
Abstract

Phylogeographic inference allows reconstruction of past geographical spread of pathogens or living organisms by integrating genetic and geographic data. A popular model in continuous phylogeography-with location data provided in the form of latitude and longitude coordinates-describes spread as a Brownian motion (Brownian Motion Phylogeography, BMP) in continuous space and time, akin to similar models of continuous trait evolution. Here, we show that reconstructions using this model can be strongly affected by sampling biases, such as the lack of sampling from certain areas. As an attempt to reduce the effects of sampling bias on BMP, we consider the addition of sequence-free samples from under-sampled areas. While this approach alleviates the effects of sampling bias, in most scenarios this will not be a viable option due to the need for prior knowledge of an outbreak's spatial distribution. We therefore consider an alternative model, the spatial Λ-Fleming-Viot process (ΛFV), which has recently gained popularity in population genetics. Despite the ΛFV's robustness to sampling biases, we find that the different assumptions of the ΛFV and BMP models result in different applicabilities, with the ΛFV being more appropriate for scenarios of endemic spread, and BMP being more appropriate for recent outbreaks or colonizations.

Citing Articles

The promise and challenge of spatial inference with the full ancestral recombination graph under Brownian motion.

Deraje P, Kitchens J, Coop G, Osmond M bioRxiv. 2025; .

PMID: 40027772 PMC: 11870416. DOI: 10.1101/2024.04.10.588900.


Association of poultry vaccination with interspecies transmission and molecular evolution of H5 subtype avian influenza virus.

Li B, Raghwani J, Hill S, Francois S, Lefrancq N, Liang Y Sci Adv. 2025; 11(4):eado9140.

PMID: 39841843 PMC: 11753422. DOI: 10.1126/sciadv.ado9140.


spread.gl: visualizing pathogen dispersal in a high-performance browser application.

Li Y, Bollen N, Hong S, Brusselmans M, Gambaro F, Klaps J Bioinformatics. 2024; 40(12).

PMID: 39626311 PMC: 11652268. DOI: 10.1093/bioinformatics/btae721.


How fast are viruses spreading in the wild?.

Dellicour S, Bastide P, Rocu P, Fargette D, Hardy O, Suchard M PLoS Biol. 2024; 22(12):e3002914.

PMID: 39625970 PMC: 11614233. DOI: 10.1371/journal.pbio.3002914.


Estimating dispersal rates and locating genetic ancestors with genome-wide genealogies.

Osmond M, Coop G Elife. 2024; 13.

PMID: 39589398 PMC: 11658769. DOI: 10.7554/eLife.72177.


References
1.
Bouckaert R, Vaughan T, Barido-Sottani J, Duchene S, Fourment M, Gavryushkina A . BEAST 2.5: An advanced software platform for Bayesian evolutionary analysis. PLoS Comput Biol. 2019; 15(4):e1006650. PMC: 6472827. DOI: 10.1371/journal.pcbi.1006650. View

2.
Drummond A, Ho S, Phillips M, Rambaut A . Relaxed phylogenetics and dating with confidence. PLoS Biol. 2006; 4(5):e88. PMC: 1395354. DOI: 10.1371/journal.pbio.0040088. View

3.
Gill M, Lemey P, Faria N, Rambaut A, Shapiro B, Suchard M . Improving Bayesian population dynamics inference: a coalescent-based model for multiple loci. Mol Biol Evol. 2012; 30(3):713-24. PMC: 3563973. DOI: 10.1093/molbev/mss265. View

4.
Cavalli-Sforza L, Edwards A . PHYLOGENETIC ANALYSIS: MODELS AND ESTIMATION PROCEDURES. Evolution. 2017; 21(3):550-570. DOI: 10.1111/j.1558-5646.1967.tb03411.x. View

5.
Gill M, Ho L, Baele G, Lemey P, Suchard M . A Relaxed Directional Random Walk Model for Phylogenetic Trait Evolution. Syst Biol. 2016; 66(3):299-319. PMC: 6075548. DOI: 10.1093/sysbio/syw093. View