Bayesian Reconstruction of Transmission Trees from Genetic Sequences and Uncertain Infection Times

Overview
Date 2020 Oct 21
PMID 33085643
Citations 3
Abstract

Genetic sequence data of pathogens are increasingly used to investigate transmission dynamics in both endemic diseases and disease outbreaks. Such research can aid in the development of appropriate interventions and in the design of studies to evaluate them. Several computational methods have been proposed to infer transmission chains from sequence data; however, existing methods generally do not reconstruct transmission trees reliably, because genetic sequence data, or phylogenetic trees inferred from such data, contain insufficient information for accurate estimation of transmission chains. Here, we show by simulation studies that incorporating infection times, even when they are uncertain, can greatly improve the accuracy of reconstruction of transmission trees. To achieve this improvement, we propose a Bayesian inference method using Markov chain Monte Carlo that directly draws samples from the space of transmission trees under the assumption of complete sampling of the outbreak. The likelihood of each transmission tree is computed with a phylogenetic model by treating its internal nodes as transmission events. In a simulation study, we demonstrate that the accuracy of the reconstructed transmission trees depends mainly on the amount of information available on times of infection; we show superiority of the proposed method over two alternative approaches when infection times are known up to specified degrees of certainty. In addition, we illustrate the use of a multiple imputation framework to study features of epidemic dynamics, such as the relationship between characteristics of nodes and the average number of outbound or inbound edges, which signify possible transmission events from and to nodes. We apply the proposed method to a transmission cluster in San Diego and to a dataset from the 2014 Sierra Leone Ebola virus outbreak, and investigate the impact of biological, behavioral, and demographic factors.
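The general idea of drawing samples directly from the space of transmission trees can be illustrated with a minimal Python sketch. This is not the authors' method: the phylogenetic likelihood is replaced here by a toy Poisson model on pairwise genetic distances, infection times are treated as known and distinct rather than uncertain, and every name (`sample_trees`, `log_likelihood`, `rate`) is hypothetical. A tree is stored as a list giving each case's infector, with `None` for the index case.

```python
import math
import random


def log_poisson(k, mean):
    """Log of the Poisson probability of observing k events with the given mean."""
    return k * math.log(mean) - mean - math.lgamma(k + 1)


def log_likelihood(infector, times, dist, rate):
    """Toy likelihood (an assumption, not the paper's phylogenetic model):
    the genetic distance between a case and its infector is Poisson with
    mean rate * (difference between their infection times)."""
    total = 0.0
    for case, src in enumerate(infector):
        if src is None:  # index case has no infector
            continue
        gap = times[case] - times[src]
        total += log_poisson(dist[case][src], rate * gap)
    return total


def sample_trees(times, dist, rate=1.0, n_iter=5000, seed=0):
    """Metropolis sampler over transmission trees.

    Assumes complete sampling and distinct, known infection times, so the
    earliest case is the root and every other case is infected by some
    earlier case. Each proposal reassigns the infector of one random case.
    """
    rng = random.Random(seed)
    n = len(times)
    order = sorted(range(n), key=lambda i: times[i])
    root = order[0]
    # Start from a star-shaped tree: the root infects everyone.
    infector = [None if i == root else root for i in range(n)]
    current = log_likelihood(infector, times, dist, rate)
    samples = []
    for _ in range(n_iter):
        case = rng.choice(order[1:])
        candidates = [j for j in range(n) if times[j] < times[case]]
        proposal = list(infector)
        proposal[case] = rng.choice(candidates)
        new = log_likelihood(proposal, times, dist, rate)
        # Symmetric proposal, so the acceptance ratio is the likelihood ratio.
        if rng.random() < math.exp(min(0.0, new - current)):
            infector, current = proposal, new
        samples.append(tuple(infector))
    return samples
```

Because infectors are restricted to earlier cases, every proposed configuration is automatically a valid tree. In this simplified setting the posterior factorizes over cases, so sampling is only illustrative; with uncertain infection times or a full phylogenetic likelihood that would no longer hold.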

Citing Articles

Network methods and design of randomized trials: Application to investigation of COVID-19 vaccination boosters.

DeGruttola V, Goyal R, Martin N, Wang R Clin Trials. 2022; 19(4):363-374.

PMID: 35894099 PMC: 9378506. DOI: 10.1177/17407745221111818.


Methods Combining Genomic and Epidemiological Data in the Reconstruction of Transmission Trees: A Systematic Review.

Duault H, Durand B, Canini L Pathogens. 2022; 11(2).

PMID: 35215195 PMC: 8875843. DOI: 10.3390/pathogens11020252.


Statistical outbreak detection by joining medical records and pathogen similarity.

Miller J, Chen J, Sundermann A, Marsh J, Saul M, Shutt K J Biomed Inform. 2019; 91:103126.

PMID: 30771483 PMC: 6424617. DOI: 10.1016/j.jbi.2019.103126.
