Automated Analysis of Phylogenetic Clusters

Overview
Publisher BioMed Central
Specialty Biology
Date 2013 Nov 7
PMID 24191891
Citations 212
Abstract

Background: As sequence data sets used for the investigation of pathogen transmission patterns increase in size, automated tools and standardized methods for cluster analysis have become necessary. We have developed an automated Cluster Picker which identifies monophyletic clades meeting user-input criteria for bootstrap support and maximum genetic distance within large phylogenetic trees. A second tool, the Cluster Matcher, automates the process of linking genetic data to epidemiological or clinical data, and matches clusters between runs of the Cluster Picker.
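
The selection rule described above can be illustrated with a minimal Python sketch. It is not the Cluster Picker's own code: the `Node` class, the `pick_clusters` function, the top-down traversal, and the use of the largest pairwise distance within a clade as the "maximum genetic distance" are all assumptions made for illustration, and the tree and distances are toy values.

```python
from dataclasses import dataclass, field
from itertools import combinations
from typing import List, Optional


@dataclass
class Node:
    """Hypothetical tree node: leaves carry a name, internal nodes a support value."""
    name: Optional[str] = None
    support: float = 0.0
    children: List["Node"] = field(default_factory=list)


def leaves(node):
    """Return the leaf names below a node."""
    if not node.children:
        return [node.name]
    return [name for child in node.children for name in leaves(child)]


def max_pairwise_distance(names, dist):
    """Largest pairwise distance among the given leaves (assumed distance criterion)."""
    return max(dist[frozenset(pair)] for pair in combinations(names, 2))


def pick_clusters(node, dist, min_support, max_distance):
    """Walk the tree from the root; report the first clade on each path that
    meets both thresholds, and do not look inside a reported clade."""
    if not node.children:
        return []
    names = leaves(node)
    if (node.support >= min_support
            and max_pairwise_distance(names, dist) <= max_distance):
        return [sorted(names)]
    clusters = []
    for child in node.children:
        clusters.extend(pick_clusters(child, dist, min_support, max_distance))
    return clusters


# Toy tree ((A,B)0.95,(C,D)0.60)1.0 with made-up distances.
root = Node(support=1.0, children=[
    Node(support=0.95, children=[Node("A"), Node("B")]),
    Node(support=0.60, children=[Node("C"), Node("D")]),
])
dist = {frozenset(pair): d for pair, d in [
    (("A", "B"), 0.01), (("C", "D"), 0.02),
    (("A", "C"), 0.10), (("A", "D"), 0.10),
    (("B", "C"), 0.10), (("B", "D"), 0.10),
]}

clusters = pick_clusters(root, dist, min_support=0.9, max_distance=0.045)
```

With these toy values only the clade (A, B) passes both thresholds: the root is too divergent, and (C, D) falls below the support threshold. Lowering `min_support` or raising `max_distance` changes which clades are reported, which is the kind of threshold sensitivity the abstract goes on to explore.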

Results: We explore the effect of different bootstrap and genetic distance thresholds on clusters identified in a data set of publicly available HIV sequences, and compare these results to those of a previously published tool for cluster identification. To demonstrate their utility, we then use the Cluster Picker and Cluster Matcher together to investigate how clusters in the data set changed over time. We find that clusters containing sequences from more than one UK location at the first time point (multiple origin) were significantly more likely to grow than those representing only a single location.
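
Comparing clusters between two time points might be sketched as follows. This is an illustrative Python example, not the Cluster Matcher itself: the function names, the rule of matching clusters that share at least one sequence, and the definition of growth as gaining sequences absent at the first time point are assumptions, and the cluster contents are toy values.

```python
def match_clusters(earlier, later):
    """Map each earlier cluster id to the later cluster ids that share
    at least one sequence with it (assumed matching rule)."""
    return {
        eid: sorted(lid for lid, lseqs in later.items() if eseqs & lseqs)
        for eid, eseqs in earlier.items()
    }


def grew(earlier, later, matches):
    """Flag an earlier cluster as grown if its matched later clusters
    contain sequences that were not in it at the first time point."""
    result = {}
    for eid, lids in matches.items():
        later_members = set().union(*(later[lid] for lid in lids))
        result[eid] = bool(later_members - earlier[eid])
    return result


# Toy cluster assignments from two hypothetical runs.
earlier = {"c1": {"A", "B"}, "c2": {"C", "D"}}
later = {"k1": {"A", "B", "E"}, "k2": {"C", "D"}}

matches = match_clusters(earlier, later)
growth = grew(earlier, later, matches)
```

Here cluster `c1` is matched to `k1` and flagged as grown because sequence E joined it, while `c2` is matched to `k2` and is unchanged. A table of such flags, joined to location data for each cluster, is the kind of input a comparison of single-origin and multiple-origin clusters would need.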

Conclusions: The Cluster Picker and Cluster Matcher can rapidly process phylogenetic trees containing tens of thousands of sequences. Together these tools will facilitate comparisons of pathogen transmission dynamics between studies and countries.

Citing Articles

Molecular network analysis for detecting HIV transmission clusters: insights and implications.

Liu Y, Hua L, Wu W, Ge Y, Li W, Wei P Front Public Health. 2025; 13:1429464.

PMID: 39944061 PMC: 11814211. DOI: 10.3389/fpubh.2025.1429464.


Genomic Epidemiology of the Main SARS-CoV-2 Variants Circulating in Italy During the Omicron Era.

Bergna A, Lai A, Sagradi F, Menzo S, Mancini N, Bruzzone B J Med Virol. 2025; 97(2):e70215.

PMID: 39936851 PMC: 11816846. DOI: 10.1002/jmv.70215.


Estimating the Current Routes of Transmission in HIV-1 F1 Subtype Infected Persons in Romania: Differences Between Self-Reporting and Phylogenetic Data.

Hohan R, Paraschiv S, Nicolae I, Otelea D Pathogens. 2024; 13(11).

PMID: 39599513 PMC: 11597275. DOI: 10.3390/pathogens13110960.


Dengue virus genomic surveillance in the applying Wolbachia to eliminate dengue trial reveals genotypic efficacy and disruption of focal transmission.

Edenborough K, Supriyati E, Dufault S, Arguni E, Indriani C, Denton J Sci Rep. 2024; 14(1):28004.

PMID: 39543157 PMC: 11564853. DOI: 10.1038/s41598-024-78008-y.


Phylogenetic-informed graph deep learning to classify dynamic transmission clusters in infectious disease epidemics.

Sun C, Li Y, Marini S, Riva A, Oliver Wu D, Fang R Bioinform Adv. 2024; 4(1):vbae158.

PMID: 39529841 PMC: 11552518. DOI: 10.1093/bioadv/vbae158.

