» Articles » PMID: 30137329

Phylogenetic Clustering of Genes Reveals Shared Evolutionary Trajectories and Putative Gene Functions

Overview
Date 2018 Aug 24
PMID 30137329
Citations 12
Authors
Affiliations
Soon will be listed here.
Abstract

Homologous genes in prokaryotes can be described using phylogenetic profiles which summarize their patterns of presence or absence across a set of genomes. Phylogenetic profiles have been used for nearly twenty years to cluster genes based on measures such as the Euclidean distance between profile vectors. However, most approaches do not take into account the phylogenetic relationships amongst the profiled genomes, and overrepresentation of certain taxonomic groups (i.e., pathogenic species with many sequenced representatives) can skew the interpretation of profiles. We propose a new approach that uses a coevolutionary method defined by Pagel to account for the phylogenetic relationships amongst target organisms, and a hierarchical-clustering approach to define sets of genes with common distributions across the organisms. The clusters we obtain using our method show greater evidence of phylogenetic and functional clustering than a recently published approach based on hidden Markov models. Our clustering method identifies sets of amino-acid biosynthesis genes that constitute cohesive pathways, and motility/chemotaxis genes with common histories of descent and lateral gene transfer.

Citing Articles

gene family in passion fruit (): initial characterization and expression profiling diversity.

Hou Z, Liang J, Cai X, Lin J, Wang X, Liu R Front Plant Sci. 2024; 14:1279001.

PMID: 38312363 PMC: 10835403. DOI: 10.3389/fpls.2023.1279001.


Using multi-scale genomics to associate poorly annotated genes with rare diseases.

Canavati C, Sherill-Rofe D, Kamal L, Bloch I, Zahdeh F, Sharon E Genome Med. 2024; 16(1):4.

PMID: 38178268 PMC: 10765705. DOI: 10.1186/s13073-023-01276-2.


Genome-wide identification and expression analysis of the gene family in soybean () reveal their potential roles in response to abiotic stress.

Gong Y, Wang D, Xie H, Zhao Z, Chen Y, Zhang D Front Plant Sci. 2023; 14:1291731.

PMID: 38116151 PMC: 10728876. DOI: 10.3389/fpls.2023.1291731.


Detecting patterns of accessory genome coevolution in Staphylococcus aureus using data from thousands of genomes.

Mehta R, Petit 3rd R, Read T, Weissman D BMC Bioinformatics. 2023; 24(1):243.

PMID: 37296404 PMC: 10251594. DOI: 10.1186/s12859-023-05363-4.


Exploring the mobilome and resistome of in a One Health context across two continents.

Sanderson H, Gray K, Manuele A, Maguire F, Khan A, Liu C Microb Genom. 2022; 8(9).

PMID: 36129737 PMC: 9676038. DOI: 10.1099/mgen.0.000880.


References
1.
Stamatakis A . RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006; 22(21):2688-90. DOI: 10.1093/bioinformatics/btl446. View

2.
Ochman H, Lawrence J, Groisman E . Lateral gene transfer and the nature of bacterial innovation. Nature. 2000; 405(6784):299-304. DOI: 10.1038/35012500. View

3.
Pflughoeft K, Versalovic J . Human microbiome in health and disease. Annu Rev Pathol. 2011; 7:99-122. DOI: 10.1146/annurev-pathol-011811-132421. View

4.
Vert J . A tree kernel to analyse phylogenetic profiles. Bioinformatics. 2002; 18 Suppl 1:S276-84. DOI: 10.1093/bioinformatics/18.suppl_1.s276. View

5.
Ye Y, Choi J, Tang H . RAPSearch: a fast protein similarity search tool for short reads. BMC Bioinformatics. 2011; 12:159. PMC: 3113943. DOI: 10.1186/1471-2105-12-159. View