
PhyloPattern: Regular Expressions to Identify Complex Patterns in Phylogenetic Trees

Overview
Publisher Biomed Central
Specialty Biology
Date 2009 Sep 22
PMID 19765311
Citations 40
Abstract

Background: To effectively apply evolutionary concepts in genome-scale studies, large numbers of phylogenetic trees have to be automatically analysed, at a level approaching human expertise. Complex architectures must be recognized within the trees, so that associated information can be extracted.

Results: Here, we present a new software library, PhyloPattern, for automating tree manipulations and analysis. PhyloPattern includes three main modules, which address essential tasks in high-throughput phylogenetic tree analysis: node annotation, pattern matching, and tree comparison. PhyloPattern thus allows the programmer to focus on: i) the use of predefined or user-defined annotation functions to perform immediate or deferred evaluation of node properties, ii) the search for user-defined patterns in large phylogenetic trees, and iii) the pairwise comparison of trees by dynamically generating patterns from one tree and applying them to the other.
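
The abstract does not show PhyloPattern's actual interface, so the following is only a minimal Python sketch of the three ideas named above (node annotation, pattern matching, and tree comparison by generated patterns). Every name in it (Node, Pattern, annotate, species_below, matches, find_all, pattern_from_tree) is hypothetical and chosen for illustration; none is taken from the library.

```python
from dataclasses import dataclass, field
from itertools import permutations
from typing import Callable, Iterator, List


@dataclass
class Node:
    """Hypothetical tree node; leaves carry a species name."""
    name: str = ""
    children: List["Node"] = field(default_factory=list)
    tags: dict = field(default_factory=dict)


@dataclass
class Pattern:
    """Hypothetical pattern: a node predicate plus unordered child patterns."""
    test: Callable[[Node], bool]
    children: List["Pattern"] = field(default_factory=list)


def annotate(node: Node, key: str, fn: Callable[[Node], object]) -> None:
    """Post-order annotation: children are tagged before their parent."""
    for child in node.children:
        annotate(child, key, fn)
    node.tags[key] = fn(node)


def species_below(node: Node) -> frozenset:
    """Annotation function: set of species found under a node."""
    if not node.children:
        return frozenset([node.name])
    return frozenset().union(*(c.tags["species"] for c in node.children))


def matches(pattern: Pattern, node: Node) -> bool:
    """True if the pattern matches at this node, ignoring child order."""
    if not pattern.test(node):
        return False
    if not pattern.children:
        return True  # a childless pattern accepts any subtree below
    if len(pattern.children) != len(node.children):
        return False
    return any(
        all(matches(p, c) for p, c in zip(pattern.children, perm))
        for perm in permutations(node.children)
    )


def find_all(pattern: Pattern, root: Node) -> Iterator[Node]:
    """Yield every node of the tree at which the pattern matches."""
    stack = [root]
    while stack:
        node = stack.pop()
        if matches(pattern, node):
            yield node
        stack.extend(node.children)


def pattern_from_tree(node: Node) -> Pattern:
    """Generate a pattern that demands the same species set at each node."""
    wanted = node.tags["species"]
    return Pattern(lambda n, w=wanted: n.tags["species"] == w,
                   [pattern_from_tree(c) for c in node.children])


# Three small example trees (illustrative data, not from the paper).
tree1 = Node(children=[Node(children=[Node("human"), Node("mouse")]), Node("fish")])
tree2 = Node(children=[Node("fish"), Node(children=[Node("mouse"), Node("human")])])
tree3 = Node(children=[Node(children=[Node("human"), Node("fish")]), Node("mouse")])
for t in (tree1, tree2, tree3):
    annotate(t, "species", species_below)

# Pattern: a node with one "fish" leaf child and one child containing "human".
fish_outgroup = Pattern(lambda n: True, [
    Pattern(lambda n: n.name == "fish"),
    Pattern(lambda n: "human" in n.tags["species"]),
])
hits = list(find_all(fish_outgroup, tree1))

# Tree comparison: derive a pattern from tree1 and apply it to the others.
same_as_tree2 = matches(pattern_from_tree(tree1), tree2)
same_as_tree3 = matches(pattern_from_tree(tree1), tree3)
```

In this sketch the annotation step runs eagerly; the deferred evaluation mentioned in the abstract could be approximated by computing a tag only when a predicate first asks for it. Trying every permutation of children is acceptable for binary trees but would need a smarter assignment strategy for nodes with many children.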

Conclusion: PhyloPattern greatly simplifies and accelerates the work of the computer scientist in the evolutionary biology field. The library has been used to automatically identify phylogenetic evidence for domain shuffling or gene loss events in the evolutionary histories of protein sequences. However, any workflow that relies on phylogenetic tree analysis could be automated with PhyloPattern.

Citing Articles

Effect of a Probiotic Beverage Enriched with Cricket Proteins on the Gut Microbiota: Composition of Gut and Correlation with Nutritional Parameters.

Dridi C, Millette M, Salmieri S, Aguilar Uscanga B, Lacroix S, Venneri T Foods. 2024; 13(2).

PMID: 38254505 PMC: 10814958. DOI: 10.3390/foods13020204.


Methods to Identify and Study the Evolution of Pseudogenes Using a Phylogenetic Approach.

Dainat J, Pontarotti P Methods Mol Biol. 2021; 2324:21-34.

PMID: 34165706 DOI: 10.1007/978-1-0716-1503-4_2.


Non-contiguous finished genome sequencing and description of sp. nov. isolated from human sputum.

Mbogning Fonkou M, Bilen M, Gouba N, Khelaifia S, Cadoret F, Nguyen T New Microbes New Infect. 2019; 29:100532.

PMID: 31011427 PMC: 6461582. DOI: 10.1016/j.nmni.2019.100532.


Genome sequence and description of gen. nov., sp. nov., a new bacterial genus isolated from human left colon.

Bonnet M, Mailhe M, Ricaboni D, Labas N, Richez M, Vitton V New Microbes New Infect. 2019; 29:100520.

PMID: 30949346 PMC: 6428956. DOI: 10.1016/j.nmni.2019.100520.


Noncontiguous finished genome sequences and description of sp. nov., sp. nov., sp. nov., sp. nov., sp. nov. and sp. nov. identified by culturomics.

Andrieu C, Mailhe M, Ricaboni D, Fonkou M, Bilen M, Cadoret F New Microbes New Infect. 2018; 26:73-88.

PMID: 30258636 PMC: 6154776. DOI: 10.1016/j.nmni.2018.06.006.

