» Articles » PMID: 26116928

Proteny: Discovering and Visualizing Statistically Significant Syntenic Clusters at the Proteome Level

Overview
Journal Bioinformatics
Specialty Biology
Date 2015 Jun 28
PMID 26116928
Citations 3
Authors
Affiliations
Soon will be listed here.
Abstract

Background: With more and more genomes being sequenced, detecting synteny between genomes becomes more and more important. However, for microorganisms the genomic divergence quickly becomes large, resulting in different codon usage and shuffling of gene order and gene elements such as exons.

Results: We present Proteny, a methodology to detect synteny between diverged genomes. It operates on the amino acid sequence level to be insensitive to codon usage adaptations and clusters groups of exons disregarding order to handle diversity in genomic ordering between genomes. Furthermore, Proteny assigns significance levels to the syntenic clusters such that they can be selected on statistical grounds. Finally, Proteny provides novel ways to visualize results at different scales, facilitating the exploration and interpretation of syntenic regions. We test the performance of Proteny on a standard ground truth dataset, and we illustrate the use of Proteny on two closely related genomes (two different strains of Aspergillus niger) and on two distant genomes (two species of Basidiomycota). In comparison to other tools, we find that Proteny finds clusters with more true homologies in fewer clusters that contain more genes, i.e. Proteny is able to identify a more consistent synteny. Further, we show how genome rearrangements, assembly errors, gene duplications and the conservation of specific genes can be easily studied with Proteny.

Availability And Implementation: Proteny is freely available at the Delft Bioinformatics Lab website http://bioinformatics.tudelft.nl/dbl/software.

Contact: t.gehrmann@tudelft.nl

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

De novo gene birth.

Van Oss S, Carvunis A PLoS Genet. 2019; 15(5):e1008160.

PMID: 31120894 PMC: 6542195. DOI: 10.1371/journal.pgen.1008160.


Approximate, simultaneous comparison of microbial genome architectures via syntenic anchoring of quiver representations.

Salazar A, Abeel T Bioinformatics. 2018; 34(17):i732-i742.

PMID: 30423098 PMC: 6129293. DOI: 10.1093/bioinformatics/bty614.


SimpleSynteny: a web-based tool for visualization of microsynteny across multiple species.

Veltri D, Wight M, Crouch J Nucleic Acids Res. 2016; 44(W1):W41-5.

PMID: 27141960 PMC: 4987899. DOI: 10.1093/nar/gkw330.

References
1.
Simillion C, Janssens K, Sterck L, Van de Peer Y . i-ADHoRe 2.0: an improved tool to detect degenerated genomic homology using genomic profiles. Bioinformatics. 2007; 24(1):127-8. DOI: 10.1093/bioinformatics/btm449. View

2.
Sinha A, Meller J . Cinteny: flexible analysis and visualization of synteny and genome rearrangements in multiple organisms. BMC Bioinformatics. 2007; 8:82. PMC: 1821339. DOI: 10.1186/1471-2105-8-82. View

3.
Ohm R, de Jong J, Lugones L, Aerts A, Kothe E, Stajich J . Genome sequence of the model mushroom Schizophyllum commune. Nat Biotechnol. 2010; 28(9):957-63. DOI: 10.1038/nbt.1643. View

4.
Cresnar B, Petric S . Cytochrome P450 enzymes in the fungal kingdom. Biochim Biophys Acta. 2010; 1814(1):29-35. DOI: 10.1016/j.bbapap.2010.06.020. View

5.
Angiuoli S, Salzberg S . Mugsy: fast multiple alignment of closely related whole genomes. Bioinformatics. 2010; 27(3):334-42. PMC: 3031037. DOI: 10.1093/bioinformatics/btq665. View