
Bi-alignments with Affine Gaps Costs

Overview
Publisher: BioMed Central
Date: 2022 May 16
PMID: 35578255
Abstract

Background: Commonly, sequence and structure elements are assumed to evolve congruently, such that homologous sequence positions correspond to homologous structural features. Assuming congruent evolution, alignments based on sequence and structure similarity can therefore optimize both similarities at the same time in a single alignment. To model incongruent evolution, where sequence and structural features diverge positionally, we recently introduced bi-alignments. This generalization of sequence- and structure-based alignments is best understood as an alignment of two distinct pairwise alignments of the same entities: one modeling sequence similarity, the other structural similarity.

Results: Optimal bi-alignments with affine gap costs (or affine shift costs) for two constituent alignments can be computed exactly in quartic space and time. Bi-alignments with both affine shift and affine gap costs, as well as bi-alignments with sub-additive gap costs, can also be optimized efficiently. Affine gap-cost bi-alignments of large proteins ([Formula: see text] aa) can be computed.
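For readers unfamiliar with affine gap costs, the following is a minimal sketch of the classic three-state (Gotoh-style) recursion for a single pairwise alignment. It is not the bi-alignment algorithm of the paper, which couples two such alignments and additionally scores shifts between them. The function name `affine_gap_score` and all scoring parameters are illustrative assumptions; a gap of length k is scored as `gap_open + k * gap_extend`.

```python
import math


def affine_gap_score(a, b, match=1, mismatch=-1, gap_open=-3, gap_extend=-1):
    """Optimal global alignment score of a and b under affine gap costs.

    Illustrative scoring only: a gap of length k scores gap_open + k * gap_extend.
    """
    n, m = len(a), len(b)
    neg = -math.inf
    # M: last column aligns a[i-1] with b[j-1]
    # X: last column aligns a[i-1] with a gap
    # Y: last column aligns a gap with b[j-1]
    M = [[neg] * (m + 1) for _ in range(n + 1)]
    X = [[neg] * (m + 1) for _ in range(n + 1)]
    Y = [[neg] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0
    for i in range(1, n + 1):
        X[i][0] = gap_open + i * gap_extend
    for j in range(1, m + 1):
        Y[0][j] = gap_open + j * gap_extend

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            M[i][j] = s + max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1])
            # Opening a new gap pays gap_open once; extending pays only gap_extend.
            X[i][j] = max(
                M[i - 1][j] + gap_open + gap_extend,
                Y[i - 1][j] + gap_open + gap_extend,
                X[i - 1][j] + gap_extend,
            )
            Y[i][j] = max(
                M[i][j - 1] + gap_open + gap_extend,
                X[i][j - 1] + gap_open + gap_extend,
                Y[i][j - 1] + gap_extend,
            )
    return max(M[n][m], X[n][m], Y[n][m])


# One gap of length 2 (cost -3 - 2) plus two matches.
print(affine_gap_score("ACGT", "AT"))
```

This single-alignment recursion runs in quadratic time and space. The quartic bounds stated above refer to the bi-alignment setting, where two pairwise alignments of the same entities are optimized jointly.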

Conclusion: Affine cost bi-alignments are of practical interest to study shifts of protein sequences and protein structures relative to each other.

Availability: The affine cost bi-alignment algorithm has been implemented in Python 3 and Cython. It is available as free software from https://github.com/s-will/BiAlign/releases/tag/v0.3 and as bioconda package bialign.

Citing Articles

Comprehensive survey of conserved RNA secondary structures in full-genome alignment of Hepatitis C virus.

Triebel S, Lamkiewicz K, Ontiveros N, Sweeney B, Stadler P, Petrov A. Sci Rep. 2024; 14(1):15145.

PMID: 38956134 PMC: 11219754. DOI: 10.1038/s41598-024-62897-0.


An ascomycete H4 variant with an unknown function.

Flipphi M, Harispe M, Hamari Z, Kocsube S, Scazzocchio C, Ramon A. R Soc Open Sci. 2024; 11(2):231705.

PMID: 38384781 PMC: 10878826. DOI: 10.1098/rsos.231705.
