A Weighting System and Algorithm for Aligning Many Phylogenetically Related Sequences
Overview
Biomedical Engineering
Authors
Affiliations
Most multiple sequence alignment programs explicitly or implicity try to optimize some score associated with the resulting alignment. Although the sum-of-pairs score is currently most widely used, it is inappropriate when the phylogenetic relationships among the sequences to be aligned are not evenly distributed, since the contributions of densely populated groups dominate those of minor members. This paper proposes an iterative multiple sequence alignment method which optimizes a weighted sum-of-pairs score, in which the weights given to individual sequence pairs are adjusted to compensate for the biased contributions. A simple method that rapidly calculates such a set of weights for a given phylogenetic tree is presented. The multiple sequence alignment is refined through partitioning and realignment restricted to the edges of the tree. Under this restriction, profile-based fast and rigorous group-to-group alignment is achieved at each iteration, rendering the overall computational cost virtually identical to that using an unweighted score. Consistency of nearly 90% was attained between structural and sequence alignments of multiple divergent globins, confirming the effectiveness of this strategy in improving the quality of multiple sequence alignment.
Developments in Algorithms for Sequence Alignment: A Review.
Chao J, Tang F, Xu L Biomolecules. 2022; 12(4).
PMID: 35454135 PMC: 9024764. DOI: 10.3390/biom12040546.
A phylogenetic approach for weighting genetic sequences.
De Maio N, Alekseyenko A, Coleman-Smith W, Pardi F, Suchard M, Tamuri A BMC Bioinformatics. 2021; 22(1):285.
PMID: 34049487 PMC: 8164272. DOI: 10.1186/s12859-021-04183-8.
Molano E, Cabrera O, Jose J, do Nascimento L, Carazzolle M, Teixeira P BMC Genomics. 2018; 19(1):58.
PMID: 29343217 PMC: 5773145. DOI: 10.1186/s12864-018-4440-4.
Tan Z, Fu Y, Sharma G, Mathews D Nucleic Acids Res. 2017; 45(20):11570-11581.
PMID: 29036420 PMC: 5714223. DOI: 10.1093/nar/gkx815.
Galloway-Pena J, Liang X, Singh K, Yadav P, Chang C, La Rosa S J Bacteriol. 2014; 197(5):882-92.
PMID: 25512313 PMC: 4325096. DOI: 10.1128/JB.02288-14.