
A Weighting System and Algorithm for Aligning Many Phylogenetically Related Sequences

Overview
Date 1995 Oct 1
PMID 8590178
Citations 23
Abstract

Most multiple sequence alignment programs explicitly or implicitly try to optimize some score associated with the resulting alignment. Although the sum-of-pairs score is currently the most widely used, it is inappropriate when the phylogenetic relationships among the sequences to be aligned are not evenly distributed, since the contributions of densely populated groups dominate those of minor members. This paper proposes an iterative multiple sequence alignment method that optimizes a weighted sum-of-pairs score, in which the weights given to individual sequence pairs are adjusted to compensate for the biased contributions. A simple method that rapidly calculates such a set of weights for a given phylogenetic tree is presented. The multiple sequence alignment is refined through partitioning and realignment restricted to the edges of the tree. Under this restriction, fast and rigorous profile-based group-to-group alignment is achieved at each iteration, rendering the overall computational cost virtually identical to that of using an unweighted score. Consistency of nearly 90% was attained between structural and sequence alignments of multiple divergent globins, confirming the effectiveness of this strategy in improving the quality of multiple sequence alignment.
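The two ideas in the abstract, a weighted sum-of-pairs score and partitions restricted to tree edges, can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the function names, the toy match/mismatch/gap costs, the hand-picked pair weights, and the example tree are hypothetical. It does not reproduce the paper's method for deriving weights from a phylogenetic tree, nor its profile-based group-to-group alignment.

```python
from itertools import combinations


def pair_score(row_a, row_b, match=1.0, mismatch=-1.0, gap=-2.0):
    """Score two aligned rows column by column (toy costs, assumed)."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have equal length")
    total = 0.0
    for a, b in zip(row_a, row_b):
        if a == "-" and b == "-":
            continue  # a column gapped in both rows contributes nothing
        if a == "-" or b == "-":
            total += gap
        elif a == b:
            total += match
        else:
            total += mismatch
    return total


def weighted_sp_score(alignment, weights):
    """Sum of weights[(i, j)] * pair_score over all row pairs i < j."""
    return sum(
        weights[(i, j)] * pair_score(alignment[i], alignment[j])
        for i, j in combinations(range(len(alignment)), 2)
    )


def edge_partitions(edges, leaves):
    """For each edge of an unrooted tree, return the two leaf groups
    obtained by cutting that edge."""
    adjacency = {}
    for u, v in edges:
        adjacency.setdefault(u, set()).add(v)
        adjacency.setdefault(v, set()).add(u)
    leaf_set = set(leaves)
    partitions = []
    for u, v in edges:
        # Collect every node reachable from u without crossing (u, v).
        seen = {u}
        stack = [u]
        while stack:
            node = stack.pop()
            for nxt in adjacency[node]:
                if (node, nxt) in ((u, v), (v, u)):
                    continue
                if nxt not in seen:
                    seen.add(nxt)
                    stack.append(nxt)
        partitions.append((sorted(seen & leaf_set), sorted(leaf_set - seen)))
    return partitions


# Rows 0 and 1 are near-identical relatives; row 2 is the minor member.
alignment = ["ACGT", "ACGT", "A-GT"]
uniform = {(0, 1): 1.0, (0, 2): 1.0, (1, 2): 1.0}
# Hand-picked weights (an assumption) that down-weight the redundant pair.
balanced = {(0, 1): 0.5, (0, 2): 1.0, (1, 2): 1.0}

print(weighted_sp_score(alignment, uniform))   # 6.0
print(weighted_sp_score(alignment, balanced))  # 4.0

# Hypothetical star tree: leaves a, b, c joined at internal node x.
print(edge_partitions([("a", "x"), ("b", "x"), ("x", "c")], ["a", "b", "c"]))
```

With uniform weights, the closely related pair contributes 4.0 of the total 6.0; halving its weight reduces that share to 2.0 of 4.0, which is the kind of rebalancing the abstract describes. Cutting each tree edge yields exactly one two-group partition, so an iterative refinement that only realigns these groups considers as many partitions as the tree has edges.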

Citing Articles

Developments in Algorithms for Sequence Alignment: A Review.

Chao J, Tang F, Xu L. Biomolecules. 2022; 12(4).

PMID: 35454135 PMC: 9024764. DOI: 10.3390/biom12040546.


A phylogenetic approach for weighting genetic sequences.

De Maio N, Alekseyenko A, Coleman-Smith W, Pardi F, Suchard M, Tamuri A. BMC Bioinformatics. 2021; 22(1):285.

PMID: 34049487 PMC: 8164272. DOI: 10.1186/s12859-021-04183-8.


Ceratocystis cacaofunesta genome analysis reveals a large expansion of extracellular phosphatidylinositol-specific phospholipase-C genes (PI-PLC).

Molano E, Cabrera O, Jose J, do Nascimento L, Carazzolle M, Teixeira P. BMC Genomics. 2018; 19(1):58.

PMID: 29343217 PMC: 5773145. DOI: 10.1186/s12864-018-4440-4.


TurboFold II: RNA structural alignment and secondary structure prediction informed by multiple homologs.

Tan Z, Fu Y, Sharma G, Mathews D. Nucleic Acids Res. 2017; 45(20):11570-11581.

PMID: 29036420 PMC: 5714223. DOI: 10.1093/nar/gkx815.


The identification and functional characterization of WxL proteins from Enterococcus faecium reveal surface proteins involved in extracellular matrix interactions.

Galloway-Pena J, Liang X, Singh K, Yadav P, Chang C, La Rosa S. J Bacteriol. 2014; 197(5):882-92.

PMID: 25512313 PMC: 4325096. DOI: 10.1128/JB.02288-14.