Improving the Practical Space and Time Efficiency of the Shortest-paths Approach to Sum-of-pairs Multiple Sequence Alignment
Overview
Molecular Biology
Authors
Affiliations
The MSA program, written and distributed in 1989, is one of the few existing programs that attempts to find optimal alignments of multiple protein or DNA sequences. The MSA program implements a branch-and-bound technique together with a variant of Dijkstra's shortest paths algorithm to prune the basic dynamic programming graph. We have made substantial improvements in the time and space usage of MSA. The improvements make feasible a variety of problem instances that were not feasible previously. On some runs we achieve an order of magnitude reduction in space usage and a significant multiplicative factor speedup in running time. To explain how these improvements work, we give a much more detailed description of MSA than has been previously available. In practice, MSA rarely produces a provably optimal alignment and we explain why.
Wang X, Dong Q, Chen G, Zhang J, Liu Y, Cai Y BMC Genomics. 2022; 23(1):416.
PMID: 35655139 PMC: 9164415. DOI: 10.1186/s12864-022-08435-6.
Post-Alignment Adjustment and Its Automation.
Xia X Genes (Basel). 2021; 12(11).
PMID: 34828415 PMC: 8623120. DOI: 10.3390/genes12111809.
Noah K, Hao J, Li L, Sun X, Foley B, Yang Q Evol Bioinform Online. 2020; 16:1176934320903735.
PMID: 32076367 PMC: 7003163. DOI: 10.1177/1176934320903735.
Genome-wide analysis of positively selected genes in seasonal and non-seasonal breeding species.
Meng Y, Zhang W, Zhou J, Liu M, Chen J, Tian S PLoS One. 2015; 10(5):e0126736.
PMID: 26000771 PMC: 4441472. DOI: 10.1371/journal.pone.0126736.
Dennehey B, Noone S, Liu W, Smith L, Churchill M, Tyler J Mol Cell Biol. 2012; 33(3):605-21.
PMID: 23184661 PMC: 3666882. DOI: 10.1128/MCB.01053-12.