» Articles » PMID: 12136088

MAFFT: a Novel Method for Rapid Multiple Sequence Alignment Based on Fast Fourier Transform

Overview
Specialty Biochemistry
Date 2002 Jul 24
PMID 12136088
Citations 6409
Authors
Affiliations
Soon will be listed here.
Abstract

A multiple sequence alignment program, MAFFT, has been developed. The CPU time is drastically reduced as compared with existing methods. MAFFT includes two novel techniques. (i) Homo logous regions are rapidly identified by the fast Fourier transform (FFT), in which an amino acid sequence is converted to a sequence composed of volume and polarity values of each amino acid residue. (ii) We propose a simplified scoring system that performs well for reducing CPU time and increasing the accuracy of alignments even for sequences having large insertions or extensions as well as distantly related sequences of similar length. Two different heuristics, the progressive method (FFT-NS-2) and the iterative refinement method (FFT-NS-i), are implemented in MAFFT. The performances of FFT-NS-2 and FFT-NS-i were compared with other methods by computer simulations and benchmark tests; the CPU time of FFT-NS-2 is drastically reduced as compared with CLUSTALW with comparable accuracy. FFT-NS-i is over 100 times faster than T-COFFEE, when the number of input sequences exceeds 60, without sacrificing the accuracy.

Citing Articles

Diversity and functional features of the root-associated bacteriome are dependent on grapevine susceptibility to Plasmopara viticola.

Duret M, Wallner A, Besaury L, Aziz A Environ Microbiome. 2025; 20(1):30.

PMID: 40087775 DOI: 10.1186/s40793-025-00690-w.


Integration of therapeutic cargo into the human genome with programmable type V-K CAST.

Liu J, Aliaga Goltsman D, Alexander L, Khayi K, Hong J, Dunham D Nat Commun. 2025; 16(1):2427.

PMID: 40082411 PMC: 11906591. DOI: 10.1038/s41467-025-57416-2.


Genomic surveillance of emerging SARS-CoV-2 Omicron variations in Tianjin Municipality, China 2022.

Gao X, Zou M, Lei Y, Tan Z, Zhuang Z, Zheng B Biosaf Health. 2025; 6(2):61-69.

PMID: 40078941 PMC: 11895026. DOI: 10.1016/j.bsheal.2024.03.001.


Biosynthesis of Gold Nanostructures and Their Virucidal Activity Against Influenza A Virus.

Contreras F, Rivero K, Rivas-Pardo J, Liendo F, Segura R, Neira N Int J Mol Sci. 2025; 26(5).

PMID: 40076560 PMC: 11899802. DOI: 10.3390/ijms26051934.


De Novo Design of Large Polypeptides Using a Lightweight Diffusion Model Integrating LSTM and Attention Mechanism Under Per-Residue Secondary Structure Constraints.

Liao S, Xu G, Jin L, Ma J Molecules. 2025; 30(5).

PMID: 40076339 PMC: 11902264. DOI: 10.3390/molecules30051116.


References
1.
McClure M, Vasi T, FITCH W . Comparative analysis of multiple protein-sequence alignment methods. Mol Biol Evol. 1994; 11(4):571-92. DOI: 10.1093/oxfordjournals.molbev.a040138. View

2.
Thompson J, Plewniak F, Poch O . A comprehensive comparison of multiple sequence alignment programs. Nucleic Acids Res. 1999; 27(13):2682-90. PMC: 148477. DOI: 10.1093/nar/27.13.2682. View

3.
Hirosawa M, Totoki Y, HOSHIDA M, Ishikawa M . Comprehensive study on iterative algorithms of multiple sequence alignment. Comput Appl Biosci. 1995; 11(1):13-8. DOI: 10.1093/bioinformatics/11.1.13. View

4.
Vogt G, Etzold T, Argos P . An assessment of amino acid exchange matrices in aligning protein sequences: the twilight zone revisited. J Mol Biol. 1995; 249(4):816-31. DOI: 10.1006/jmbi.1995.0340. View

5.
Gotoh O . A weighting system and algorithm for aligning many phylogenetically related sequences. Comput Appl Biosci. 1995; 11(5):543-51. DOI: 10.1093/bioinformatics/11.5.543. View