» Articles » PMID: 21988835

Fast, Scalable Generation of High-quality Protein Multiple Sequence Alignments Using Clustal Omega

Overview
Journal Mol Syst Biol
Specialty Molecular Biology
Date 2011 Oct 13
PMID 21988835
Citations 7466
Authors
Affiliations
Soon will be listed here.
Abstract

Multiple sequence alignments are fundamental to many sequence analysis methods. Most alignments are computed using the progressive alignment heuristic. These methods are starting to become a bottleneck in some analysis pipelines when faced with data sets of the size of many thousands of sequences. Some methods allow computation of larger data sets while sacrificing quality, and others produce high-quality alignments, but scale badly with the number of sequences. In this paper, we describe a new program called Clustal Omega, which can align virtually any number of protein sequences quickly and that delivers accurate alignments. The accuracy of the package on smaller test cases is similar to that of the high-quality aligners. On larger data sets, Clustal Omega outperforms other packages in terms of execution time and quality. Clustal Omega also has powerful features for adding sequences to and exploiting information in existing alignments, making use of the vast amount of precomputed information in public databases like Pfam.

Citing Articles

De Novo Design of Large Polypeptides Using a Lightweight Diffusion Model Integrating LSTM and Attention Mechanism Under Per-Residue Secondary Structure Constraints.

Liao S, Xu G, Jin L, Ma J Molecules. 2025; 30(5).

PMID: 40076339 PMC: 11902264. DOI: 10.3390/molecules30051116.


Mapping variants in thyroid hormone transporter MCT8 to disease severity by genomic, phenotypic, functional, structural and deep learning integration.

Groeneweg S, van Geest F, Martin M, Dias M, Frazer J, Medina-Gomez C Nat Commun. 2025; 16(1):2479.

PMID: 40075072 PMC: 11904026. DOI: 10.1038/s41467-025-56628-w.


Genomic and proteomic analyses of Nus-dependent non-lambdoid phages reveal a novel coliphage group prevalent in gut: mEp.

Negrete-Mendez H, Valencia-Toxqui G, Sepulveda-Robles O, Rios-Castro E, Hurtado-Cortes J, Flores V Front Microbiol. 2025; 16:1480411.

PMID: 40066275 PMC: 11893012. DOI: 10.3389/fmicb.2025.1480411.


Phylogenomic Analysis Reveals Evolutionary Relationships of Tropical Drosophilidae: From to .

Detcharoen M, Pramual P, Nilsai A Ecol Evol. 2025; 15(3):e71100.

PMID: 40065921 PMC: 11893110. DOI: 10.1002/ece3.71100.


Investigation of the Blood Microbiome in Horses With Fever of Unknown Origin.

Sun Y, Yu Y, Castillo X, Anderson R, Wang M, Sun Q Vet Med Sci. 2025; 11(2):e70272.

PMID: 40065594 PMC: 11893731. DOI: 10.1002/vms3.70272.


References
1.
LARKIN M, Blackshields G, Brown N, Chenna R, McGettigan P, McWilliam H . Clustal W and Clustal X version 2.0. Bioinformatics. 2007; 23(21):2947-8. DOI: 10.1093/bioinformatics/btm404. View

2.
Finn R, Mistry J, Tate J, Coggill P, Heger A, Pollington J . The Pfam protein families database. Nucleic Acids Res. 2009; 38(Database issue):D211-22. PMC: 2808889. DOI: 10.1093/nar/gkp985. View

3.
Bradley R, Roberts A, Smoot M, Juvekar S, Do J, Dewey C . Fast statistical alignment. PLoS Comput Biol. 2009; 5(5):e1000392. PMC: 2684580. DOI: 10.1371/journal.pcbi.1000392. View

4.
Hogeweg P, Hesper B . The alignment of sets of sequences and the construction of phyletic trees: an integrated method. J Mol Evol. 1984; 20(2):175-86. DOI: 10.1007/BF02257378. View

5.
Mizuguchi K, Deane C, Blundell T, Overington J . HOMSTRAD: a database of protein structure alignments for homologous families. Protein Sci. 1998; 7(11):2469-71. PMC: 2143859. DOI: 10.1002/pro.5560071126. View