» Articles » PMID: 2849754

Multiple Sequence Alignment with Hierarchical Clustering

Overview
Specialty Biochemistry
Date 1988 Nov 25
PMID 2849754
Citations 1934
Authors
Affiliations
Soon will be listed here.
Abstract

An algorithm is presented for the multiple alignment of sequences, either proteins or nucleic acids, that is both accurate and easy to use on microcomputers. The approach is based on the conventional dynamic-programming method of pairwise alignment. Initially, a hierarchical clustering of the sequences is performed using the matrix of the pairwise alignment scores. The closest sequences are aligned creating groups of aligned sequences. Then close groups are aligned until all sequences are aligned in one group. The pairwise alignments included in the multiple alignment form a new matrix that is used to produce a hierarchical clustering. If it is different from the first one, iteration of the process can be performed. The method is illustrated by an example: a global alignment of 39 sequences of cytochrome c.

Citing Articles

Correcting promoter and beta-lactamase ORF orientation in a widely-used retroviral plasmid to restore bacterial growth.

Wittmann J Sci Rep. 2025; 15(1):8348.

PMID: 40069388 PMC: 11897316. DOI: 10.1038/s41598-025-93222-y.


Structural basis of Nipah virus RNA synthesis.

Sala F, Ditter K, Dybkov O, Urlaub H, Hillen H Nat Commun. 2025; 16(1):2261.

PMID: 40050611 PMC: 11885841. DOI: 10.1038/s41467-025-57219-5.


Fine mapping of the unique Ur-11 gene conferring broad resistance to the rust pathogen of common bean.

Valentini G, Hurtado-Gonzales O, Xavier L, He R, Gill U, Song Q Theor Appl Genet. 2025; 138(3):64.

PMID: 40035870 DOI: 10.1007/s00122-025-04856-5.


Genome resequencing and comparative analysis of Streptococcus mutans in adults with high and low caries risk.

Ucuncu M, Ucuncu M, Karacan I, Topcuoglu N Sci Data. 2025; 12(1):313.

PMID: 39984482 PMC: 11845470. DOI: 10.1038/s41597-025-04399-w.


Identification of Cocconeis neothumensis var. marina using a polyphasic approach including ultrastructure and gene annotation.

Somma E, Costantini M, Pennesi C, Ruocco N, De Castro O, Terlizzi A PLoS One. 2025; 20(2):e0317360.

PMID: 39946438 PMC: 11825096. DOI: 10.1371/journal.pone.0317360.


References
1.
NEEDLEMAN S, Wunsch C . A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol. 1970; 48(3):443-53. DOI: 10.1016/0022-2836(70)90057-4. View

2.
Barton G, STERNBERG M . Evaluation and improvements in the automatic alignment of protein sequences. Protein Eng. 1987; 1(2):89-94. DOI: 10.1093/protein/1.2.89. View

3.
Lipman D, Pearson W . Rapid and sensitive protein similarity searches. Science. 1985; 227(4693):1435-41. DOI: 10.1126/science.2983426. View

4.
Murata M, Richardson J, Sussman J . Simultaneous comparison of three protein sequences. Proc Natl Acad Sci U S A. 1985; 82(10):3073-7. PMC: 397716. DOI: 10.1073/pnas.82.10.3073. View

5.
Bains W . MULTAN: a program to align multiple DNA sequences. Nucleic Acids Res. 1986; 14(1):159-77. PMC: 339364. DOI: 10.1093/nar/14.1.159. View