» Articles » PMID: 11125126

BAliBASE (Benchmark Alignment DataBASE): Enhancements for Repeats, Transmembrane Sequences and Circular Permutations

Overview
Specialty Biochemistry
Date 2000 Jan 11
PMID 11125126
Citations 45
Authors
Affiliations
Soon will be listed here.
Abstract

BAliBASE is specifically designed to serve as an evaluation resource to address all the problems encountered when aligning complete sequences. The database contains high quality, manually constructed multiple sequence alignments together with detailed annotations. The alignments are all based on three-dimensional structural superpositions, with the exception of the transmembrane sequences. The first release provided sets of reference alignments dealing with the problems of high variability, unequal repartition and large N/C-terminal extensions and internal insertions. Here we describe version 2.0 of the database, which incorporates three new reference sets of alignments containing structural repeats, trans-membrane sequences and circular permutations to evaluate the accuracy of detection/prediction and alignment of these complex sequences. BAliBASE can be viewed at the web site http://www-igbmc.u-strasbg. fr/BioInfo/BAliBASE2/index.html or can be downloaded from ftp://ftp-igbmc.u-strasbg.fr/pub/BAliBASE2 /.

Citing Articles

Sequence Flow: interactive web application for visualizing partial order alignments.

Zdablasz K, Lisiecka A, Dojer N BMC Genomics. 2024; 25(1):973.

PMID: 39415087 PMC: 11483981. DOI: 10.1186/s12864-024-10886-y.


Embedding-based alignment: combining protein language models with dynamic programming alignment to detect structural similarities in the twilight-zone.

Pantolini L, Studer G, Pereira J, Durairaj J, Tauriello G, Schwede T Bioinformatics. 2024; 40(1).

PMID: 38175775 PMC: 10792726. DOI: 10.1093/bioinformatics/btad786.


DCAlign v1.0: aligning biological sequences using co-evolution models and informed priors.

Muntoni A, Pagnani A Bioinformatics. 2023; 39(9).

PMID: 37647658 PMC: 10491954. DOI: 10.1093/bioinformatics/btad537.


Accuracy of multiple sequence alignment methods in the reconstruction of transposable element families.

Hubley R, Wheeler T, Smit A NAR Genom Bioinform. 2022; 4(2):lqac040.

PMID: 35591887 PMC: 9112768. DOI: 10.1093/nargab/lqac040.


Application of the MAHDS Method for Multiple Alignment of Highly Diverged Amino Acid Sequences.

Kostenko D, Korotkov E Int J Mol Sci. 2022; 23(7).

PMID: 35409125 PMC: 8998981. DOI: 10.3390/ijms23073764.


References
1.
Bateman A, Birney E, Durbin R, Eddy S, Howe K, Sonnhammer E . The Pfam protein families database. Nucleic Acids Res. 1999; 28(1):263-6. PMC: 102420. DOI: 10.1093/nar/28.1.263. View

2.
Gromiha M . A simple method for predicting transmembrane alpha helices with better accuracy. Protein Eng. 1999; 12(7):557-61. DOI: 10.1093/protein/12.7.557. View

3.
Andrade M, Ponting C, Gibson T, Bork P . Homology-based method for identification of protein repeats using statistical significance estimates. J Mol Biol. 2000; 298(3):521-37. DOI: 10.1006/jmbi.2000.3684. View

4.
Lio P, Vannucci M . Wavelet change-point prediction of transmembrane proteins. Bioinformatics. 2000; 16(4):376-82. DOI: 10.1093/bioinformatics/16.4.376. View

5.
Thompson J, Plewniak F, Thierry J, Poch O . DbClustal: rapid and reliable global multiple alignments of protein sequences detected by database searches. Nucleic Acids Res. 2000; 28(15):2919-26. PMC: 102675. DOI: 10.1093/nar/28.15.2919. View