» Articles » PMID: 21622657

BAR-PLUS: the Bologna Annotation Resource Plus for Functional and Structural Annotation of Protein Sequences

Overview
Specialty Biochemistry
Date 2011 May 31
PMID 21622657
Citations 15
Authors
Affiliations
Soon will be listed here.
Abstract

We introduce BAR-PLUS (BAR(+)), a web server for functional and structural annotation of protein sequences. BAR(+) is based on a large-scale genome cross comparison and a non-hierarchical clustering procedure characterized by a metric that ensures a reliable transfer of features within clusters. In this version, the method takes advantage of a large-scale pairwise sequence comparison of 13,495,736 protein chains also including 988 complete proteomes. Available sequence annotation is derived from UniProtKB, GO, Pfam and PDB. When PDB templates are present within a cluster (with or without their SCOP classification), profile Hidden Markov Models (HMMs) are computed on the basis of sequence to structure alignment and are cluster-associated (Cluster-HMM). Therefrom, a library of 10,858 HMMs is made available for aligning even distantly related sequences for structural modelling. The server also provides pairwise query sequence-structural target alignments computed from the correspondent Cluster-HMM. BAR(+) in its present version allows three main categories of annotation: PDB [with or without SCOP (*)] and GO and/or Pfam; PDB (*) without GO and/or Pfam; GO and/or Pfam without PDB (*) and no annotation. Each category can further comprise clusters where GO and Pfam functional annotations are or are not statistically significant. BAR(+) is available at http://bar.biocomp.unibo.it/bar2.0.

Citing Articles

A large-scale assessment of sequence database search tools for homology-based protein function prediction.

Zhang C, Freddolino L Brief Bioinform. 2024; 25(4).

PMID: 39038936 PMC: 11262835. DOI: 10.1093/bib/bbae349.


Pseudo2GO: A Graph-Based Deep Learning Method for Pseudogene Function Prediction by Borrowing Information From Coding Genes.

Fan K, Zhang Y Front Genet. 2020; 11:807.

PMID: 33014009 PMC: 7461887. DOI: 10.3389/fgene.2020.00807.


Graph2GO: a multi-modal attributed network embedding method for inferring protein functions.

Fan K, Guan Y, Zhang Y Gigascience. 2020; 9(8).

PMID: 32770210 PMC: 7414417. DOI: 10.1093/gigascience/giaa081.


I-TASSER gateway: A protein structure and function prediction server powered by XSEDE.

Zheng W, Zhang C, Bell E, Zhang Y Future Gener Comput Syst. 2019; 99:73-85.

PMID: 31427836 PMC: 6699767. DOI: 10.1016/j.future.2019.04.011.


INGA 2.0: improving protein function prediction for the dark proteome.

Piovesan D, Tosatto S Nucleic Acids Res. 2019; 47(W1):W373-W378.

PMID: 31073595 PMC: 6602455. DOI: 10.1093/nar/gkz375.


References
1.
Cuff A, Sillitoe I, Lewis T, Clegg A, Rentzsch R, Furnham N . Extending CATH: increasing coverage of the protein structure universe and linking structure with function. Nucleic Acids Res. 2010; 39(Database issue):D420-6. PMC: 3013636. DOI: 10.1093/nar/gkq1001. View

2.
Wu C, Xiao C, Hou Z, Huang H, Barker W . iProClass: an integrated, comprehensive and annotated protein classification database. Nucleic Acids Res. 2000; 29(1):52-4. PMC: 29833. DOI: 10.1093/nar/29.1.52. View

3.
Kaplan N, Sasson O, Inbar U, Friedlich M, Fromer M, Fleischer H . ProtoNet 4.0: a hierarchical classification of one million protein sequences. Nucleic Acids Res. 2004; 33(Database issue):D216-8. PMC: 539961. DOI: 10.1093/nar/gki007. View

4.
Petryszak R, Kretschmann E, Wieser D, Apweiler R . The predictive power of the CluSTr database. Bioinformatics. 2005; 21(18):3604-9. DOI: 10.1093/bioinformatics/bti542. View

5.
Loewenstein Y, Portugaly E, Fromer M, Linial M . Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space. Bioinformatics. 2008; 24(13):i41-9. PMC: 2718652. DOI: 10.1093/bioinformatics/btn174. View