» Articles » PMID: 21082430

Structure-guided Rule-based Annotation of Protein Functional Sites in UniProt Knowledgebase

Overview
Specialty Molecular Biology
Date 2010 Nov 18
PMID 21082430
Citations 6
Authors
Affiliations
Soon will be listed here.
Abstract

The rapid growth of protein sequence databases has necessitated the development of methods to computationally derive annotation for uncharacterized entries. Most such methods focus on "global" annotation, such as molecular function or biological process. Methods to supply high-accuracy "local" annotation to functional sites based on structural information at the level of individual amino acids are relatively rare. In this chapter we will describe a method we have developed for annotation of functional residues within experimentally-uncharacterized proteins that relies on position-specific site annotation rules (PIR Site Rules) derived from structural and experimental information. These PIR Site Rules are manually defined to allow for conditional propagation of annotation. Each rule specifies a tripartite set of conditions whereby candidates for annotation must pass a whole-protein classification test (that is, have end-to-end match to a whole-protein-based HMM), match a site-specific profile HMM and, finally, match functionally and structurally characterized residues of a template. Positive matches trigger the appropriate annotation for active site residues, binding site residues, modified residues, or other functionally important amino acids. The strict criteria used in this process have rendered high-confidence annotation suitable for UniProtKB/Swiss-Prot features.

Citing Articles

Quantifying microbial guilds.

Rivas-Santisteban J, Yubero P, Robaina-Estevez S, Gonzalez J, Tamames J, Pedros-Alio C ISME Commun. 2024; 4(1):ycae042.

PMID: 38707845 PMC: 11069341. DOI: 10.1093/ismeco/ycae042.


PIRSitePredict for protein functional site prediction using position-specific rules.

Chen C, Wang Q, Huang H, Vinayaka C, Garavelli J, Arighi C Database (Oxford). 2019; 2019.

PMID: 30805646 PMC: 6389862. DOI: 10.1093/database/baz026.


UniProt: a hub for protein information.

Nucleic Acids Res. 2014; 43(Database issue):D204-12.

PMID: 25348405 PMC: 4384041. DOI: 10.1093/nar/gku989.


Structural and functional studies of S-adenosyl-L-methionine binding proteins: a ligand-centric approach.

Gana R, Rao S, Huang H, Wu C, Vasudevan S BMC Struct Biol. 2013; 13:6.

PMID: 23617634 PMC: 3662625. DOI: 10.1186/1472-6807-13-6.


HAMAP in 2013, new developments in the protein family classification and annotation system.

Pedruzzi I, Rivoire C, Auchincloss A, Coudert E, Keller G, de Castro E Nucleic Acids Res. 2012; 41(Database issue):D584-9.

PMID: 23193261 PMC: 3531088. DOI: 10.1093/nar/gks1157.