NRL-3D: a Sequence-structure Database Derived from the Protein Data Bank (PDB) and Searchable Within the PIR Environment
Overview
Biotechnology
Authors
Affiliations
The protein identification resource (PIR) and the Brookhaven National Laboratory protein data bank (PDB) are well-known databases for primary sequences and three-dimensional structures of proteins, respectively. Lesk et al, have compared the primary sequences in these two databases and concluded that the sequences in them are not redundant. Moreover, PIR programs can not be used directly on PDB files to access primary sequences because the FORMATS of these two data bases are different. We have developed a sequence-structure database, called NRL-3D, from the sequences, chain identification and the residue numbers of proteins in the PDB. This new database is designed such that it can be used in conjunction with PIR programs to search and extract sequences of interest and the corresponding three-dimensional coordinates from the structures in PDB.
Peptide vocabulary analysis reveals ultra-conservation and homonymity in protein sequences.
Gatherer D Bioinform Biol Insights. 2010; 1:101-26.
PMID: 20066129 PMC: 2789693. DOI: 10.4137/bbi.s415.
Phylogenetic differences in content and intensity of periodic proteins.
Gatherer D, McEwan N J Mol Evol. 2005; 60(4):447-61.
PMID: 15883880 DOI: 10.1007/s00239-004-0189-2.
Gatherer D, McEwan N J Mol Evol. 2003; 57(2):149-58.
PMID: 14562959 DOI: 10.1007/s00239-002-2462-1.
The RESID Database of protein structure modifications and the NRL-3D Sequence-Structure Database.
Garavelli J, Hou Z, Pattabiraman N, Stephens R Nucleic Acids Res. 2000; 29(1):199-201.
PMID: 11125090 PMC: 29832. DOI: 10.1093/nar/29.1.199.
The protein information resource (PIR).
Barker W, Garavelli J, Huang H, McGarvey P, Orcutt B, Srinivasarao G Nucleic Acids Res. 1999; 28(1):41-4.
PMID: 10592177 PMC: 102418. DOI: 10.1093/nar/28.1.41.