» Articles » PMID: 17135200

The CATH Domain Structure Database: New Protocols and Classification Levels Give a More Comprehensive Resource for Exploring Evolution

Overview
Specialty Biochemistry
Date 2006 Dec 1
PMID 17135200
Citations 140
Authors
Affiliations
Soon will be listed here.
Abstract

We report the latest release (version 3.0) of the CATH protein domain database (http://www.cathdb.info). There has been a 20% increase in the number of structural domains classified in CATH, up to 86 151 domains. Release 3.0 comprises 1110 fold groups and 2147 homologous superfamilies. To cope with the increases in diverse structural homologues being determined by the structural genomics initiatives, more sensitive methods have been developed for identifying boundaries in multi-domain proteins and for recognising homologues. The CATH classification update is now being driven by an integrated pipeline that links these automated procedures with validation steps, that have been made easier by the provision of information rich web pages summarising comparison scores and relevant links to external sites for each domain being classified. An analysis of the population of domains in the CATH hierarchy and several domain characteristics are presented for version 3.0. We also report an update of the CATH Dictionary of homologous structures (CATH-DHS) which now contains multiple structural alignments, consensus information and functional annotations for 1459 well populated superfamilies in CATH. CATH is directly linked to the Gene3D database which is a projection of CATH structural data onto approximately 2 million sequences in completed genomes and UniProt.

Citing Articles

Use of phosphotyrosine-containing peptides to target SH2 domains: Antagonist peptides of the Crk/CrkL-p130Cas axis.

Douglas J, Johnson D, Roy A, Park T Methods Enzymol. 2024; 698:301-342.

PMID: 38886037 PMC: 11542726. DOI: 10.1016/bs.mie.2024.04.013.


Identification of a covert evolutionary pathway between two protein folds.

Chakravarty D, Sreenivasan S, Swint-Kruse L, Porter L Nat Commun. 2023; 14(1):3177.

PMID: 37264049 PMC: 10235069. DOI: 10.1038/s41467-023-38519-0.


Three-dimensional Structure Databases of Biological Macromolecules.

Waman V, Orengo C, Kleywegt G, Lesk A Methods Mol Biol. 2022; 2449:43-91.

PMID: 35507259 DOI: 10.1007/978-1-0716-2095-3_3.


Multi-layer sequential network analysis improves protein 3D structural classification.

Newaz K, Piland J, Clark P, Emrich S, Li J, Milenkovic T Proteins. 2022; 90(9):1721-1731.

PMID: 35441395 PMC: 9356989. DOI: 10.1002/prot.26349.


Structural bioinformatic analysis of DsbA proteins and their pathogenicity associated substrates.

Santos-Martin C, Wang G, Subedi P, Hor L, Totsika M, Paxman J Comput Struct Biotechnol J. 2021; 19:4725-4737.

PMID: 34504665 PMC: 8405906. DOI: 10.1016/j.csbj.2021.08.018.


References
1.
Kanehisa M, Goto S . KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 1999; 28(1):27-30. PMC: 102409. DOI: 10.1093/nar/28.1.27. View

2.
Bairoch A, Apweiler R . The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 1999; 28(1):45-8. PMC: 102476. DOI: 10.1093/nar/28.1.45. View

3.
Berman H, Westbrook J, Feng Z, Gilliland G, Bhat T, Weissig H . The Protein Data Bank. Nucleic Acids Res. 1999; 28(1):235-42. PMC: 102472. DOI: 10.1093/nar/28.1.235. View

4.
Bray J, Todd A, Pearl F, Thornton J, Orengo C . The CATH Dictionary of Homologous Superfamilies (DHS): a consensus approach for identifying distant structural homologues. Protein Eng. 2000; 13(3):153-65. DOI: 10.1093/protein/13.3.153. View

5.
Pearl F, Bennett C, Bray J, Harrison A, Martin N, Shepherd A . The CATH database: an extended protein family resource for structural and functional genomics. Nucleic Acids Res. 2003; 31(1):452-5. PMC: 165509. DOI: 10.1093/nar/gkg062. View