PMID: 21109532

CDD: a Conserved Domain Database for the Functional Annotation of Proteins

Abstract

NCBI's Conserved Domain Database (CDD) is a resource for annotating protein sequences with the locations of conserved domain footprints and with functional sites inferred from these footprints. CDD includes manually curated domain models that use protein 3D structure to refine the models and to provide insights into sequence/structure/function relationships. Manually curated models are organized hierarchically if they describe domain families that are clearly related by common descent. Because CDD also imports domain family models from a variety of external sources, it is a partially redundant collection. To simplify protein annotation, redundant models and models describing homologous families are clustered into superfamilies. By default, domain footprints are annotated with the corresponding superfamily designation, on top of which specific annotation may indicate high-confidence assignment of family membership. Pre-computed domain annotation is available for proteins in the Entrez/Protein dataset, and a novel interface, Batch CD-Search, allows the computation and download of annotation for large sets of protein queries. CDD can be accessed via http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.
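
The two-level annotation described in the abstract (a superfamily label by default, with a specific family label added only for high-confidence hits) can be illustrated with a small sketch. This is not CDD's actual algorithm or data format: the `Hit` and `Annotation` types, the `annotate` function, the model and superfamily names, and the per-model `specific_threshold` cutoff are all hypothetical and exist only to make the idea concrete.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Hit:
    """One domain-model match on a protein (all fields are illustrative)."""
    model: str                 # hypothetical model identifier
    superfamily: str           # hypothetical cluster the model belongs to
    start: int                 # first residue of the footprint
    end: int                   # last residue of the footprint
    score: float               # match score, higher is better
    specific_threshold: float  # assumed cutoff for a high-confidence family hit


@dataclass(frozen=True)
class Annotation:
    start: int
    end: int
    superfamily: str
    specific_model: Optional[str]  # None when only the superfamily is assigned


def overlaps(a: Hit, b: Hit) -> bool:
    """True if the two footprints share at least one residue."""
    return a.start <= b.end and b.start <= a.end


def annotate(hits: List[Hit]) -> List[Annotation]:
    """Keep the best hit per overlapping superfamily footprint.

    Lower-scoring hits from the same superfamily that overlap a kept hit
    are treated as redundant. Every kept footprint gets a superfamily
    label; the specific model is reported only if the score reaches the
    (assumed) per-model threshold.
    """
    kept: List[Hit] = []
    for hit in sorted(hits, key=lambda h: h.score, reverse=True):
        redundant = any(
            k.superfamily == hit.superfamily and overlaps(k, hit) for k in kept
        )
        if not redundant:
            kept.append(hit)

    annotations = []
    for hit in sorted(kept, key=lambda h: h.start):
        specific = hit.model if hit.score >= hit.specific_threshold else None
        annotations.append(
            Annotation(hit.start, hit.end, hit.superfamily, specific)
        )
    return annotations


# Made-up hits: two redundant models from one superfamily, one weak hit from another.
hits = [
    Hit("model_A1", "sf_A", 10, 120, 95.0, 80.0),
    Hit("model_A2", "sf_A", 15, 118, 60.0, 70.0),
    Hit("model_B1", "sf_B", 200, 310, 40.0, 55.0),
]
result = annotate(hits)
for annotation in result:
    print(annotation)
```

In this toy input, the two overlapping `sf_A` hits collapse into a single footprint that also carries a specific model, while the `sf_B` footprint is labeled with its superfamily only because its score falls below the assumed cutoff.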

Citing Articles

Aluminum-activated malate transporter family member CsALMT6 mediates fluoride resistance in tea plants.

Li Q, Zhang R, Hu X, Ni D, Chen Y, Wang M Hortic Res. 2025; 12(4):uhae353.

PMID: 40046042 PMC: 11879333. DOI: 10.1093/hr/uhae353.


Deciphering Regulatory Networks Governing Seedling Emergence in Deep-Sown Direct-Seeded Rice Cultivation.

Singh J, Sandhu N, Kumar A, Raigar O, Bains S, Augustine G Rice (N Y). 2025; 18(1):5.

PMID: 39918681 PMC: 11806174. DOI: 10.1186/s12284-025-00760-0.


Re-Examination Characterization and Screening of Stripe Rust Resistance Gene of Wheat Gene Family Based on the Transcriptome in Xinchun 32.

Sun T, Yan N, Liu Q, Bai T, Gao H, Chen J Int J Mol Sci. 2025; 26(2).

PMID: 39859355 PMC: 11766189. DOI: 10.3390/ijms26020640.


Exploring the Structural Diversity and Biotechnological Potential of the Rhodophyte Phycolectome.

Rodrigues E, Verza F, Nishimura F, Beleboni R, Hermans C, Janssens K Mar Drugs. 2025; 23(1).

PMID: 39852510 PMC: 11766507. DOI: 10.3390/md23010008.


Ammonium Transporter 1 Gene Family in Pomegranate: Genome-Wide Analysis and Expression Profiles in Response to Salt Stress.

Omari Alzahrani F Curr Issues Mol Biol. 2025; 47(1).

PMID: 39852174 PMC: 11764171. DOI: 10.3390/cimb47010059.

