» Articles » PMID: 10592175

The COG Database: a Tool for Genome-scale Analysis of Protein Functions and Evolution

Overview
Specialty Biochemistry
Date 1999 Dec 11
PMID 10592175
Citations 2145
Authors
Affiliations
Soon will be listed here.
Abstract

Rational classification of proteins encoded in sequenced genomes is critical for making the genome sequences maximally useful for functional and evolutionary studies. The database of Clusters of Orthologous Groups of proteins (COGs) is an attempt on a phylogenetic classification of the proteins encoded in 21 complete genomes of bacteria, archaea and eukaryotes (http://www. ncbi.nlm. nih.gov/COG). The COGs were constructed by applying the criterion of consistency of genome-specific best hits to the results of an exhaustive comparison of all protein sequences from these genomes. The database comprises 2091 COGs that include 56-83% of the gene products from each of the complete bacterial and archaeal genomes and approximately 35% of those from the yeast Saccharomyces cerevisiae genome. The COG database is accompanied by the COGNITOR program that is used to fit new proteins into the COGs and can be applied to functional and phylogenetic annotation of newly sequenced genomes.

Citing Articles

Genome Sequence, Comparative Genome Analysis, and Expression Profiling of the Chitinase GH18 Gene Family in Bd01.

Zhu T, Hussain M, Ning J, Chen X, Shi C, Yang D Int J Mol Sci. 2025; 26(5).

PMID: 40076665 PMC: 11900538. DOI: 10.3390/ijms26052031.


Metabolome and Transcriptome Analyses Reveal the Correlation Between Fructan Changes and Phytohormone Regulation During Tuber Sprouting of L.

Wen Y, Zhou Z, Guo X, Li J, Wang G, Sun X Int J Mol Sci. 2025; 26(5).

PMID: 40076491 PMC: 11899686. DOI: 10.3390/ijms26051864.


Chromosome-level genome assembly of Jaguar guapote (Parachromis manguensis) by massive parallel sequencing.

Cao J, Tong Y, Xiao Z, Chen H, Liu Z Sci Data. 2025; 12(1):411.

PMID: 40064893 PMC: 11894119. DOI: 10.1038/s41597-025-04752-z.


Cyanobacterial circadian regulation enhances bioproduction under subjective nighttime through rewiring of carbon partitioning dynamics, redox balance orchestration, and cell cycle modulation.

Gilliam A, Sadler N, Li X, Garcia M, Johnson Z, Velickovic M Microb Cell Fact. 2025; 24(1):56.

PMID: 40055679 PMC: 11889915. DOI: 10.1186/s12934-025-02665-5.


Tabrizicola caldifontis sp. nov., Isolated from Hot Spring Sediment Sample.

Habib N, Khan I, Saqib M, Hejazi M, Tarhriz V, Jan S Curr Microbiol. 2025; 82(4):172.

PMID: 40050427 DOI: 10.1007/s00284-025-04156-7.


References
1.
FITCH W . Distinguishing homologous from analogous proteins. Syst Zool. 1970; 19(2):99-113. View

2.
Riley M . Functions of the gene products of Escherichia coli. Microbiol Rev. 1993; 57(4):862-952. PMC: 372942. DOI: 10.1128/mr.57.4.862-952.1993. View

3.
Thompson J, Higgins D, Gibson T . CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994; 22(22):4673-80. PMC: 308517. DOI: 10.1093/nar/22.22.4673. View

4.
FITCH W . Uses for evolutionary trees. Philos Trans R Soc Lond B Biol Sci. 1995; 349(1327):93-102. DOI: 10.1098/rstb.1995.0095. View

5.
Koonin E . Genome sequences: genome sequence of a model prokaryote. Curr Biol. 1997; 7(10):R656-9. DOI: 10.1016/s0960-9822(06)00328-9. View