» Articles » PMID: 17942413

EggNOG: Automated Construction and Annotation of Orthologous Groups of Genes

Overview
Specialty Biochemistry
Date 2007 Oct 19
PMID 17942413
Citations 261
Authors
Affiliations
Soon will be listed here.
Abstract

The identification of orthologous genes forms the basis for most comparative genomics studies. Existing approaches either lack functional annotation of the identified orthologous groups, hampering the interpretation of subsequent results, or are manually annotated and thus lag behind the rapid sequencing of new genomes. Here we present the eggNOG database ('evolutionary genealogy of genes: Non-supervised Orthologous Groups'), which contains orthologous groups constructed from Smith-Waterman alignments through identification of reciprocal best matches and triangular linkage clustering. Applying this procedure to 312 bacterial, 26 archaeal and 35 eukaryotic genomes yielded 43 582 course-grained orthologous groups of which 9724 are extended versions of those from the original COG/KOG database. We also constructed more fine-grained groups for selected subsets of organisms, such as the 19 914 mammalian orthologous groups. We automatically annotated our non-supervised orthologous groups with functional descriptions, which were derived by identifying common denominators for the genes based on their individual textual descriptions, annotated functional categories, and predicted protein domains. The orthologous groups in eggNOG contain 1 241 751 genes and provide at least a broad functional description for 77% of them. Users can query the resource for individual genes via a web interface or download the complete set of orthologous groups at http://eggnog.embl.de.

Citing Articles

Genome-Wide Identification and Analysis of the Gene Family in .

Gong Z, Wu X, Luo Y, Zhou T, Yang Z, Wu Y Curr Issues Mol Biol. 2025; 47(2).

PMID: 39996821 PMC: 11854332. DOI: 10.3390/cimb47020100.


A metric and its derived protein network for evaluation of ortholog database inconsistency.

Yang W, Ji J, Fang G BMC Bioinformatics. 2025; 26(1):6.

PMID: 39773281 PMC: 11707888. DOI: 10.1186/s12859-024-06023-x.


Microbial community structure and functional traits involved in the adaptation of culturable bacteria within the gut of amphipods from the deepest ocean.

Cui Y, Xiao Y, Wang Z, Ji P, Zhang C, Li Y Microbiol Spectr. 2024; 13(1):e0072324.

PMID: 39655934 PMC: 11705852. DOI: 10.1128/spectrum.00723-24.


Chromosome-level genome assembly of a stored-product psocid, Liposcelis tricolor (Psocodea: Liposcelididae).

Jiang S, Chen Y, Sun S, Smagghe G, Wang J, Wei D Sci Data. 2024; 11(1):1310.

PMID: 39622886 PMC: 11612421. DOI: 10.1038/s41597-024-04179-y.


Increased rumen Prevotella enhances BCAA synthesis, leading to synergistically increased skeletal muscle in myostatin-knockout cattle.

Hai C, Hao Z, Bu L, Lei J, Liu X, Zhao Y Commun Biol. 2024; 7(1):1575.

PMID: 39592704 PMC: 11599727. DOI: 10.1038/s42003-024-07252-9.


References
1.
Wapinski I, Pfeffer A, Friedman N, Regev A . Automatic genome-wide reconstruction of phylogenetic gene trees. Bioinformatics. 2007; 23(13):i549-58. DOI: 10.1093/bioinformatics/btm193. View

2.
Koonin E . Orthologs, paralogs, and evolutionary genomics. Annu Rev Genet. 2005; 39:309-38. DOI: 10.1146/annurev.genet.39.073003.114725. View

3.
Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita K, Itoh M, Kawashima S . From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2005; 34(Database issue):D354-7. PMC: 1347464. DOI: 10.1093/nar/gkj102. View

4.
Ashburner M, Ball C, Blake J, Botstein D, Butler H, Cherry J . Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000; 25(1):25-9. PMC: 3037419. DOI: 10.1038/75556. View

5.
Tatusov R, Koonin E, Lipman D . A genomic perspective on protein families. Science. 1997; 278(5338):631-7. DOI: 10.1126/science.278.5338.631. View