Computational Methods for Gene Orthology Inference

Overview
Journal: Brief Bioinform
Specialty: Biology
Date: 2011 Jun 22
PMID: 21690100
Citations: 118
Abstract

Accurate inference of orthologous genes is a prerequisite for most comparative genomics studies and is also important for functional annotation of new genomes. Identification of orthologous gene sets typically involves phylogenetic tree analysis, heuristic algorithms based on sequence conservation, synteny analysis, or some combination of these approaches. The most direct tree-based methods typically rely on the comparison of an individual gene tree with a species tree. Once the two trees are accurately constructed, orthologs follow directly from the definition of orthology: they are those homologs that are related by speciation, rather than gene duplication, at their most recent point of origin. Although ideal in principle for orthology identification, phylogenetic trees are computationally expensive to construct for large numbers of genes and genomes, and they often contain errors, especially at large evolutionary distances. Moreover, in many organisms, in particular prokaryotes and viruses, evolution does not appear to have followed a simple 'tree-like' mode, which makes conventional tree reconciliation inapplicable. Other, heuristic methods identify probable orthologs as the closest homologous pairs or groups of genes in a set of organisms. These approaches are faster and easier to automate than tree-based methods, and efficient implementations based on graph-theoretical algorithms enable comparisons of thousands of genomes. Comparisons of the tree-based and heuristic approaches show that, despite conceptual differences, they produce similar sets of orthologs, especially at short evolutionary distances. Synteny can also aid in the identification of orthologs. Often, tree-based, sequence similarity-based, and synteny-based approaches can be combined into flexible hybrid methods.
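
To make the heuristic idea of "closest homologous pairs" concrete, the following is a minimal sketch of one common form of it, the reciprocal (bidirectional) best hit criterion between two genomes. It is an illustration only, not the procedure of any particular tool discussed in the article. The gene names, similarity scores, and the function names `best_hits` and `reciprocal_best_hits` are hypothetical; in practice the scores would come from a sequence similarity search.

```python
def best_hits(scores):
    """Map each query gene to its highest-scoring subject gene.

    `scores` maps (query, subject) pairs to a similarity score
    (higher means more similar). Ties keep the first pair seen.
    """
    best = {}
    for (query, subject), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (subject, score)
    return {query: subject for query, (subject, _) in best.items()}


def reciprocal_best_hits(a_vs_b, b_vs_a):
    """Return sorted (gene_a, gene_b) pairs that are each other's best hit."""
    best_ab = best_hits(a_vs_b)
    best_ba = best_hits(b_vs_a)
    return sorted(
        (a, b) for a, b in best_ab.items() if best_ba.get(b) == a
    )


# Hypothetical similarity scores between genes of genome A and genome B.
a_vs_b = {
    ("a1", "b1"): 250.0,
    ("a1", "b2"): 90.0,
    ("a2", "b2"): 180.0,
    ("a3", "b2"): 200.0,
}
b_vs_a = {
    ("b1", "a1"): 245.0,
    ("b2", "a3"): 198.0,
    ("b2", "a2"): 175.0,
}

pairs = reciprocal_best_hits(a_vs_b, b_vs_a)
print(pairs)  # [('a1', 'b1'), ('a3', 'b2')]
```

In this toy input, `a2` finds `b2` as its best hit, but `b2` prefers `a3`, so `a2` is left unpaired. Cases like this, which can arise from lineage-specific duplications or gene loss, are one reason pairwise best-hit criteria are often extended to groups of genes or combined with tree-based and synteny evidence.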
