PMID: 20033048

A Phylogeny-driven Genomic Encyclopaedia of Bacteria and Archaea

Abstract

Sequencing of bacterial and archaeal genomes has revolutionized our understanding of the many roles played by microorganisms. There are now nearly 1,000 completed bacterial and archaeal genomes available, most of which were chosen for sequencing on the basis of their physiology. As a result, the perspective provided by the currently available genomes is limited by a highly biased phylogenetic distribution. To explore the value added by choosing microbial genomes for sequencing on the basis of their evolutionary relationships, we have sequenced and analysed the genomes of 56 culturable species of Bacteria and Archaea selected to maximize phylogenetic coverage. Analysis of these genomes demonstrated pronounced benefits (compared to an equivalent set of genomes randomly selected from the existing database) in diverse areas including the reconstruction of phylogenetic history, the discovery of new protein families and biological properties, and the prediction of functions for known genes from other organisms. Our results strongly support the need for systematic 'phylogenomic' efforts to compile a phylogeny-driven 'Genomic Encyclopedia of Bacteria and Archaea' in order to derive maximum knowledge from existing microbial genome data as well as from genome sequences to come.

Citing Articles

A metagenomic perspective on the microbial prokaryotic genome census.

Wu D, Seshadri R, Kyrpides N, Ivanova N. Sci Adv. 2025; 11(3):eadq2166.

PMID: 39823337 PMC: 11740963. DOI: 10.1126/sciadv.adq2166.


Machine learning classification of archaea and bacteria identifies novel predictive genomic features.

Bobbo T, Biscarini F, Yaddehige S, Alberghini L, Rigoni D, Bianchi N. BMC Genomics. 2024; 25(1):955.

PMID: 39402493 PMC: 11472548. DOI: 10.1186/s12864-024-10832-y.


: delineation of fungal genera based on phylogenomic analyses, genomic relatedness indices and genomics-based synapomorphies.

Liu F, Hu Z, Yurkov A, Chen X, Bao W, Ma Q. Persoonia. 2024; 52:1-21.

PMID: 39161631 PMC: 11319838. DOI: 10.3767/persoonia.2024.52.01.


Why and how to use the SeqCode.

Whitman W, Chuvochina M, Hedlund B, Konstantinidis K, Palmer M, Rodriguez-R L. mLife. 2024; 3(1):1-13.

PMID: 38827511 PMC: 11139209. DOI: 10.1002/mlf2.12092.


The diversity and ecological significance of microbial traits potentially involved in B12 biosynthesis in the global ocean.

Zhou J, Qin W, Lu X, Yang Y, Stahl D, Jiao N. mLife. 2024; 2(4):416-427.

PMID: 38818271 PMC: 10989127. DOI: 10.1002/mlf2.12095.

