» Articles » PMID: 23432962

Genome Sequence-based Species Delimitation with Confidence Intervals and Improved Distance Functions

Overview
Publisher Biomed Central
Specialty Biology
Date 2013 Feb 26
PMID 23432962
Citations 1811
Authors
Affiliations
Soon will be listed here.
Abstract

Background: For the last 25 years species delimitation in prokaryotes (Archaea and Bacteria) was to a large extent based on DNA-DNA hybridization (DDH), a tedious lab procedure designed in the early 1970s that served its purpose astonishingly well in the absence of deciphered genome sequences. With the rapid progress in genome sequencing time has come to directly use the now available and easy to generate genome sequences for delimitation of species. GBDP (Genome Blast Distance Phylogeny) infers genome-to-genome distances between pairs of entirely or partially sequenced genomes, a digital, highly reliable estimator for the relatedness of genomes. Its application as an in-silico replacement for DDH was recently introduced. The main challenge in the implementation of such an application is to produce digital DDH values that must mimic the wet-lab DDH values as close as possible to ensure consistency in the Prokaryotic species concept.

Results: Correlation and regression analyses were used to determine the best-performing methods and the most influential parameters. GBDP was further enriched with a set of new features such as confidence intervals for intergenomic distances obtained via resampling or via the statistical models for DDH prediction and an additional family of distance functions. As in previous analyses, GBDP obtained the highest agreement with wet-lab DDH among all tested methods, but improved models led to a further increase in the accuracy of DDH prediction. Confidence intervals yielded stable results when inferred from the statistical models, whereas those obtained via resampling showed marked differences between the underlying distance functions.

Conclusions: Despite the high accuracy of GBDP-based DDH prediction, inferences from limited empirical data are always associated with a certain degree of uncertainty. It is thus crucial to enrich in-silico DDH replacements with confidence-interval estimation, enabling the user to statistically evaluate the outcomes. Such methodological advancements, easily accessible through the web service at http://ggdc.dsmz.de, are crucial steps towards a consistent and truly genome sequence-based classification of microorganisms.

Citing Articles

Description of Heterorhabditis americana n. sp. (Rhabditida, Heterorhabditidae), a new entomopathogenic nematode species isolated in North America.

Machado R, Abolafia J, Robles M, Ruiz-Cuenca A, Bhat A, Shokoohi E Parasit Vectors. 2025; 18(1):101.

PMID: 40069896 PMC: 11899345. DOI: 10.1186/s13071-025-06702-5.


sp. nov., sp. nov. and sp. nov.: three members of group 1 .

McKnight D, Wong-Bajracharya J, Okoh E, Snijders F, Lidbetter F, Webster J Int J Syst Evol Microbiol. 2025; 75(3).

PMID: 40063667 PMC: 11893732. DOI: 10.1099/ijsem.0.006686.


gen. nov., sp. nov., isolated from a secondary infected root canal in the human oral cavity.

Bartsch S, Wittmer A, Weber A, Neumann-Schaal M, Wolf J, Gronow S Int J Syst Evol Microbiol. 2025; 75(3).

PMID: 40042984 PMC: 11881992. DOI: 10.1099/ijsem.0.006690.


Isolation and characterization of fMGyn-Pae01, a phiKZ-like jumbo phage infecting Pseudomonas aeruginosa.

Ranta K, Skurnik M, Kiljunen S Virol J. 2025; 22(1):55.

PMID: 40033410 PMC: 11877940. DOI: 10.1186/s12985-025-02679-w.


EvANI benchmarking workflow for evolutionary distance estimation.

Majidian S, Hwang S, Zakeri M, Langmead B bioRxiv. 2025; .

PMID: 40027788 PMC: 11870633. DOI: 10.1101/2025.02.23.639716.


References
1.
Thorne J, Kishino H . Freeing phylogenies from artifacts of alignment. Mol Biol Evol. 1992; 9(6):1148-62. DOI: 10.1093/oxfordjournals.molbev.a040783. View

2.
Henz S, Huson D, Auch A, Nieselt-Struwe K, Schuster S . Whole-genome prokaryotic phylogeny. Bioinformatics. 2004; 21(10):2329-35. DOI: 10.1093/bioinformatics/bth324. View

3.
Kurtz S, Phillippy A, Delcher A, Smoot M, Shumway M, Antonescu C . Versatile and open software for comparing large genomes. Genome Biol. 2004; 5(2):R12. PMC: 395750. DOI: 10.1186/gb-2004-5-2-r12. View

4.
Saitou N, Nei M . The neighbor-joining method: a new method for reconstructing phylogenetic trees. Mol Biol Evol. 1987; 4(4):406-25. DOI: 10.1093/oxfordjournals.molbev.a040454. View

5.
Goker M, Grimm G, Auch A, Aurahs R, Kucera M . A Clustering Optimization Strategy for Molecular Taxonomy Applied to Planktonic Foraminifera SSU rDNA. Evol Bioinform Online. 2010; 6:97-112. PMC: 2964048. DOI: 10.4137/ebo.s5504. View