» Articles » PMID: 17586664

Naive Bayesian Classifier for Rapid Assignment of RRNA Sequences into the New Bacterial Taxonomy

Overview
Date 2007 Jun 26
PMID 17586664
Citations 7833
Authors
Affiliations
Soon will be listed here.
Abstract

The Ribosomal Database Project (RDP) Classifier, a naïve Bayesian classifier, can rapidly and accurately classify bacterial 16S rRNA sequences into the new higher-order taxonomy proposed in Bergey's Taxonomic Outline of the Prokaryotes (2nd ed., release 5.0, Springer-Verlag, New York, NY, 2004). It provides taxonomic assignments from domain to genus, with confidence estimates for each assignment. The majority of classifications (98%) were of high estimated confidence (> or = 95%) and high accuracy (98%). In addition to being tested with the corpus of 5,014 type strain sequences from Bergey's outline, the RDP Classifier was tested with a corpus of 23,095 rRNA sequences as assigned by the NCBI into their alternative higher-order taxonomy. The results from leave-one-out testing on both corpora show that the overall accuracies at all levels of confidence for near-full-length and 400-base segments were 89% or above down to the genus level, and the majority of the classification errors appear to be due to anomalies in the current taxonomies. For shorter rRNA segments, such as those that might be generated by pyrosequencing, the error rate varied greatly over the length of the 16S rRNA gene, with segments around the V2 and V4 variable regions giving the lowest error rates. The RDP Classifier is suitable both for the analysis of single rRNA sequences and for the analysis of libraries of thousands of sequences. Another related tool, RDP Library Compare, was developed to facilitate microbial-community comparison based on 16S rRNA gene sequence libraries. It combines the RDP Classifier with a statistical test to flag taxa differentially represented between samples. The RDP Classifier and RDP Library Compare are available online at http://rdp.cme.msu.edu/.

Citing Articles

Unraveling the Mechanism of the Endophytic Bacterial Strain GDW1 in Enhancing Tomato Plant Growth Through Modulation of the Host Transcriptome and Bacteriome.

Ahmed W, Wang Y, Ji W, Liu S, Zhou S, Pan J Int J Mol Sci. 2025; 26(5).

PMID: 40076548 PMC: 11900241. DOI: 10.3390/ijms26051922.


Upper and lower airway microbiota across infancy and childhood.

Hernandez-Leyva A, Rosen A, Tomera C, Lin E, Akaho E, Blatz A Pediatr Res. 2025; .

PMID: 40075175 DOI: 10.1038/s41390-025-03942-0.


Early childhood adiposity, lifestyle and gut microbiome are linked to steatotic liver disease development in adolescents.

Cai C, Zhang Z, Alberti G, Pereira A, De Barbieri F, Garcia C Int J Obes (Lond). 2025; .

PMID: 40075127 DOI: 10.1038/s41366-025-01737-1.


Fungal community composition and function in different Chinese post-fermented teas.

Cui P, Li J, Yao T, Gan Z Sci Rep. 2025; 15(1):8514.

PMID: 40074817 PMC: 11903669. DOI: 10.1038/s41598-025-93420-8.


Cardiac energy metabolic disorder and gut microbiota imbalance: a study on the therapeutic potential of Shenfu Injection in rats with heart failure.

Zhao Z, Hu Z, Li L Front Microbiol. 2025; 16:1509548.

PMID: 40071211 PMC: 11895768. DOI: 10.3389/fmicb.2025.1509548.


References
1.
Cannone J, Subramanian S, Schnare M, Collett J, DSouza L, Du Y . The comparative RNA web (CRW) site: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs. BMC Bioinformatics. 2002; 3:2. PMC: 65690. DOI: 10.1186/1471-2105-3-2. View

2.
DeSantis T, Dubosarskiy I, Murray S, Andersen G . Comprehensive aligned sequence construction for automated design of effective probes (CASCADE-P) using 16S rDNA. Bioinformatics. 2003; 19(12):1461-8. DOI: 10.1093/bioinformatics/btg200. View

3.
Wisotzkey J, JURTSHUK Jr P, Fox G, Deinhard G, Poralla K . Comparative sequence analyses on the 16S rRNA (rDNA) of Bacillus acidocaldarius, Bacillus acidoterrestris, and Bacillus cycloheptanicus and proposal for creation of a new genus, Alicyclobacillus gen. nov. Int J Syst Bacteriol. 1992; 42(2):263-9. DOI: 10.1099/00207713-42-2-263. View

4.
Neefs J, Van de Peer Y, De Rijk P, Chapelle S, De Wachter R . Compilation of small ribosomal subunit RNA structures. Nucleic Acids Res. 1993; 21(13):3025-49. PMC: 309731. DOI: 10.1093/nar/21.13.3025. View

5.
Maidak B, Larsen N, McCaughey M, Overbeek R, Olsen G, Fogel K . The Ribosomal Database Project. Nucleic Acids Res. 1994; 22(17):3485-7. PMC: 308308. DOI: 10.1093/nar/22.17.3485. View