
PlasForest: a Homology-based Random Forest Classifier for Plasmid Detection in Genomic Datasets

Overview
Publisher Biomed Central
Specialty Biology
Date 2021 Jun 27
PMID 34174810
Citations 18
Abstract

Background: Plasmids are mobile genetic elements that often carry accessory genes and act as vectors for horizontal transfer between bacterial genomes. Detecting plasmids in large genomic datasets is crucial for analyzing their spread and quantifying their role in bacterial adaptation, particularly in the propagation of antibiotic resistance. Bioinformatics methods have been developed to detect plasmids. However, they suffer from low sensitivity (i.e., most plasmids remain undetected) or low precision (i.e., chromosomes are misidentified as plasmids), and overall they are not suited to identifying plasmids in whole genomes that are not fully assembled (contigs and scaffolds).

Results: We developed PlasForest, a homology-based random forest classifier that identifies bacterial plasmid sequences in partially assembled genomes. Without knowing the taxonomical origin of the samples, PlasForest classifies contigs as plasmids or chromosomes with an F1 score of 0.950. Notably, it detects 77.4% of plasmid contigs below 1 kb with 2.8% false positives, and 99.9% of plasmid contigs over 50 kb with 2.2% false positives.
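
For readers less familiar with the metrics quoted in the abstract, the minimal sketch below shows how sensitivity, precision, and the F1 score relate to one another. The function name and the contig counts are hypothetical illustrations; they are not taken from the PlasForest paper or its code.

```python
def classification_metrics(tp: int, fp: int, fn: int) -> dict:
    """Compute sensitivity, precision and F1 from confusion-matrix counts.

    tp: plasmid contigs correctly called plasmid
    fp: chromosome contigs wrongly called plasmid
    fn: plasmid contigs wrongly called chromosome
    """
    # Sensitivity (recall): fraction of true plasmid contigs that were detected.
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    # Precision: fraction of contigs called plasmid that really are plasmids.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    # F1: harmonic mean of precision and sensitivity.
    total = precision + sensitivity
    f1 = 2 * precision * sensitivity / total if total else 0.0
    return {"sensitivity": sensitivity, "precision": precision, "f1": f1}


# Hypothetical counts, chosen only to illustrate the arithmetic.
metrics = classification_metrics(tp=90, fp=10, fn=10)
print(metrics)
```

Because F1 is a harmonic mean, it is high only when sensitivity and precision are both high, which is why a tool that is weak on either one scores poorly.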

Conclusions: PlasForest outperforms other currently available tools on genomic datasets by being both sensitive and precise. Its performance on metagenomic assemblies is currently well below that of k-mer-based methods, and we discuss how homology-based approaches could improve plasmid detection in such datasets.

Citing Articles

Plaseval: a framework for comparing and evaluating plasmid detection tools.

Mane A, Sanderson H, White A, Zaheer R, Beiko R, Chauve C BMC Bioinformatics. 2024; 25(1):365.

PMID: 39592962 PMC: 11590284. DOI: 10.1186/s12859-024-05941-0.


MOBFinder: a tool for mobilization typing of plasmid metagenomic fragments based on a language model.

Feng T, Wu S, Zhou H, Fang Z Gigascience. 2024; 13.

PMID: 39101782 PMC: 11299106. DOI: 10.1093/gigascience/giae047.


PlasmidHunter: accurate and fast prediction of plasmid sequences using gene content profile and machine learning.

Tian R, Zhou J, Imanian B Brief Bioinform. 2024; 25(4).

PMID: 38960405 PMC: 11770376. DOI: 10.1093/bib/bbae322.


Effect of a probiotic and an antibiotic on the mobilome of the porcine microbiota.

Monger X, Saucier L, Guay F, Turcotte A, Lemieux J, Pouliot E Front Genet. 2024; 15:1355134.

PMID: 38606356 PMC: 11006968. DOI: 10.3389/fgene.2024.1355134.


Plasmids in the human gut reveal neutral dispersal and recombination that is overpowered by inflammatory diseases.

Zorea A, Pellow D, Levin L, Pilosof S, Friedman J, Shamir R Nat Commun. 2024; 15(1):3147.

PMID: 38605009 PMC: 11009399. DOI: 10.1038/s41467-024-47272-x.

