» Articles » PMID: 34172093

SCAPP: an Algorithm for Improved Plasmid Assembly in Metagenomes

Overview
Journal Microbiome
Publisher Biomed Central
Specialties Genetics
Microbiology
Date 2021 Jun 26
PMID 34172093
Citations 30
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Metagenomic sequencing has led to the identification and assembly of many new bacterial genome sequences. These bacteria often contain plasmids: usually small, circular double-stranded DNA molecules that may transfer across bacterial species and confer antibiotic resistance. These plasmids are generally less studied and understood than their bacterial hosts. Part of the reason for this is insufficient computational tools enabling the analysis of plasmids in metagenomic samples.

Results: We developed SCAPP (Sequence Contents-Aware Plasmid Peeler)-an algorithm and tool to assemble plasmid sequences from metagenomic sequencing. SCAPP builds on some key ideas from the Recycler algorithm while improving plasmid assemblies by integrating biological knowledge about plasmids. We compared the performance of SCAPP to Recycler and metaplasmidSPAdes on simulated metagenomes, real human gut microbiome samples, and a human gut plasmidome dataset that we generated. We also created plasmidome and metagenome data from the same cow rumen sample and used the parallel sequencing data to create a novel assessment procedure. Overall, SCAPP outperformed Recycler and metaplasmidSPAdes across this wide range of datasets.

Conclusions: SCAPP is an easy to use Python package that enables the assembly of full plasmid sequences from metagenomic samples. It outperformed existing metagenomic plasmid assemblers in most cases and assembled novel and clinically relevant plasmids in samples we generated such as a human gut plasmidome. SCAPP is open-source software available from: https://github.com/Shamir-Lab/SCAPP . Video abstract.

Citing Articles

Metagenomic analysis of pristine oil sheds new light on the global distribution of microbial genetic repertoire in hydrocarbon-associated ecosystems.

Plewka J, Alibrandi A, Bornemann T, Esser S, Stach T, Sures K Microlife. 2025; 6:uqae027.

PMID: 39877152 PMC: 11774207. DOI: 10.1093/femsml/uqae027.


Integrative genomics would strengthen AMR understanding through ONE health approach.

Liu C, Pandey R Heliyon. 2025; 10(14):e34719.

PMID: 39816336 PMC: 11734142. DOI: 10.1016/j.heliyon.2024.e34719.


Chromosomal Type II Toxin-Antitoxin Systems May Enhance Bacterial Fitness of a Hybrid Pathogenic Strain Under Stress Conditions.

Silva J, Marques-Neto L, Carvalho E, Del Carpio A, Henrique C, Leite L Toxins (Basel). 2024; 16(11).

PMID: 39591224 PMC: 11598369. DOI: 10.3390/toxins16110469.


Sequencing Strategy to Ensure Accurate Plasmid Assembly.

Hernandez S, Berezin C, Miller K, Peccoud S, Peccoud J ACS Synth Biol. 2024; 13(12):4099-4109.

PMID: 39508818 PMC: 11706207. DOI: 10.1021/acssynbio.4c00539.


Mixed waste contamination selects for a mobile genetic element population enriched in multiple heavy metal resistance genes.

Goff J, Lui L, Nielsen T, Poole F, Smith H, Walker K ISME Commun. 2024; 4(1):ycae064.

PMID: 38800128 PMC: 11128244. DOI: 10.1093/ismeco/ycae064.


References
1.
Antipov D, Hartwick N, Shen M, Raiko M, Lapidus A, Pevzner P . plasmidSPAdes: assembling plasmids from whole genome sequencing data. Bioinformatics. 2016; 32(22):3380-3387. DOI: 10.1093/bioinformatics/btw493. View

2.
Antipov D, Raiko M, Lapidus A, Pevzner P . Plasmid detection and assembly in genomic and metagenomic data sets. Genome Res. 2019; 29(6):961-968. PMC: 6581055. DOI: 10.1101/gr.241299.118. View

3.
Vrieze A, van Nood E, Holleman F, Salojarvi J, Kootte R, Bartelsman J . Transfer of intestinal microbiota from lean donors increases insulin sensitivity in individuals with metabolic syndrome. Gastroenterology. 2012; 143(4):913-6.e7. DOI: 10.1053/j.gastro.2012.06.031. View

4.
Krawczyk P, Lipinski L, Dziembowski A . PlasFlow: predicting plasmid sequences in metagenomic data using genome signatures. Nucleic Acids Res. 2018; 46(6):e35. PMC: 5887522. DOI: 10.1093/nar/gkx1321. View

5.
Zhou F, Xu Y . cBar: a computer program to distinguish plasmid-derived from chromosome-derived sequence fragments in metagenomics data. Bioinformatics. 2010; 26(16):2051-2. PMC: 2916713. DOI: 10.1093/bioinformatics/btq299. View