
Binnacle: Using Scaffolds to Improve the Contiguity and Quality of Metagenomic Bins

Overview
Journal: Front Microbiol
Specialty: Microbiology
Date: 2021 Mar 15
PMID: 33717033
Citations: 2
Abstract

High-throughput sequencing has revolutionized the field of microbiology; however, reconstructing complete genomes of organisms from whole metagenomic shotgun sequencing data remains a challenge. Recovered genomes are often highly fragmented due to uneven abundances of organisms, repeats within and across genomes, sequencing errors, and strain-level variation. To address the fragmented nature of metagenomic assemblies, scientists rely on a process called binning, which clusters together contigs inferred to originate from the same organism. Existing binning algorithms use oligonucleotide frequencies and contig abundance (coverage) within and across samples to group together contigs from the same organism. However, these algorithms often miss short contigs and contigs from regions with unusual coverage or DNA composition characteristics, such as mobile elements. Here, we propose that information from assembly graphs can assist current strategies for metagenomic binning. We use MetaCarvel, a metagenomic scaffolding tool, to construct assembly graphs in which contigs are nodes and edges are inferred from paired-end reads. We developed a tool, Binnacle, that extracts information from the assembly graphs and clusters scaffolds into comprehensive bins. Binnacle also provides wrapper scripts to integrate with existing binning methods. The Binnacle pipeline can be found on GitHub (https://github.com/marbl/binnacle). We show that binning graph-based scaffolds, rather than contigs, improves the contiguity and quality of the resulting bins and captures a broader set of the genes of the organisms being reconstructed.
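The idea of binning scaffolds rather than individual contigs can be illustrated with a toy sketch. This is not the Binnacle or MetaCarvel implementation: the function names, the `min_support` threshold, the use of connected components as "scaffolds", and all input values are assumptions made purely for illustration. It shows how a short contig with unusual coverage (for example, a mobile element) can be carried along with its neighbors once paired-end links are taken into account.

```python
from collections import defaultdict

def build_scaffolds(contigs, links, min_support=3):
    """Group contigs into toy 'scaffolds': connected components of a graph
    whose nodes are contigs and whose edges are paired-end links with at
    least `min_support` supporting read pairs (hypothetical threshold)."""
    parent = {c: c for c in contigs}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b, support in links:
        if support >= min_support:
            root_a, root_b = find(a), find(b)
            if root_a != root_b:
                parent[root_b] = root_a

    groups = defaultdict(list)
    for c in contigs:
        groups[find(c)].append(c)
    return sorted(sorted(g) for g in groups.values())

def scaffold_coverage(scaffold, lengths, coverage):
    """Length-weighted mean coverage of the contigs in one scaffold."""
    total_length = sum(lengths[c] for c in scaffold)
    return sum(lengths[c] * coverage[c] for c in scaffold) / total_length

# Made-up example: c2 is short and has unusual coverage, so a contig-level
# binner might drop it; the links keep it with c1 and c3.
contigs = ["c1", "c2", "c3", "c4"]
lengths = {"c1": 5000, "c2": 300, "c3": 2000, "c4": 4000}
coverage = {"c1": 20.0, "c2": 60.0, "c3": 20.0, "c4": 5.0}
links = [("c1", "c2", 10), ("c2", "c3", 5), ("c3", "c4", 1)]

scaffolds = build_scaffolds(contigs, links)
scaffold_cov = [scaffold_coverage(s, lengths, coverage) for s in scaffolds]
```

In this sketch the weakly supported link between c3 and c4 is ignored, so c4 stays on its own, while the per-scaffold coverage values could then be passed to an existing binning method in place of per-contig values.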

Citing Articles

Unveiling microbial diversity: harnessing long-read sequencing technology.

Agustinho D, Fu Y, Menon V, Metcalf G, Treangen T, Sedlazeck F. Nat Methods. 2024; 21(6):954-966.

PMID: 38689099 DOI: 10.1038/s41592-024-02262-1.


BinSPreader: Refine binning results for fuller MAG reconstruction.

Tolstoganov I, Kamenev Y, Kruglikov R, Ochkalova S, Korobeynikov A. iScience. 2022; 25(8):104770.

PMID: 35992057 PMC: 9386100. DOI: 10.1016/j.isci.2022.104770.
