» Articles » PMID: 30252023

MSPminer: Abundance-based Reconstitution of Microbial Pan-genomes from Shotgun Metagenomic Data

Overview
Journal Bioinformatics
Specialty Biology
Date 2018 Sep 26
PMID 30252023
Citations 58
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Analysis toolkits for shotgun metagenomic data achieve strain-level characterization of complex microbial communities by capturing intra-species gene content variation. Yet, these tools are hampered by the extent of reference genomes that are far from covering all microbial variability, as many species are still not sequenced or have only few strains available. Binning co-abundant genes obtained from de novo assembly is a powerful reference-free technique to discover and reconstitute gene repertoire of microbial species. While current methods accurately identify species core parts, they miss many accessory genes or split them into small gene groups that remain unassociated to core clusters.

Results: We introduce MSPminer, a computationally efficient software tool that reconstitutes Metagenomic Species Pan-genomes (MSPs) by binning co-abundant genes across metagenomic samples. MSPminer relies on a new robust measure of proportionality coupled with an empirical classifier to group and distinguish not only species core genes but accessory genes also. Applied to a large scale metagenomic dataset, MSPminer successfully delineates in a few hours the gene repertoires of 1661 microbial species with similar specificity and higher sensitivity than existing tools. The taxonomic annotation of MSPs reveals microorganisms hitherto unknown and brings coherence in the nomenclature of the species of the human gut microbiota. The provided MSPs can be readily used for taxonomic profiling and biomarkers discovery in human gut metagenomic samples. In addition, MSPminer can be applied on gene count tables from other ecosystems to perform similar analyses.

Availability And Implementation: The binary is freely available for non-commercial users at www.enterome.com/downloads.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

Gut heavy metal and antibiotic resistome of humans living in the high Arctic.

Hauptmann A, Johansen J, Staeger F, Nielsen D, Mulvad G, Hanghoj K Front Microbiol. 2024; 15:1493803.

PMID: 39539714 PMC: 11557323. DOI: 10.3389/fmicb.2024.1493803.


Commensal consortia decolonize Enterobacteriaceae via ecological control.

Furuichi M, Kawaguchi T, Pust M, Yasuma-Mitobe K, Plichta D, Hasegawa N Nature. 2024; 633(8031):878-886.

PMID: 39294375 PMC: 11424487. DOI: 10.1038/s41586-024-07960-6.


Global compositional and functional states of the human gut microbiome in health and disease.

Lee S, Portlock T, Le Chatelier E, Garcia-Guevara F, Clasen F, Plaza Onate F Genome Res. 2024; 34(6):967-978.

PMID: 39038849 PMC: 11293553. DOI: 10.1101/gr.278637.123.


Driving gut microbiota enterotypes through host genetics.

Larzul C, Estelle J, Borey M, Blanc F, Lemonnier G, Billon Y Microbiome. 2024; 12(1):116.

PMID: 38943206 PMC: 11214205. DOI: 10.1186/s40168-024-01827-8.


Privacy-Preserving Federated Survival Support Vector Machines for Cross-Institutional Time-To-Event Analysis: Algorithm Development and Validation.

Spath J, Sewald Z, Probul N, Berland M, Almeida M, Pons N JMIR AI. 2024; 3:e47652.

PMID: 38875678 PMC: 11041494. DOI: 10.2196/47652.


References
1.
Medini D, Donati C, Tettelin H, Masignani V, Rappuoli R . The microbial pan-genome. Curr Opin Genet Dev. 2005; 15(6):589-94. DOI: 10.1016/j.gde.2005.09.006. View

2.
Koonin E, Wolf Y . Genomics of bacteria and archaea: the emerging dynamic view of the prokaryotic world. Nucleic Acids Res. 2008; 36(21):6688-719. PMC: 2588523. DOI: 10.1093/nar/gkn668. View

3.
Touchon M, Hoede C, Tenaillon O, Barbe V, Baeriswyl S, Bidet P . Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths. PLoS Genet. 2009; 5(1):e1000344. PMC: 2617782. DOI: 10.1371/journal.pgen.1000344. View

4.
Scaria J, Ponnala L, Janvilisri T, Yan W, Mueller L, Chang Y . Analysis of ultra low genome conservation in Clostridium difficile. PLoS One. 2010; 5(12):e15147. PMC: 2999544. DOI: 10.1371/journal.pone.0015147. View

5.
Fodor A, DeSantis T, Wylie K, Badger J, Ye Y, Hepburn T . The "most wanted" taxa from the human microbiome for whole genome sequencing. PLoS One. 2012; 7(7):e41294. PMC: 3406062. DOI: 10.1371/journal.pone.0041294. View