» Articles » PMID: 26368049

Detection of Low-abundance Bacterial Strains in Metagenomic Datasets by Eigengenome Partitioning

Overview
Journal Nat Biotechnol
Specialty Biotechnology
Date 2015 Sep 15
PMID 26368049
Citations 66
Authors
Affiliations
Soon will be listed here.
Abstract

Analyses of metagenomic datasets that are sequenced to a depth of billions or trillions of bases can uncover hundreds of microbial genomes, but naive assembly of these data is computationally intensive, requiring hundreds of gigabytes to terabytes of RAM. We present latent strain analysis (LSA), a scalable, de novo pre-assembly method that separates reads into biologically informed partitions and thereby enables assembly of individual genomes. LSA is implemented with a streaming calculation of unobserved variables that we call eigengenomes. Eigengenomes reflect covariance in the abundance of short, fixed-length sequences, or k-mers. As the abundance of each genome in a sample is reflected in the abundance of each k-mer in that genome, eigengenome analysis can be used to partition reads from different genomes. This partitioning can be done in fixed memory using tens of gigabytes of RAM, which makes assembly and downstream analyses of terabytes of data feasible on commodity hardware. Using LSA, we assemble partial and near-complete genomes of bacterial taxa present at relative abundances as low as 0.00001%. We also show that LSA is sensitive enough to separate reads from several strains of the same species.

Citing Articles

Evaluating the potential of assembler-binner combinations in recovering low-abundance and strain-resolved genomes from human metagenomes.

Qayyum H, Talib M, Ali A, Kayani M Heliyon. 2025; 11(2):e41938.

PMID: 39897886 PMC: 11786835. DOI: 10.1016/j.heliyon.2025.e41938.


Metagenomic analysis of rats with diarrhea treated with mixed probiotics: response to consecutive and alternate-hour supplementation.

Wang A, Cui X, Shi C Transl Pediatr. 2024; 13(8):1336-1358.

PMID: 39263297 PMC: 11384429. DOI: 10.21037/tp-24-129.


Olivar: towards automated variant aware primer design for multiplex tiled amplicon sequencing of pathogens.

Wang M, Lou E, Sapoval N, Kim E, Kalvapalle P, Kille B Nat Commun. 2024; 15(1):6306.

PMID: 39060254 PMC: 11282221. DOI: 10.1038/s41467-024-49957-9.


MetaTrass: A high-quality metagenome assembler of the human gut microbiome by cobarcoding sequencing reads.

Qi Y, Gu S, Zhang Y, Guo L, Xu M, Cheng X Imeta. 2024; 1(4):e46.

PMID: 38867906 PMC: 10989976. DOI: 10.1002/imt2.46.


mEnrich-seq: methylation-guided enrichment sequencing of bacterial taxa of interest from microbiome.

Cao L, Kong Y, Fan Y, Ni M, Tourancheau A, Ksiezarek M Nat Methods. 2024; 21(2):236-246.

PMID: 38177508 PMC: 11474163. DOI: 10.1038/s41592-023-02125-1.


References
1.
Morowitz M, Poroyko V, Caplan M, Alverdy J, Liu D . Redefining the role of intestinal microbes in the pathogenesis of necrotizing enterocolitis. Pediatrics. 2010; 125(4):777-85. DOI: 10.1542/peds.2009-3149. View

2.
Pop M . Genome assembly reborn: recent computational challenges. Brief Bioinform. 2009; 10(4):354-66. PMC: 2691937. DOI: 10.1093/bib/bbp026. View

3.
Treangen T, Koren S, Sommer D, Liu B, Astrovskaya I, Ondov B . MetAMOS: a modular and open source metagenomic assembly and analysis pipeline. Genome Biol. 2013; 14(1):R2. PMC: 4053804. DOI: 10.1186/gb-2013-14-1-r2. View

4.
Fierer N, Breitbart M, Nulton J, Salamon P, Lozupone C, Jones R . Metagenomic and small-subunit rRNA analyses reveal the genetic diversity of bacteria, archaea, fungi, and viruses in soil. Appl Environ Microbiol. 2007; 73(21):7059-66. PMC: 2074941. DOI: 10.1128/AEM.00358-07. View

5.
Daniel R . The metagenomics of soil. Nat Rev Microbiol. 2005; 3(6):470-8. DOI: 10.1038/nrmicro1160. View