
Species-level Deconvolution of Metagenome Assemblies with Hi-C-based Contact Probability Maps

Overview
Journal G3 (Bethesda)
Date 2014 May 24
PMID 24855317
Citations 99
Abstract

Microbial communities consist of mixed populations of organisms, including unknown species in unknown abundances. These communities are often studied through metagenomic shotgun sequencing, but standard library construction methods remove long-range contiguity information; thus, shotgun sequencing and de novo assembly of a metagenome typically yield a collection of contigs that cannot readily be grouped by species. Methods for generating chromatin-level contact probability maps, e.g., as generated by the Hi-C method, provide a signal of contiguity that is completely intracellular and contains both intrachromosomal and interchromosomal information. Here, we demonstrate how this signal can be exploited to reconstruct the individual genomes of microbial species present within a mixed sample. We apply this approach to two synthetic metagenome samples, successfully clustering the genome content of fungal, bacterial, and archaeal species with more than 99% agreement with published reference genomes. We also show that the Hi-C signal can secondarily be used to create scaffolded genome assemblies of individual eukaryotic species present within the microbial community, with higher levels of contiguity than some of the species' published reference genomes.
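The clustering idea in the abstract can be illustrated with a small sketch. This is not the authors' pipeline: it is a minimal example that assumes Hi-C read pairs have already been counted per contig pair, that link counts are normalized by the product of contig lengths (a hypothetical choice of normalizer), that average-linkage agglomerative merging is used, and that the target number of species is supplied. The function name `cluster_contigs` and the toy data are invented for illustration.

```python
from itertools import combinations


def cluster_contigs(lengths, links, n_clusters):
    """Group contigs by Hi-C link density (illustrative sketch only).

    lengths: dict mapping contig name -> length in bp (assumed normalizer)
    links:   dict mapping frozenset({a, b}) -> Hi-C read pairs joining a and b
    """
    # Start with every contig in its own cluster.
    clusters = [{name} for name in sorted(lengths)]

    def density(x, y):
        # Average normalized link count over all cross-cluster contig pairs.
        total = sum(
            links.get(frozenset((a, b)), 0) / (lengths[a] * lengths[b])
            for a in x
            for b in y
        )
        return total / (len(x) * len(y))

    while len(clusters) > n_clusters:
        # Pick the pair of clusters with the strongest Hi-C support.
        i, j = max(
            combinations(range(len(clusters)), 2),
            key=lambda p: density(clusters[p[0]], clusters[p[1]]),
        )
        if density(clusters[i], clusters[j]) == 0:
            break  # no remaining Hi-C evidence to justify a merge
        clusters[i] |= clusters[j]
        del clusters[j]  # safe because i < j
    return clusters


# Toy data: three contigs from one species, two from another,
# plus a single spurious inter-species link.
lengths = {"a1": 20_000, "a2": 10_000, "a3": 10_000, "b1": 10_000, "b2": 10_000}
links = {
    frozenset(("a1", "a2")): 40,
    frozenset(("a2", "a3")): 30,
    frozenset(("a1", "a3")): 25,
    frozenset(("b1", "b2")): 50,
    frozenset(("a1", "b1")): 1,
}
print(sorted(sorted(c) for c in cluster_contigs(lengths, links, 2)))
# prints [['a1', 'a2', 'a3'], ['b1', 'b2']]
```

Because Hi-C contacts form almost entirely within a single cell, contigs from the same genome share far more links than contigs from different species, so the spurious `a1`-`b1` link is outweighed during merging.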

Citing Articles

Shotgun and Hi-C Sequencing Datasets for Binning Wheat Rhizosphere Microbiome.

Regmi R, Anderson J, Burgess L, Mangelson H, Liachko I, Vadakattu G. Sci Data. 2025; 12(1):367.

PMID: 40025082 PMC: 11873117. DOI: 10.1038/s41597-025-04651-3.


MOSTPLAS: a self-correction multi-label learning model for plasmid host range prediction.

Zou W, Ji Y, Guan J, Sun Y. Bioinformatics. 2025; 41(3).

PMID: 39960880 PMC: 11897426. DOI: 10.1093/bioinformatics/btaf075.


Chromosome-level genome assembly of Phortica okadai, a vector of Thelazia callipaeda.

Wang L, Yu H, Luo B, Yan R, Zhou J, Liu H. Sci Data. 2024; 11(1):1370.

PMID: 39695142 PMC: 11655863. DOI: 10.1038/s41597-024-04239-3.


Using genomics to explore the epidemiology of vancomycin resistance in a sewage system.

Jensen E, Otani S, Liachko I, Auch B, Aarestrup F. Microbiol Spectr. 2024; 13(1):e0148924.

PMID: 39656004 PMC: 11705837. DOI: 10.1128/spectrum.01489-24.


CRISPR spacers acquired from plasmids primarily target backbone genes, making them valuable for predicting potential hosts and host range.

Androsiuk L, Maane S, Tal S. Microbiol Spectr. 2024; :e0010424.

PMID: 39508585 PMC: 11619364. DOI: 10.1128/spectrum.00104-24.

