
Synthetic Community Hi-C Benchmarking Provides a Baseline for Virus-host Inferences

Overview
Journal: bioRxiv
Date: 2025 Feb 24
PMID: 39990352
Abstract

Microbiomes are now recognized as key influencers of diverse ecosystems, but it is increasingly evident that viruses impose significant constraints on these microbial communities. While viromics has expanded virus genomic catalogs, identifying hosts for these viruses remains a major challenge because cultivation scales poorly and the reliability of predictions is uncertain for understudied regions of the virosphere. Hi-C, a proximity ligation-based method and a promising recent advance, aims to infer virus-host linkages by analyzing sequences from cross-linked virus and host genomic fragments. This approach has been applied in at least seven studies, yet its accuracy has not been systematically assessed. Here we evaluate Hi-C performance in predicting virus-host interactions using a synthetic community consisting of four bacterial strains and nine phages with known, experimentally determined, quantitative interactions. Our analysis revealed that Hi-C linkage scores as used in the literature perform poorly (13% specificity, 100% sensitivity). Converting linkage scores to Z-scores and filtering at Z-score ≥ 0.5 raised specificity to 96%, though sensitivity fell to 57%. These findings provide empirical data and establish guidelines for interpreting Hi-C inferred virus-host linkages, with the aim of improving the method's reliability across diverse ecosystems.
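The filtering step described in the abstract can be illustrated with a minimal Python sketch. The abstract does not state how the Z-scores are computed, so this sketch assumes, purely for illustration, that raw linkage scores are standardized across all virus-host pairs in one sample using the population standard deviation. The pair names, scores, and truth labels below are invented toy values, not data from the study, and the function names are hypothetical.

```python
from statistics import mean, pstdev

# Toy linkage scores per (virus, host) pair. Invented values, not study data.
LINKAGE_SCORES = {
    ("phage_A", "host_1"): 120.0,
    ("phage_A", "host_2"): 4.0,
    ("phage_B", "host_1"): 3.0,
    ("phage_B", "host_2"): 80.0,
    ("phage_C", "host_1"): 10.0,
    ("phage_C", "host_2"): 5.0,
}

# Toy ground truth: True where infection is (hypothetically) known to occur.
KNOWN_INTERACTIONS = {
    ("phage_A", "host_1"): True,
    ("phage_A", "host_2"): False,
    ("phage_B", "host_1"): False,
    ("phage_B", "host_2"): True,
    ("phage_C", "host_1"): True,
    ("phage_C", "host_2"): False,
}


def z_scores(scores):
    """Standardize scores across all pairs (an assumption of this sketch)."""
    values = list(scores.values())
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # All scores identical: no pair stands out from the rest.
        return {pair: 0.0 for pair in scores}
    return {pair: (value - mu) / sigma for pair, value in scores.items()}


def filter_by_z(scores, threshold=0.5):
    """Return the set of pairs whose Z-score meets the threshold."""
    return {pair for pair, z in z_scores(scores).items() if z >= threshold}


def sensitivity_specificity(predicted, truth):
    """Compute (sensitivity, specificity) of predicted pairs against truth."""
    tp = sum(1 for pair, real in truth.items() if real and pair in predicted)
    fn = sum(1 for pair, real in truth.items() if real and pair not in predicted)
    tn = sum(1 for pair, real in truth.items() if not real and pair not in predicted)
    fp = sum(1 for pair, real in truth.items() if not real and pair in predicted)
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Unfiltered baseline: every pair with a nonzero linkage score is a prediction.
unfiltered = {pair for pair, value in LINKAGE_SCORES.items() if value > 0}
filtered = filter_by_z(LINKAGE_SCORES, threshold=0.5)

if __name__ == "__main__":
    print("unfiltered:", sensitivity_specificity(unfiltered, KNOWN_INTERACTIONS))
    print("filtered:  ", sensitivity_specificity(filtered, KNOWN_INTERACTIONS))
```

On the toy values, filtering removes the weakly linked pairs, which raises specificity and costs one true interaction. This is the same direction of trade-off the abstract reports, although the numbers here carry no empirical meaning.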
