» Articles » PMID: 35140890

Predicting the Capsid Architecture of Phages from Metagenomic Data

Overview
Specialty Biotechnology
Date 2022 Feb 10
PMID 35140890
Authors
Affiliations
Soon will be listed here.
Abstract

Tailed phages are viruses that infect bacteria and are the most abundant biological entities on Earth. Their ecological, evolutionary, and biogeochemical roles in the planet stem from their genomic diversity. Known tailed phage genomes range from 10 to 735 kilobase pairs thanks to the size variability of the protective protein capsids that store them. However, the role of tailed phage capsids' diversity in ecosystems is unclear. A fundamental gap is the difficulty of associating genomic information with viral capsids in the environment. To address this problem, here, we introduce a computational approach to predict the capsid architecture (T-number) of tailed phages using the sequence of a single gene-the major capsid protein. This approach relies on an allometric model that relates the genome length and capsid architecture of tailed phages. This allometric model was applied to isolated phage genomes to generate a library that associated major capsid proteins and putative capsid architectures. This library was used to train machine learning methods, and the most computationally scalable model investigated (random forest) was applied to human gut metagenomes. Compared to isolated phages, the analysis of gut data reveals a large abundance of mid-sized (T = 7) capsids, as expected, followed by a relatively large frequency of jumbo-like tailed phage capsids (T ≥ 25) and small capsids (T = 4) that have been under-sampled. We discussed how to increase the method's accuracy and how to extend the approach to other viruses. The computational pipeline introduced here opens the doors to monitor the ongoing evolution and selection of viral capsids across ecosystems.

Citing Articles

Modeling Viral Capsid Assembly: A Review of Computational Strategies and Applications.

Guo W, Alarcon E, Sanchez J, Xiao C, Li L Cells. 2025; 13(24.

PMID: 39768179 PMC: 11674207. DOI: 10.3390/cells13242088.


Theoretical Studies on Assembly, Physical Stability, and Dynamics of Viruses.

Luque A, Reguera D Subcell Biochem. 2024; 105:693-741.

PMID: 39738961 DOI: 10.1007/978-3-031-65187-8_19.


Genomic analysis and characterization of lytic bacteriophages that target antimicrobial resistant in Addis Ababa, Ethiopia.

Sada T, Hailu Alemayehu D, Tafese K, Tessema T Heliyon. 2024; 10(22):e40342.

PMID: 39619596 PMC: 11605402. DOI: 10.1016/j.heliyon.2024.e40342.


satellite phage Aci01-2-Phanie depends on a helper myophage for its multiplication.

Pourcel C, Essoh C, Ouldali M, Tavares P J Virol. 2024; 98(7):e0066724.

PMID: 38829140 PMC: 11264900. DOI: 10.1128/jvi.00667-24.


pyCapsid: identifying dominant dynamics and quasi-rigid mechanical units in protein shells.

Brown C, Agarwal A, Luque A Bioinformatics. 2023; 40(1).

PMID: 38113434 PMC: 10786678. DOI: 10.1093/bioinformatics/btad761.


References
1.
Brandes N, Linial M . Gene overlapping and size constraints in the viral world. Biol Direct. 2016; 11:26. PMC: 4875738. DOI: 10.1186/s13062-016-0128-3. View

2.
Gertsman I, Gan L, Guttman M, Lee K, Speir J, Duda R . An unexpected twist in viral capsid maturation. Nature. 2009; 458(7238):646-50. PMC: 2765791. DOI: 10.1038/nature07686. View

3.
Gregory A, Zablocki O, Zayed A, Howell A, Bolduc B, Sullivan M . The Gut Virome Database Reveals Age-Dependent Patterns of Virome Diversity in the Human Gut. Cell Host Microbe. 2020; 28(5):724-740.e8. PMC: 7443397. DOI: 10.1016/j.chom.2020.08.003. View

4.
Shamash M, Maurice C . Phages in the infant gut: a framework for virome development during early life. ISME J. 2021; 16(2):323-330. PMC: 8776839. DOI: 10.1038/s41396-021-01090-x. View

5.
Iyer L, Anantharaman V, Krishnan A, Burroughs A, Aravind L . Jumbo Phages: A Comparative Genomic Overview of Core Functions and Adaptions for Biological Conflicts. Viruses. 2021; 13(1). PMC: 7824862. DOI: 10.3390/v13010063. View