The Interplay of SARS-CoV-2 Evolution and Constraints Imposed by the Structure and Functionality of Its Proteins
Overview
Affiliations
The unprecedented pace of the sequencing of the SARS-CoV-2 virus genomes provides us with unique information about the genetic changes in a single pathogen during ongoing pandemic. By the analysis of close to 200,000 genomes we show that the patterns of the SARS-CoV-2 virus mutations along its genome are closely correlated with the structural and functional features of the encoded proteins. Requirements of foldability of proteins' 3D structures and the conservation of their key functional regions, such as protein-protein interaction interfaces, are the dominant factors driving evolutionary selection in protein-coding genes. At the same time, avoidance of the host immunity leads to the abundance of mutations in other regions, resulting in high variability of the missense mutation rate along the genome. "Unexplained" peaks and valleys in the mutation rate provide hints on function for yet uncharacterized genomic regions and specific protein structural and functional features they code for. Some of these observations have immediate practical implications for the selection of target regions for PCR-based COVID-19 tests and for evaluating the risk of mutations in epitopes targeted by specific antibodies and vaccine design strategies.
Torsional twist of the SARS-CoV and SARS-CoV-2 SUD-N and SUD-M domains.
Rosas-Lemus M, Minasov G, Brunzelle J, Taha T, Lemak S, Yin S Protein Sci. 2025; 34(3):e70050.
PMID: 39969084 PMC: 11837046. DOI: 10.1002/pro.70050.
Barozi V, Chakraborty S, Govender S, Morgan E, Ramahala R, Graham S Comput Struct Biotechnol J. 2024; 23:3800-3816.
PMID: 39525081 PMC: 11550722. DOI: 10.1016/j.csbj.2024.10.031.
Torsional Twist of the SARS-CoV and SARS-CoV-2 SUD-N and SUD-M domains.
Rosas-Lemus M, Minasov G, Brunzelle J, Taha T, Lemak S, Yin S bioRxiv. 2024; .
PMID: 39185168 PMC: 11343135. DOI: 10.1101/2024.08.13.607777.
Dynamic expedition of leading mutations in SARS-CoV-2 spike glycoproteins.
Hasan M, He Z, Jia M, Leung A, Natarajan K, Xu W Comput Struct Biotechnol J. 2024; 23:2407-2417.
PMID: 38882678 PMC: 11176665. DOI: 10.1016/j.csbj.2024.05.037.
Caobi A, Saeed M Curr Opin Microbiol. 2024; 79:102454.
PMID: 38518551 PMC: 11162932. DOI: 10.1016/j.mib.2024.102454.