Inference of Epistatic Effects Leading to Entrenchment and Drug Resistance in HIV-1 Protease
Overview
Affiliations
Understanding the complex mutation patterns that give rise to drug resistant viral strains provides a foundation for developing more effective treatment strategies for HIV/AIDS. Multiple sequence alignments of drug-experienced HIV-1 protease sequences contain networks of many pair correlations which can be used to build a (Potts) Hamiltonian model of these mutation patterns. Using this Hamiltonian model, we translate HIV-1 protease sequence covariation data into quantitative predictions for the probability of observing specific mutation patterns which are in agreement with the observed sequence statistics. We find that the statistical energies of the Potts model are correlated with the fitness of individual proteins containing therapy-associated mutations as estimated by in vitro measurements of protein stability and viral infectivity. We show that the penalty for acquiring primary resistance mutations depends on the epistatic interactions with the sequence background. Primary mutations which lead to drug resistance can become highly advantageous (or entrenched) by the complex mutation patterns which arise in response to drug therapy despite being destabilizing in the wildtype background. Anticipating epistatic effects is important for the design of future protease inhibitor therapies.
Using AlphaFold2 to Predict the Conformations of Side Chains in Folded Proteins.
Maisuradze G, Thakur A, Khatri K, Haldane A, Levy R bioRxiv. 2025; .
PMID: 39990457 PMC: 11844428. DOI: 10.1101/2025.02.10.637534.
A fundamental and theoretical framework for mutation interactions and epistasis.
Giacoletto C, Benjamin R, Rotter J, Schiller M Genomics. 2024; 116(6):110963.
PMID: 39561884 PMC: 11752442. DOI: 10.1016/j.ygeno.2024.110963.
Machine learning in biological physics: From biomolecular prediction to design.
Martin J, Lequerica Mateos M, Onuchic J, Coluzza I, Morcos F Proc Natl Acad Sci U S A. 2024; 121(27):e2311807121.
PMID: 38913893 PMC: 11228481. DOI: 10.1073/pnas.2311807121.
Biswas A, Choudhuri I, Arnold E, Lyumkis D, Haldane A, Levy R Proc Natl Acad Sci U S A. 2024; 121(15):e2316662121.
PMID: 38557187 PMC: 11009627. DOI: 10.1073/pnas.2316662121.
pycofitness-Evaluating the fitness landscape of RNA and protein sequences.
Pucci F, Zerihun M, Rooman M, Schug A Bioinformatics. 2024; 40(2).
PMID: 38335928 PMC: 10881095. DOI: 10.1093/bioinformatics/btae074.