How Structural and Physicochemical Determinants Shape Sequence Constraints in a Functional Enzyme
Overview
Affiliations
The need for interfacing structural biology and biophysics to molecular evolution is being increasingly recognized. One part of the big problem is to understand how physics and chemistry shape the sequence space available to functional proteins, while satisfying the needs of biology. Here we present a quantitative, structure-based analysis of a high-resolution map describing the tolerance to all substitutions in all positions of a functional enzyme, namely a TEM lactamase previously studied through deep sequencing of mutants growing in competition experiments with selection against ampicillin. Substitutions are rarely observed within 7 Å of the active site, a stringency that is relaxed slowly and extends up to 15-20 Å, with buried residues being especially sensitive. Substitution patterns in over one third of the residues can be quantitatively modeled by monotonic dependencies on amino acid descriptors and predictions of changes in folding stability. Amino acid volume and steric hindrance shape constraints on the protein core; hydrophobicity and solubility shape constraints on hydrophobic clusters underneath the surface, and on salt bridges and polar networks at the protein surface together with charge and hydrogen bonding capacity. Amino acid solubility, flexibility and conformational descriptors also provide additional constraints at many locations. These findings provide fundamental insights into the chemistry underlying protein evolution and design, by quantitating links between sequence and different protein traits, illuminating subtle and unexpected sequence-trait relationships and pinpointing what traits are sacrificed upon gain-of-function mutation.
Joshi D, Pradhan S, Sajeed R, Srinivasan R, Rana S Hum Genet. 2025; .
PMID: 39869148 DOI: 10.1007/s00439-025-02727-z.
A look beyond the QR code of SNARE proteins.
Yadav D, Hacisuleyman A, Dergai M, Khalifeh D, Abriata L, Dal Peraro M Protein Sci. 2024; 33(9):e5158.
PMID: 39180485 PMC: 11344281. DOI: 10.1002/pro.5158.
Context-aware geometric deep learning for protein sequence design.
Krapp L, Meireles F, Abriata L, Devillard J, Vacle S, Marcaida M Nat Commun. 2024; 15(1):6273.
PMID: 39054322 PMC: 11272779. DOI: 10.1038/s41467-024-50571-y.
Substitution Models of Protein Evolution with Selection on Enzymatic Activity.
Ferreiro D, Khalil R, Sousa S, Arenas M Mol Biol Evol. 2024; 41(2).
PMID: 38314876 PMC: 10873502. DOI: 10.1093/molbev/msae026.
Machine Learning for Efficient Prediction of Protein Redox Potential: The Flavoproteins Case.
Galuzzi B, Mirarchi A, Vigano E, De Gioia L, Damiani C, Arrigoni F J Chem Inf Model. 2022; 62(19):4748-4759.
PMID: 36126254 PMC: 9554915. DOI: 10.1021/acs.jcim.2c00858.