Alternate States of Proteins Revealed by Detailed Energy Landscape Mapping
Overview
Molecular Biology
Authors
Affiliations
What conformations do protein molecules populate in solution? Crystallography provides a high-resolution description of protein structure in the crystal environment, while NMR describes structure in solution but using less data. NMR structures display more variability, but is this because crystal contacts are absent or because of fewer data constraints? Here we report unexpected insight into this issue obtained through analysis of detailed protein energy landscapes generated by large-scale, native-enhanced sampling of conformational space with Rosetta@home for 111 protein domains. In the absence of tightly associating binding partners or ligands, the lowest-energy Rosetta models were nearly all <2.5 Å C(α)RMSD from the experimental structure; this result demonstrates that structure prediction accuracy for globular proteins is limited mainly by the ability to sample close to the native structure. While the lowest-energy models are similar to deposited structures, they are not identical; the largest deviations are most often in regions involved in ligand, quaternary, or crystal contacts. For ligand binding proteins, the low energy models may resemble the apo structures, and for oligomeric proteins, the monomeric assembly intermediates. The deviations between the low energy models and crystal structures largely disappear when landscapes are computed in the context of the crystal lattice or multimer. The computed low-energy ensembles, with tight crystal-structure-like packing in the core, but more NMR-structure-like variability in loops, may in some cases resemble the native state ensembles of proteins better than individual crystal or NMR structures, and can suggest experimentally testable hypotheses relating alternative states and structural heterogeneity to function.
In-cell Structure and Variability of Pyrenoid Rubisco.
Elad N, Hou Z, Dumoux M, Ramezani A, Perilla J, Zhang P bioRxiv. 2025; .
PMID: 40060630 PMC: 11888406. DOI: 10.1101/2025.02.27.640608.
TRain: T-cell receptor automated immunoinformatics.
Seamann A, Bennett-Boehm M, Ehrlich R, Gil A, Selin L, Ghersi D BMC Bioinformatics. 2025; 26(1):76.
PMID: 40050726 PMC: 11887255. DOI: 10.1186/s12859-025-06074-8.
Structure-based Design of Chimeric Influenza Hemagglutinins to Elicit Cross-group Immunity.
Castro K, Ayardulabi R, Wehrle S, Cui H, Georgeon S, Schmidt J bioRxiv. 2025; .
PMID: 40027756 PMC: 11870435. DOI: 10.1101/2024.12.17.628867.
Molecular basis of the CYFIP2 and NCKAP1 autism-linked variants in the WAVE regulatory complex.
Xie S, Zuo K, De Rubeis S, Ruggerone P, Carloni P Protein Sci. 2024; 34(1):e5238.
PMID: 39660913 PMC: 11632847. DOI: 10.1002/pro.5238.
EuDockScore: Euclidean graph neural networks for scoring protein-protein interfaces.
McFee M, Kim J, Kim P Bioinformatics. 2024; 40(11).
PMID: 39441796 PMC: 11543620. DOI: 10.1093/bioinformatics/btae636.