» Articles » PMID: 37108059

Enhancing Conformational Sampling for Intrinsically Disordered and Ordered Proteins by Variational Autoencoder

Overview
Journal Int J Mol Sci
Publisher MDPI
Date 2023 Apr 28
PMID 37108059
Authors
Affiliations
Soon will be listed here.
Abstract

Intrinsically disordered proteins (IDPs) account for more than 50% of the human proteome and are closely associated with tumors, cardiovascular diseases, and neurodegeneration, which have no fixed three-dimensional structure under physiological conditions. Due to the characteristic of conformational diversity, conventional experimental methods of structural biology, such as NMR, X-ray diffraction, and CryoEM, are unable to capture conformational ensembles. Molecular dynamics (MD) simulation can sample the dynamic conformations at the atomic level, which has become an effective method for studying the structure and function of IDPs. However, the high computational cost prevents MD simulations from being widely used for IDPs conformational sampling. In recent years, significant progress has been made in artificial intelligence, which makes it possible to solve the conformational reconstruction problem of IDP with fewer computational resources. Here, based on short MD simulations of different IDPs systems, we use variational autoencoders (VAEs) to achieve the generative reconstruction of IDPs structures and include a wider range of sampled conformations from longer simulations. Compared with the generative autoencoder (AEs), VAEs add an inference layer between the encoder and decoder in the latent space, which can cover the conformational landscape of IDPs more comprehensively and achieve the effect of enhanced sampling. Through experimental verification, the Cα RMSD between VAE-generated and MD simulation sampling conformations in the 5 IDPs test systems was significantly lower than that of AE. The Spearman correlation coefficient on the structure was higher than that of AE. VAE can also achieve excellent performance regarding structured proteins. In summary, VAEs can be used to effectively sample protein structures.

Citing Articles

Sampling Conformational Ensembles of Highly Dynamic Proteins via Generative Deep Learning.

Ruzmetov T, Hung T, Jonnalagedda S, Chen S, Fasihianifard P, Guo Z Res Sq. 2024; .

PMID: 38978607 PMC: 11230488. DOI: 10.21203/rs.3.rs-4301803/v1.


Molecular simulations integrated with experiments for probing the interaction dynamics and binding mechanisms of intrinsically disordered proteins.

Ghosh C, Nagpal S, Munoz V Curr Opin Struct Biol. 2023; 84:102756.

PMID: 38118365 PMC: 11242915. DOI: 10.1016/j.sbi.2023.102756.


Phanto-IDP: compact model for precise intrinsically disordered protein backbone generation and enhanced sampling.

Zhu J, Li Z, Tong H, Lu Z, Zhang N, Wei T Brief Bioinform. 2023; 25(1).

PMID: 38018910 PMC: 10783862. DOI: 10.1093/bib/bbad429.

References
1.
Morar A, Olteanu A, Young G, Pielak G . Solvent-induced collapse of alpha-synuclein and acid-denatured cytochrome c. Protein Sci. 2001; 10(11):2195-9. PMC: 2374057. DOI: 10.1110/ps.24301. View

2.
Ketkaew R, Creazzo F, Luber S . Machine Learning-Assisted Discovery of Hidden States in Expanded Free Energy Space. J Phys Chem Lett. 2022; 13(7):1797-1805. DOI: 10.1021/acs.jpclett.1c04004. View

3.
Allison J . Computational methods for exploring protein conformations. Biochem Soc Trans. 2020; 48(4):1707-1724. PMC: 7458412. DOI: 10.1042/BST20200193. View

4.
Kang L, Janowska M, Moriarty G, Baum J . Mechanistic insight into the relationship between N-terminal acetylation of α-synuclein and fibril formation rates by NMR and fluorescence. PLoS One. 2013; 8(9):e75018. PMC: 3776725. DOI: 10.1371/journal.pone.0075018. View

5.
Song D, Liu H, Luo R, Chen H . Environment-Specific Force Field for Intrinsically Disordered and Ordered Proteins. J Chem Inf Model. 2020; 60(4):2257-2267. PMC: 10449432. DOI: 10.1021/acs.jcim.0c00059. View