» Articles » PMID: 37695768

Pathfinder: Protein Folding Pathway Prediction Based on Conformational Sampling

Overview
Specialty Biology
Date 2023 Sep 11
PMID 37695768
Authors
Affiliations
Soon will be listed here.
Abstract

The study of protein folding mechanism is a challenge in molecular biology, which is of great significance for revealing the movement rules of biological macromolecules, understanding the pathogenic mechanism of folding diseases, and designing protein engineering materials. Based on the hypothesis that the conformational sampling trajectory contain the information of folding pathway, we propose a protein folding pathway prediction algorithm named Pathfinder. Firstly, Pathfinder performs large-scale sampling of the conformational space and clusters the decoys obtained in the sampling. The heterogeneous conformations obtained by clustering are named seed states. Then, a resampling algorithm that is not constrained by the local energy basin is designed to obtain the transition probabilities of seed states. Finally, protein folding pathways are inferred from the maximum transition probabilities of seed states. The proposed Pathfinder is tested on our developed test set (34 proteins). For 11 widely studied proteins, we correctly predicted their folding pathways and specifically analyzed 5 of them. For 13 proteins, we predicted their folding pathways to be further verified by biological experiments. For 6 proteins, we analyzed the reasons for the low prediction accuracy. For the other 4 proteins without biological experiment results, potential folding pathways were predicted to provide new insights into protein folding mechanism. The results reveal that structural analogs may have different folding pathways to express different biological functions, homologous proteins may contain common folding pathways, and α-helices may be more prone to early protein folding than β-strands.

Citing Articles

FoldPAthreader: predicting protein folding pathway using a novel folding force field model derived from known protein universe.

Zhao K, Zhao P, Wang S, Xia Y, Zhang G Genome Biol. 2024; 25(1):152.

PMID: 38862984 PMC: 11167914. DOI: 10.1186/s13059-024-03291-x.

References
1.
Selkoe D, Hardy J . The amyloid hypothesis of Alzheimer's disease at 25 years. EMBO Mol Med. 2016; 8(6):595-608. PMC: 4888851. DOI: 10.15252/emmm.201606210. View

2.
Baek M, DiMaio F, Anishchenko I, Dauparas J, Ovchinnikov S, Lee G . Accurate prediction of protein structures and interactions using a three-track neural network. Science. 2021; 373(6557):871-876. PMC: 7612213. DOI: 10.1126/science.abj8754. View

3.
Kmiecik S, Kolinski A . Folding pathway of the b1 domain of protein G explored by multiscale modeling. Biophys J. 2007; 94(3):726-36. PMC: 2186257. DOI: 10.1529/biophysj.107.116095. View

4.
Maxwell K, Wildes D, Zarrine-Afsar A, de los Rios M, Brown A, Friel C . Protein folding: defining a "standard" set of experimental conditions and a preliminary kinetic data set of two-state proteins. Protein Sci. 2005; 14(3):602-16. PMC: 2279278. DOI: 10.1110/ps.041205405. View

5.
Wang J, Panagiotou E . The protein folding rate and the geometry and topology of the native state. Sci Rep. 2022; 12(1):6384. PMC: 9013383. DOI: 10.1038/s41598-022-09924-0. View