Protein Structure Prediction Using Deep Learning Distance and Hydrogen-bonding Restraints in CASP14
Overview
Authors
Affiliations
In this article, we report 3D structure prediction results by two of our best server groups ("Zhang-Server" and "QUARK") in CASP14. These two servers were built based on the D-I-TASSER and D-QUARK algorithms, which integrated four newly developed components into the classical protein folding pipelines, I-TASSER and QUARK, respectively. The new components include: (a) a new multiple sequence alignment (MSA) collection tool, DeepMSA2, which is extended from the DeepMSA program; (b) a contact-based domain boundary prediction algorithm, FUpred, to detect protein domain boundaries; (c) a residual convolutional neural network-based method, DeepPotential, to predict multiple spatial restraints by co-evolutionary features derived from the MSA; and (d) optimized spatial restraint energy potentials to guide the structure assembly simulations. For 37 FM targets, the average TM-scores of the first models produced by D-I-TASSER and D-QUARK were 96% and 112% higher than those constructed by I-TASSER and QUARK, respectively. The data analysis indicates noticeable improvements produced by each of the four new components, especially for the newly added spatial restraints from DeepPotential and the well-tuned force field that combines spatial restraints, threading templates, and generic knowledge-based potentials. However, challenges still exist in the current pipelines. These include difficulties in modeling multi-domain proteins due to low accuracy in inter-domain distance prediction and modeling protein domains from oligomer complexes, as the co-evolutionary analysis cannot distinguish inter-chain and intra-chain distances. Specifically tuning the deep learning-based predictors for multi-domain targets and protein complexes may be helpful to address these issues.
In silico design of multi-epitope adhesin protein vaccines.
Pillay K, Chiliza T, Senzani S, Pillay B, Pillay M Heliyon. 2024; 10(18):e37536.
PMID: 39323805 PMC: 11422057. DOI: 10.1016/j.heliyon.2024.e37536.
Computational Approaches to Predict Protein-Protein Interactions in Crowded Cellular Environments.
Grassmann G, Miotto M, Desantis F, Di Rienzo L, Tartaglia G, Pastore A Chem Rev. 2024; 124(7):3932-3977.
PMID: 38535831 PMC: 11009965. DOI: 10.1021/acs.chemrev.3c00550.
Recent Progress of Protein Tertiary Structure Prediction.
Wuyun Q, Chen Y, Shen Y, Cao Y, Hu G, Cui W Molecules. 2024; 29(4).
PMID: 38398585 PMC: 10893003. DOI: 10.3390/molecules29040832.
Roy B, Choi J, Fuchs M Biomolecules. 2024; 14(1).
PMID: 38254661 PMC: 10813169. DOI: 10.3390/biom14010062.
Yin R, Pierce B Protein Sci. 2023; 33(1):e4865.
PMID: 38073135 PMC: 10751731. DOI: 10.1002/pro.4865.