PMID: 37995286

DeepFold: Enhancing Protein Structure Prediction Through Optimized Loss Functions, Improved Template Features, and Re-optimized Energy Function

Abstract

Motivation: Predicting protein structures with high accuracy is a critical challenge for the life sciences community and for industry. Despite the progress made by deep neural networks such as AlphaFold2, the quality of structural details, such as side chains, still needs to improve alongside that of the protein backbone.

Results: Building on AlphaFold2, we modified the loss functions for side-chain torsion angles and frame-aligned point error, added loss functions for side-chain confidence and secondary structure prediction, and replaced template feature generation with a new alignment method based on conditional random fields. We also performed re-optimization by conformational space annealing, using a molecular mechanics energy function that integrates the potential energies obtained from distogram and side-chain prediction. In the CASP15 blind test for single protein and domain modeling (109 domains), DeepFold ranked fourth among 132 groups, with improvements in structural detail in terms of backbone, side-chain, and MolProbity measures. In terms of protein backbone accuracy, DeepFold achieved a median GDT-TS score of 88.64, compared with 85.88 for AlphaFold2. For TBM-easy/hard targets, DeepFold ranked at the top based on Z-scores for GDT-TS. These results show its practical value to the structural biology community, which demands highly accurate structures. In addition, a thorough analysis of 55 domains from 39 targets with publicly available structures indicates that DeepFold shows superior side-chain accuracy and MolProbity scores among the top-performing groups.

Availability and Implementation: DeepFold tools are open-source software available at https://github.com/newtonjoo/deepfold.
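
For readers unfamiliar with the GDT-TS score quoted in the Results, the following minimal sketch illustrates how the metric is commonly defined: the average, over distance cutoffs of 1, 2, 4, and 8 Å, of the percentage of Cα atoms in the model that lie within the cutoff of their positions in the reference structure. This is not the CASP evaluation code or anything taken from DeepFold. The function name and the toy coordinates are hypothetical, and the sketch assumes the two structures have already been superposed, whereas the standard procedure searches over many superpositions to maximize each percentage. Under that assumption, the value computed here is only a lower bound on the true score.

```python
import math

def gdt_ts_fixed_superposition(pred, ref, cutoffs=(1.0, 2.0, 4.0, 8.0)):
    """Simplified GDT-TS for two pre-superposed lists of C-alpha coordinates.

    Assumption: `pred` and `ref` are equal-length sequences of (x, y, z)
    tuples in angstroms, already aligned residue by residue and superposed.
    The real metric additionally optimizes the superposition per cutoff.
    """
    if len(pred) != len(ref) or len(ref) == 0:
        raise ValueError("pred and ref must be non-empty and of equal length")

    # Per-residue distance between predicted and reference C-alpha atoms.
    dists = [math.dist(p, r) for p, r in zip(pred, ref)]

    # Fraction of residues within each distance cutoff.
    fractions = [sum(d <= c for d in dists) / len(dists) for c in cutoffs]

    # Average the fractions and express the result on a 0-100 scale.
    return 100.0 * sum(fractions) / len(cutoffs)


# Toy example (hypothetical coordinates): four residues whose predicted
# positions deviate from the reference by 0.5, 1.5, 3.0 and 10.0 angstroms.
ref = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
pred = [(0.0, 0.5, 0.0), (3.8, 1.5, 0.0), (7.6, 3.0, 0.0), (11.4, 10.0, 0.0)]

print(gdt_ts_fixed_superposition(pred, ref))  # prints 56.25
```

In the toy example, one, two, three, and three of the four residues fall within the 1, 2, 4, and 8 Å cutoffs respectively, giving (25 + 50 + 75 + 75) / 4 = 56.25.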

Citing Articles

Novel lineage of anelloviruses with large genomes identified in dolphins.

De Koch M, Krupovic M, Fielding R, Smith K, Schiavone K, Hall K. J Virol. 2024; 99(1):e0137024.

PMID: 39665547 PMC: 11784456. DOI: 10.1128/jvi.01370-24.


Easy and accurate protein structure prediction using ColabFold.

Kim G, Lee S, Levy Karin E, Kim H, Moriwaki Y, Ovchinnikov S. Nat Protoc. 2024; 20(3):620-642.

PMID: 39402428 DOI: 10.1038/s41596-024-01060-5.


Natural history of eukaryotic DNA viruses with double jelly-roll major capsid proteins.

Krupovic M, Kuhn J, Fischer M, Koonin E. Proc Natl Acad Sci U S A. 2024; 121(23):e2405771121.

PMID: 38805295 PMC: 11161782. DOI: 10.1073/pnas.2405771121.


Natural history of eukaryotic DNA viruses with double jelly-roll major capsid proteins.

Krupovic M, Kuhn J, Fischer M, Koonin E. bioRxiv. 2024.

PMID: 38712159 PMC: 11071308. DOI: 10.1101/2024.03.18.585575.


PS-GO parametric protein search engine.

Mi Y, Marcu S, Tabirca S, Yallapragada V. Comput Struct Biotechnol J. 2024; 23:1499-1509.

PMID: 38633387 PMC: 11021831. DOI: 10.1016/j.csbj.2024.04.003.
