» Articles » PMID: 31005579

End-to-End Differentiable Learning of Protein Structure

Overview
Journal Cell Syst
Publisher Cell Press
Date 2019 Apr 22
PMID 31005579
Citations 124
Authors
Affiliations
Soon will be listed here.
Abstract

Predicting protein structure from sequence is a central challenge of biochemistry. Co-evolution methods show promise, but an explicit sequence-to-structure map remains elusive. Advances in deep learning that replace complex, human-designed pipelines with differentiable models optimized end to end suggest the potential benefits of similarly reformulating structure prediction. Here, we introduce an end-to-end differentiable model for protein structure learning. The model couples local and global protein structure via geometric units that optimize global geometry without violating local covalent chemistry. We test our model using two challenging tasks: predicting novel folds without co-evolutionary data and predicting known folds without structural templates. In the first task, the model achieves state-of-the-art accuracy, and in the second, it comes within 1-2 Å; competing methods using co-evolution and experimental templates have been refined over many years, and it is likely that the differentiable approach has substantial room for further improvement, with applications ranging from drug discovery to protein design.

Citing Articles

Advances in artificial intelligence-based technologies for increasing the quality of medical products.

Srivastava N, Verma S, Singh A, Shukla P, Singh Y, Oza A Daru. 2024; 33(1):1.

PMID: 39613923 PMC: 11607247. DOI: 10.1007/s40199-024-00548-5.


Beyond AlphaFold2: The Impact of AI for the Further Improvement of Protein Structure Prediction.

Genc A, McGuffin L Methods Mol Biol. 2024; 2867:121-139.

PMID: 39576578 DOI: 10.1007/978-1-0716-4196-5_7.


How the technologies behind self-driving cars, social networks, ChatGPT, and DALL-E2 are changing structural biology.

Bochtler M Bioessays. 2024; 47(1):e2400155.

PMID: 39404756 PMC: 11662154. DOI: 10.1002/bies.202400155.


MCNN_MC: Computational Prediction of Mitochondrial Carriers and Investigation of Bongkrekic Acid Toxicity Using Protein Language Models and Convolutional Neural Networks.

Malik M, Chang Y, Liu Y, Le V, Ou Y J Chem Inf Model. 2024; 64(24):9125-9134.

PMID: 39133248 PMC: 11683872. DOI: 10.1021/acs.jcim.4c00961.


Structure-based protein and small molecule generation using EGNN and diffusion models: A comprehensive review.

Soleymani F, Paquet E, Viktor H, Michalowski W Comput Struct Biotechnol J. 2024; 23:2779-2797.

PMID: 39050782 PMC: 11268121. DOI: 10.1016/j.csbj.2024.06.021.


References
1.
Xu J, Zhang Y . How significant is a protein structure similarity with TM-score = 0.5?. Bioinformatics. 2010; 26(7):889-95. PMC: 2913670. DOI: 10.1093/bioinformatics/btq066. View

2.
Kryshtafovych A, Monastyrskyy B, Fidelis K, Moult J, Schwede T, Tramontano A . Evaluation of the template-based modeling in CASP12. Proteins. 2017; 86 Suppl 1:321-334. PMC: 5877821. DOI: 10.1002/prot.25425. View

3.
Leaver-Fay A, Tyka M, Lewis S, Lange O, Thompson J, Jacak R . ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules. Methods Enzymol. 2010; 487:545-74. PMC: 4083816. DOI: 10.1016/B978-0-12-381270-4.00019-6. View

4.
Hochreiter S, Schmidhuber J . Long short-term memory. Neural Comput. 1997; 9(8):1735-80. DOI: 10.1162/neco.1997.9.8.1735. View

5.
Schaarschmidt J, Monastyrskyy B, Kryshtafovych A, Bonvin A . Assessment of contact predictions in CASP12: Co-evolution and deep learning coming of age. Proteins. 2017; 86 Suppl 1:51-66. PMC: 5820169. DOI: 10.1002/prot.25407. View