SPIN2: Predicting Sequence Profiles from Protein Structures Using Deep Neural Networks
Overview
Designing protein sequences that fold into a given structure is known as the inverse protein-folding problem. An important benchmark for a protein-design program is its ability to recover wild-type sequences from their native backbone structures. The highest average sequence identity achieved by current protein-design programs on this task is around 30%, obtained by our previous system, SPIN. SPIN predicts sequences compatible with a provided structure using a neural network with fragment-based local and energy-based nonlocal profiles. Our new model, SPIN2, improves on SPIN by using a deep neural network and additional structural features. SPIN2 achieves over 34% sequence recovery in 10-fold cross-validation and independent tests, an improvement of about 4 percentage points over the previous version. The sequence profiles generated by SPIN2 are expected to be useful for improving existing fold-recognition and protein-design techniques. SPIN2 is available at http://sparks-lab.org.
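The sequence-recovery figures quoted above can be read as the fraction of positions at which the top-scoring residue of a predicted profile matches the wild-type residue. The following is a minimal sketch under that assumption. The function names, the alphabet ordering, and the profile layout (one row of 20 scores per position) are illustrative choices, not SPIN2's actual code or output format.

```python
# Illustrative only: the alphabet order and profile layout are assumptions,
# not the format produced by SPIN or SPIN2.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def profile_to_sequence(profile):
    """Pick the highest-scoring residue at each position of a profile.

    `profile` is a list of rows, one per position, each holding 20 scores
    ordered as in AMINO_ACIDS.
    """
    residues = []
    for row in profile:
        if len(row) != len(AMINO_ACIDS):
            raise ValueError("each profile row must hold 20 scores")
        best_index = max(range(len(row)), key=row.__getitem__)
        residues.append(AMINO_ACIDS[best_index])
    return "".join(residues)


def sequence_recovery(predicted, native):
    """Return the fraction of positions where predicted equals native."""
    if len(predicted) != len(native):
        raise ValueError("sequences must have the same length")
    if not native:
        raise ValueError("sequences must not be empty")
    matches = sum(p == n for p, n in zip(predicted, native))
    return matches / len(native)


def one_hot_row(residue):
    """Build a toy profile row that puts all weight on one residue."""
    return [1.0 if aa == residue else 0.0 for aa in AMINO_ACIDS]


# Toy example: the profile favours A, C, D, F; the wild type is ACDE.
toy_profile = [one_hot_row(r) for r in "ACDF"]
predicted_sequence = profile_to_sequence(toy_profile)
recovery = sequence_recovery(predicted_sequence, "ACDE")
```

In this toy case three of four positions match, so `recovery` is 0.75. A reported benchmark value would be an average of this quantity over many proteins, for example over the held-out fold in each round of cross-validation.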