» Articles » PMID: 34769173

Protein Design with Deep Learning

Overview
Journal Int J Mol Sci
Publisher MDPI
Date 2021 Nov 13
PMID 34769173
Citations 21
Authors
Affiliations
Soon will be listed here.
Abstract

Computational Protein Design (CPD) has produced impressive results for engineering new proteins, resulting in a wide variety of applications. In the past few years, various efforts have aimed at replacing or improving existing design methods using Deep Learning technology to leverage the amount of publicly available protein data. Deep Learning (DL) is a very powerful tool to extract patterns from raw data, provided that data are formatted as mathematical objects and the architecture processing them is well suited to the targeted problem. In the case of protein data, specific representations are needed for both the amino acid sequence and the protein structure in order to capture respectively 1D and 3D information. As no consensus has been reached about the most suitable representations, this review describes the representations used so far, discusses their strengths and weaknesses, and details their associated DL architecture for design and related tasks.

Citing Articles

IDP-Bert: Predicting Properties of Intrinsically Disordered Proteins Using Large Language Models.

Mollaei P, Sadasivam D, Guntuboina C, Barati Farimani A J Phys Chem B. 2024; 128(49):12030-12037.

PMID: 39586094 PMC: 11647883. DOI: 10.1021/acs.jpcb.4c02507.


Deep learning for discriminating non-trivial conformational changes in molecular dynamics simulations of SARS-CoV-2 spike-ACE2.

Moraes Dos Santos L, Gutembergue de Mendonca J, Jeronimo Gomes Lobo Y, Henrique Franca de Lima L, Bruno Rocha G, C de Melo-Minardi R Sci Rep. 2024; 14(1):22639.

PMID: 39349594 PMC: 11443059. DOI: 10.1038/s41598-024-72842-w.


Structure-based protein and small molecule generation using EGNN and diffusion models: A comprehensive review.

Soleymani F, Paquet E, Viktor H, Michalowski W Comput Struct Biotechnol J. 2024; 23:2779-2797.

PMID: 39050782 PMC: 11268121. DOI: 10.1016/j.csbj.2024.06.021.


SPDesign: protein sequence designer based on structural sequence profile using ultrafast shape recognition.

Wang H, Liu D, Zhao K, Wang Y, Zhang G Brief Bioinform. 2024; 25(3).

PMID: 38600663 PMC: 11006797. DOI: 10.1093/bib/bbae146.


Graphormer supervised de novo protein design method and function validation.

Mu J, Li Z, Zhang B, Zhang Q, Iqbal J, Wadood A Brief Bioinform. 2024; 25(3).

PMID: 38557677 PMC: 10982952. DOI: 10.1093/bib/bbae135.


References
1.
Simoncini D, Allouche D, de Givry S, Delmas C, Barbe S, Schiex T . Guaranteed Discrete Energy Optimization on Large Protein Design Problems. J Chem Theory Comput. 2015; 11(12):5980-9. DOI: 10.1021/acs.jctc.5b00594. View

2.
Shapovalov M, Dunbrack Jr R . A smoothed backbone-dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions. Structure. 2011; 19(6):844-58. PMC: 3118414. DOI: 10.1016/j.str.2011.03.019. View

3.
Adhikari B, Cheng J . CONFOLD2: improved contact-driven ab initio protein structure modeling. BMC Bioinformatics. 2018; 19(1):22. PMC: 5784681. DOI: 10.1186/s12859-018-2032-6. View

4.
Gao W, Mahajan S, Sulam J, Gray J . Deep Learning in Protein Structural Modeling and Design. Patterns (N Y). 2020; 1(9):100142. PMC: 7733882. DOI: 10.1016/j.patter.2020.100142. View

5.
Pearce R, Zhang Y . Deep learning techniques have significantly impacted protein structure prediction and protein design. Curr Opin Struct Biol. 2021; 68:194-207. PMC: 8222070. DOI: 10.1016/j.sbi.2021.01.007. View