
Sonification Based Protein Design Using Artificial Intelligence, Structure Prediction, and Analysis Using Molecular Modeling

Overview
Journal: APL Bioeng
Date: 2020 Mar 25
PMID: 32206742
Citations: 9
Abstract

We report the use of a deep learning model to design proteins, based on the interplay of elementary building blocks via hierarchical patterns. The approach translates protein sequences and structural information into a musical score that assigns a distinct pitch to each amino acid, while variations in note length and note volume reflect secondary structure, chain length, and distinct protein molecules. We train a deep learning model, whose architecture is composed of several long short-term memory units, on musical representations of proteins classified by certain features, focusing here on alpha-helix-rich proteins. Using the trained model, we then generate musical scores and translate the pitch information and chain lengths into sequences of amino acids. We use the Basic Local Alignment Search Tool (BLAST) to compare the predicted amino acid sequences against known proteins, and estimate folded protein structures using the Optimized protein fold RecognitION method (ORION) and MODELLER. We find that the proposed method can be used to design proteins that do not yet exist, and that the designed proteins fold into the specified secondary structures. We validate the newly predicted protein by molecular dynamics equilibration in explicit water and subsequent characterization using normal mode analysis. The method provides a tool to design novel protein materials that could find useful applications in biology, medicine, and engineering.
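
The sequence-to-score translation described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not state which pitch is assigned to which amino acid or how note lengths are chosen, so the mapping below (consecutive MIDI note numbers starting at 48, and the durations in `DURATION_OF`) is purely hypothetical, as are the names `Note`, `encode`, and `decode`. Note volume, chain-length encoding, and the neural network itself are omitted.

```python
from dataclasses import dataclass

# The 20 standard amino acids, one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# ASSUMPTION: one distinct pitch per residue, here consecutive MIDI note
# numbers starting at 48. The real pitch assignment is not given in the abstract.
BASE_PITCH = 48
PITCH_OF = {aa: BASE_PITCH + i for i, aa in enumerate(AMINO_ACIDS)}
RESIDUE_OF = {pitch: aa for aa, pitch in PITCH_OF.items()}

# ASSUMPTION: note length (in beats) per secondary-structure label.
# H = helix, E = strand, C = coil/other. Values are illustrative only.
DURATION_OF = {"H": 0.5, "E": 1.0, "C": 0.25}


@dataclass(frozen=True)
class Note:
    pitch: int       # encodes the amino acid
    duration: float  # encodes the secondary structure


def encode(sequence: str, structure: str) -> list[Note]:
    """Turn a sequence and its per-residue structure labels into notes."""
    if len(sequence) != len(structure):
        raise ValueError("sequence and structure must have equal length")
    return [Note(PITCH_OF[aa], DURATION_OF[ss])
            for aa, ss in zip(sequence, structure)]


def decode(notes: list[Note]) -> str:
    """Recover the amino acid sequence from the pitches of a score."""
    return "".join(RESIDUE_OF[note.pitch] for note in notes)


score = encode("MKLA", "CHHH")
recovered = decode(score)
```

Because each residue has its own pitch, decoding is a simple reverse lookup, which is what allows a generated score to be read back as a candidate amino acid sequence.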

Citing Articles

ProtAgents: protein discovery large language model multi-agent collaborations combining physics and machine learning.

Ghafarollahi A, Buehler M. Digit Discov. 2024; 3(7):1389-1409.

PMID: 38993729 PMC: 11235180. DOI: 10.1039/d4dd00013g.


Current Advancement and Future Prospects: Biomedical Nanoengineering.

Singh S, Sahani H. Curr Radiopharm. 2023; 17(2):120-137.

PMID: 38058099 DOI: 10.2174/0118744710274376231123063135.


Editorial: Sonification, aesthetic representation of physical quantities.

Minciacchi D, Bravi R, Rosenboom D. Front Neurosci. 2023; 17:1162383.

PMID: 37008216 PMC: 10064135. DOI: 10.3389/fnins.2023.1162383.


Protein Science Meets Artificial Intelligence: A Systematic Review and a Biochemical Meta-Analysis of an Inter-Field.

Villalobos-Alva J, Ochoa-Toledo L, Villalobos-Alva M, Aliseda A, Perez-Escamirosa F, Altamirano-Bustamante N. Front Bioeng Biotechnol. 2022; 10:788300.

PMID: 35875501 PMC: 9301016. DOI: 10.3389/fbioe.2022.788300.


Construction of Music Intelligent Creation Model Based on Convolutional Neural Network.

Chen J. Comput Intell Neurosci. 2022; 2022:2854066.

PMID: 35837219 PMC: 9276503. DOI: 10.1155/2022/2854066.

