» Articles » PMID: 29297299

A Sparse Autoencoder-based Deep Neural Network for Protein Solvent Accessibility and Contact Number Prediction

Overview
Publisher Biomed Central
Specialty Biology
Date 2018 Jan 4
PMID 29297299
Citations 6
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Direct prediction of the three-dimensional (3D) structures of proteins from one-dimensional (1D) sequences is a challenging problem. Significant structural characteristics such as solvent accessibility and contact number are essential for deriving restrains in modeling protein folding and protein 3D structure. Thus, accurately predicting these features is a critical step for 3D protein structure building.

Results: In this study, we present DeepSacon, a computational method that can effectively predict protein solvent accessibility and contact number by using a deep neural network, which is built based on stacked autoencoder and a dropout method. The results demonstrate that our proposed DeepSacon achieves a significant improvement in the prediction quality compared with the state-of-the-art methods. We obtain 0.70 three-state accuracy for solvent accessibility, 0.33 15-state accuracy and 0.74 Pearson Correlation Coefficient (PCC) for the contact number on the 5729 monomeric soluble globular protein dataset. We also evaluate the performance on the CASP11 benchmark dataset, DeepSacon achieves 0.68 three-state accuracy and 0.69 PCC for solvent accessibility and contact number, respectively.

Conclusions: We have shown that DeepSacon can reliably predict solvent accessibility and contact number with stacked sparse autoencoder and a dropout approach.

Citing Articles

Co-AMPpred for in silico-aided predictions of antimicrobial peptides by integrating composition-based features.

Singh O, Hsu W, Su E BMC Bioinformatics. 2021; 22(1):389.

PMID: 34330209 PMC: 8325260. DOI: 10.1186/s12859-021-04305-2.


A Novel Approach for Drug-Target Interactions Prediction Based on Multimodal Deep Autoencoder.

Wang H, Wang J, Dong C, Lian Y, Liu D, Yan Z Front Pharmacol. 2020; 10:1592.

PMID: 32047432 PMC: 6997437. DOI: 10.3389/fphar.2019.01592.


Diagnosis of mesothelioma with deep learning.

Hu X, Yu Z Oncol Lett. 2019; 17(2):1483-1490.

PMID: 30675203 PMC: 6341823. DOI: 10.3892/ol.2018.9761.


Protein-protein interaction sites prediction by ensemble random forests with synthetic minority oversampling technique.

Wang X, Yu B, Ma A, Chen C, Liu B, Ma Q Bioinformatics. 2018; 35(14):2395-2402.

PMID: 30520961 PMC: 6612859. DOI: 10.1093/bioinformatics/bty995.


Machine Learning Approaches for Protein⁻Protein Interaction Hot Spot Prediction: Progress and Comparative Assessment.

Liu S, Liu C, Deng L Molecules. 2018; 23(10).

PMID: 30287797 PMC: 6222875. DOI: 10.3390/molecules23102535.


References
1.
Fariselli P, Casadio R . Prediction of the number of residue contacts in proteins. Proc Int Conf Intell Syst Mol Biol. 2000; 8:146-51. View

2.
Kim H, Park H . Prediction of protein relative solvent accessibility with support vector machines and long-range interaction 3D local descriptor. Proteins. 2004; 54(3):557-62. DOI: 10.1002/prot.10602. View

3.
Pollastri G, Baldi P, Fariselli P, Casadio R . Prediction of coordination number and relative solvent accessibility in proteins. Proteins. 2002; 47(2):142-53. DOI: 10.1002/prot.10069. View

4.
Adamczak R, Porollo A, Meller J . Combining prediction of secondary structure and solvent accessibility in proteins. Proteins. 2005; 59(3):467-75. DOI: 10.1002/prot.20441. View

5.
Ma J, Wang S . AcconPred: Predicting Solvent Accessibility and Contact Number Simultaneously by a Multitask Learning Framework under the Conditional Neural Fields Model. Biomed Res Int. 2015; 2015:678764. PMC: 4538422. DOI: 10.1155/2015/678764. View