» Articles » PMID: 12577269

Real Value Prediction of Solvent Accessibility from Amino Acid Sequence

Overview
Journal Proteins
Date 2003 Feb 11
PMID 12577269
Citations 63
Authors
Affiliations
Soon will be listed here.
Abstract

The solvent accessibility of amino acid residues has been predicted in the past by classifying them into exposure states with varying thresholds. This classification provides a wide range of values for the accessible surface area (ASA) within which a residue may fall. Thus far, no attempt has been made to predict real values of ASA from the sequence information without a priori classification into exposure states. Here, we present a new method with which to predict real value ASAs for residues, based on neighborhood information. Our real value prediction neural network could estimate the ASA for four different nonhomologous, nonredundant data sets of varying size, with 18.0-19.5% mean absolute error, defined as per residue absolute difference between the predicted and experimental values of relative ASA. Correlation between the predicted and experimental values ranged from 0.47 to 0.50. It was observed that the ASA of a residue could be predicted within a 23.7% mean absolute error, even when no information about its neighbors is included. Prediction of real values answers the issue of arbitrary choice of ASA state thresholds, and carries more information than category prediction. Prediction error for each residue type strongly correlates with the variability in its experimental ASA values.

Citing Articles

Advances in the Application of Protein Language Modeling for Nucleic Acid Protein Binding Site Prediction.

Wang B, Li W Genes (Basel). 2024; 15(8).

PMID: 39202449 PMC: 11353971. DOI: 10.3390/genes15081090.


Information quantity for secondary structure propensities of protein subsequences in the Protein Data Bank.

Kondo R, Kasahara K, Takahashi T Biophys Physicobiol. 2022; 19:1-12.

PMID: 35532457 PMC: 8926306. DOI: 10.2142/biophysico.bppb-v19.0002.


The influence of dataset homology and a rigorous evaluation strategy on protein secondary structure prediction.

Chen T, Lo C, Juan S, Lo W PLoS One. 2021; 16(7):e0254555.

PMID: 34260641 PMC: 8279362. DOI: 10.1371/journal.pone.0254555.


Multifaceted analysis of training and testing convolutional neural networks for protein secondary structure prediction.

Shapovalov M, Dunbrack Jr R, Vucetic S PLoS One. 2020; 15(5):e0232528.

PMID: 32374785 PMC: 7202669. DOI: 10.1371/journal.pone.0232528.


SXGBsite: Prediction of Protein-Ligand Binding Sites Using Sequence Information and Extreme Gradient Boosting.

Zhao Z, Xu Y, Zhao Y Genes (Basel). 2019; 10(12).

PMID: 31771119 PMC: 6947422. DOI: 10.3390/genes10120965.