Prediction of the Surface-interior Diagram of Globular Proteins by an Empirical Method
Overview
Authors
Affiliations
The number of amino acid residues in contact with a residue in a globular protein is a simple and good measure to show the relative location of the residue on the surface or in the interior of the protein. The contact number is estimated as the number of C alpha atoms within a sphere of radius r (8 A) centered at the C alpha atom of a given residue. The prediction of a diagram (the plot of the contact number against the residue number) from a given amino acid sequence may be meaningful as an alternative to the secondary-structure prediction currently performed. Parameter values are determined empirically using the observed contact numbers calculated from known structures of 39 proteins. In order to assess the real efficiency of the method, the prediction has been performed in the following way; all the proteins are divided into two groups; one group is used to derive parameter sets and the other serves to test the prediction accuracy. The test reveals that the parameter sets empirically determined are biased significantly towards the data base, the extent of which is roughly proportional to the number of parameter terms included. The results show that an adequate smoothing of a parameter set is the best way to reduce the extent of biasing towards the data base and to give the best prediction for 'unknown' proteins. The prediction accuracy finally obtained is about 0.4 (or roughly 70%), on the average, measured by the correlation coefficient between the predicted and observed diagrams. This value is of the same order as the accuracy in the current predictions of secondary structures.
Enhancing Sumoylation Site Prediction: A Deep Neural Network with Discriminative Features.
Khan S, Khan M, Iqbal N, Dilshad N, Almufareh M, Alsubaie N Life (Basel). 2023; 13(11).
PMID: 38004293 PMC: 10672286. DOI: 10.3390/life13112153.
Ji J, Carpentier B, Chakraborty A, Nangia S J Chem Theory Comput. 2023; 20(4):1656-1672.
PMID: 37018141 PMC: 10902853. DOI: 10.1021/acs.jctc.3c00106.
Hawley K, Montezuma-Rusca J, Delgado K, Singh N, Uversky V, Caimano M J Bacteriol. 2021; 203(15):e0008221.
PMID: 33972353 PMC: 8407342. DOI: 10.1128/JB.00082-21.
Enhancement of conformational B-cell epitope prediction using CluSMOTE.
Solihah B, Azhari A, Musdholifah A PeerJ Comput Sci. 2021; 6:e275.
PMID: 33816926 PMC: 7924438. DOI: 10.7717/peerj-cs.275.
Timmons P, Hewage C Sci Rep. 2020; 10(1):10869.
PMID: 32616760 PMC: 7331684. DOI: 10.1038/s41598-020-67701-3.