» Articles » PMID: 15146500

Prediction of the Bonding States of Cysteines Using the Support Vector Machines Based on Multiple Feature Vectors and Cysteine State Sequences

Overview
Journal Proteins
Date 2004 May 18
PMID 15146500
Citations 17
Authors
Affiliations
Soon will be listed here.
Abstract

The support vector machine (SVM) method is used to predict the bonding states of cysteines. Besides using local descriptors such as the local sequences, we include global information, such as amino acid compositions and the patterns of the states of cysteines (bonded or nonbonded), or cysteine state sequences, of the proteins. We found that SVM based on local sequences or global amino acid compositions yielded similar prediction accuracies for the data set comprising 4136 cysteine-containing segments extracted from 969 nonhomologous proteins. However, the SVM method based on multiple feature vectors (combining local sequences and global amino acid compositions) significantly improves the prediction accuracy, from 80% to 86%. If coupled with cysteine state sequences, SVM based on multiple feature vectors yields 90% in overall prediction accuracy and a 0.77 Matthews correlation coefficient, around 10% and 22% higher than the corresponding values obtained by SVM based on local sequence information.

Citing Articles

Predicting Anticancer Drug Resistance Mediated by Mutations.

Lin Y, Liu J, Chang Y, Yu C, Yi W, Lane H Pharmaceuticals (Basel). 2022; 15(2).

PMID: 35215249 PMC: 8878306. DOI: 10.3390/ph15020136.


The structure-based cancer-related single amino acid variation prediction.

Liu J, Yu C, Wu H, Chang Y, Lin C, Lu C Sci Rep. 2021; 11(1):13599.

PMID: 34193921 PMC: 8245468. DOI: 10.1038/s41598-021-92793-w.


ProLanGO: Protein Function Prediction Using Neural Machine Translation Based on a Recurrent Neural Network.

Cao R, Freitas C, Chan L, Sun M, Jiang H, Chen Z Molecules. 2017; 22(10).

PMID: 29039790 PMC: 6151571. DOI: 10.3390/molecules22101732.


Accurate disulfide-bonding network predictions improve ab initio structure prediction of cysteine-rich proteins.

Yang J, He B, Jang R, Zhang Y, Shen H Bioinformatics. 2015; 31(23):3773-81.

PMID: 26254435 PMC: 5898604. DOI: 10.1093/bioinformatics/btv459.


On the structural context and identification of enzyme catalytic residues.

Chien Y, Huang S Biomed Res Int. 2013; 2013:802945.

PMID: 23484160 PMC: 3581254. DOI: 10.1155/2013/802945.