» Articles » PMID: 30598091

Hot Spot Prediction in Protein-protein Interactions by an Ensemble System

Overview
Journal BMC Syst Biol
Publisher Biomed Central
Specialty Biology
Date 2019 Jan 2
PMID 30598091
Citations 17
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Hot spot residues are functional sites in protein interaction interfaces. The identification of hot spot residues is time-consuming and laborious using experimental methods. In order to address the issue, many computational methods have been developed to predict hot spot residues. Moreover, most prediction methods are based on structural features, sequence characteristics, and/or other protein features.

Results: This paper proposed an ensemble learning method to predict hot spot residues that only uses sequence features and the relative accessible surface area of amino acid sequences. In this work, a novel feature selection technique was developed, an auto-correlation function combined with a sliding window technique was applied to obtain the characteristics of amino acid residues in protein sequence, and an ensemble classifier with SVM and KNN base classifiers was built to achieve the best classification performance.

Conclusion: The experimental results showed that our model yields the highest F1 score of 0.92 and an MCC value of 0.87 on ASEdb dataset. Compared with other machine learning methods, our model achieves a big improvement in hot spot prediction.

Availability: http://deeplearner.ahu.edu.cn/web/HotspotEL.htm .

Citing Articles

PPI-hotspot for detecting protein-protein interaction hot spots from the free protein structure.

Chen Y, Sargsyan K, Wright J, Chen Y, Huang Y, Lim C Elife. 2024; 13.

PMID: 39283314 PMC: 11405013. DOI: 10.7554/eLife.96643.


Oral_voting_transfer: classification of oral microorganisms' function proteins with voting transfer model.

Bao W, Liu Y, Chen B Front Microbiol. 2024; 14:1277121.

PMID: 38384719 PMC: 10879614. DOI: 10.3389/fmicb.2023.1277121.


Prediction of hot spots towards drug discovery by protein sequence embedding with 1D convolutional neural network.

Zhang Y, Yao S, Chen P PLoS One. 2023; 18(9):e0290899.

PMID: 37721924 PMC: 10506709. DOI: 10.1371/journal.pone.0290899.


RF_phage virion: Classification of phage virion proteins with a random forest model.

Zhang Y, Li Z Front Genet. 2023; 13:1103783.

PMID: 36846294 PMC: 9945117. DOI: 10.3389/fgene.2022.1103783.


Glycoprotein attachment with host cell surface receptor ephrin B2 and B3 in mediating entry of nipah and hendra virus: a computational investigation.

Priyadarsinee L, Sarma H, Sastry G J Chem Sci (Bangalore). 2022; 134(4):114.

PMID: 36465097 PMC: 9685031. DOI: 10.1007/s12039-022-02110-9.


References
1.
Liu B, Wu H, Zhang D, Wang X, Chou K . Pse-Analysis: a python package for DNA/RNA and protein/ peptide sequence analysis based on pseudo components and kernel methods. Oncotarget. 2017; 8(8):13338-13343. PMC: 5355101. DOI: 10.18632/oncotarget.14524. View

2.
Romero R, Iglesias E, Borrajo L . A linear-RBF multikernel SVM to classify big text corpora. Biomed Res Int. 2015; 2015:878291. PMC: 4386713. DOI: 10.1155/2015/878291. View

3.
Liu Q, Ren J, Song J, Li J . Co-Occurring Atomic Contacts for the Characterization of Protein Binding Hot Spots. PLoS One. 2015; 10(12):e0144486. PMC: 4684219. DOI: 10.1371/journal.pone.0144486. View

4.
Marsh J, Teichmann S . Relative solvent accessible surface area predicts protein conformational changes upon binding. Structure. 2011; 19(6):859-67. PMC: 3145976. DOI: 10.1016/j.str.2011.03.010. View

5.
Tuncbag N, Gursoy A, Keskin O . Identification of computational hot spots in protein interfaces: combining solvent accessibility and inter-residue potentials improves the accuracy. Bioinformatics. 2009; 25(12):1513-20. DOI: 10.1093/bioinformatics/btp240. View