» Articles » PMID: 27792167

Predicting Protein-Protein Interaction Sites Using Sequence Descriptors and Site Propensity of Neighboring Amino Acids

Overview
Journal Int J Mol Sci
Publisher MDPI
Date 2016 Oct 30
PMID 27792167
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Information about the interface sites of Protein-Protein Interactions (PPIs) is useful for many biological research works. However, despite the advancement of experimental techniques, the identification of PPI sites still remains as a challenging task. Using a statistical learning technique, we proposed a computational tool for predicting PPI interaction sites. As an alternative to similar approaches requiring structural information, the proposed method takes all of the input from protein sequences. In addition to typical sequence features, our method takes into consideration that interaction sites are not randomly distributed over the protein sequence. We characterized this positional preference using protein complexes with known structures, proposed a numerical index to estimate the propensity and then incorporated the index into a learning system. The resulting predictor, without using structural information, yields an area under the ROC curve (AUC) of 0.675, recall of 0.597, precision of 0.311 and accuracy of 0.583 on a ten-fold cross-validation experiment. This performance is comparable to the previous approach in which structural information was used. Upon introducing the B-factor data to our predictor, we demonstrated that the AUC can be further improved to 0.750. The tool is accessible at http://bsaltools.ym.edu.tw/predppis.

Citing Articles

A Deep Learning and XGBoost-Based Method for Predicting Protein-Protein Interaction Sites.

Wang P, Zhang G, Yu Z, Huang G Front Genet. 2021; 12:752732.

PMID: 34764983 PMC: 8576272. DOI: 10.3389/fgene.2021.752732.


Next Generation Techniques for Determination of Protein-Protein Interactions: Beyond the Crystal Structure.

Carter R, Luchini A, Liotta L, Haymond A Curr Pathobiol Rep. 2020; 7(3):61-71.

PMID: 33094031 PMC: 7577580. DOI: 10.1007/s40139-019-00198-2.


Developing Computational Model to Predict Protein-Protein Interaction Sites Based on the XGBoost Algorithm.

Deng A, Zhang H, Wang W, Zhang J, Fan D, Chen P Int J Mol Sci. 2020; 21(7).

PMID: 32218345 PMC: 7178137. DOI: 10.3390/ijms21072274.


Prediction of Protein-Protein Interaction Sites Using Convolutional Neural Network and Improved Data Sets.

Xie Z, Deng X, Shu K Int J Mol Sci. 2020; 21(2).

PMID: 31940793 PMC: 7013409. DOI: 10.3390/ijms21020467.


Evolution of In Silico Strategies for Protein-Protein Interaction Drug Discovery.

Macalino S, Basith S, Clavio N, Chang H, Kang S, Choi S Molecules. 2018; 23(8).

PMID: 30082644 PMC: 6222862. DOI: 10.3390/molecules23081963.

References
1.
Liu Z, Xiao X, Yu D, Jia J, Qiu W, Chou K . pRNAm-PC: Predicting N(6)-methyladenosine sites in RNA sequences via physical-chemical properties. Anal Biochem. 2016; 497:60-7. DOI: 10.1016/j.ab.2015.12.017. View

2.
Das R, Dimitrova N, Xuan Z, Rollins R, Haghighi F, Edwards J . Computational prediction of methylation status in human genomic sequences. Proc Natl Acad Sci U S A. 2006; 103(28):10713-6. PMC: 1502297. DOI: 10.1073/pnas.0602949103. View

3.
Nooren I, Thornton J . Structural characterisation and functional significance of transient protein-protein interactions. J Mol Biol. 2003; 325(5):991-1018. DOI: 10.1016/s0022-2836(02)01281-0. View

4.
Rao V, Srinivas K, Sujini G, Kumar G . Protein-protein interaction detection: methods and analysis. Int J Proteomics. 2014; 2014:147648. PMC: 3947875. DOI: 10.1155/2014/147648. View

5.
Jia J, Liu Z, Xiao X, Liu B, Chou K . Identification of protein-protein binding sites by incorporating the physicochemical properties and stationary wavelet transforms into pseudo amino acid composition. J Biomol Struct Dyn. 2015; 34(9):1946-61. DOI: 10.1080/07391102.2015.1095116. View