» Articles » PMID: 32550764

Predicting Self-Interacting Proteins Using a Recurrent Neural Network and Protein Evolutionary Information

Overview
Publisher Sage Publications
Specialty Biology
Date 2020 Jun 20
PMID 32550764
Citations 1
Authors
Affiliations
Soon will be listed here.
Abstract

Self-interacting proteins (SIPs) play crucial roles in biological activities of organisms. Many high-throughput methods can be used to identify SIPs. However, these methods are both time-consuming and expensive. How to develop effective computational approaches for identifying SIPs is a challenging task. In the article, we present a novel computational method called RRN-SIFT, which combines the recurrent neural network (RNN) with scale invariant feature transform (SIFT) to predict SIPs based on protein evolutionary information. The main advantage of the proposed RNN-SIFT model is that it uses SIFT for extracting key feature by exploring the evolutionary information embedded in Position-Specific Iterated BLAST-constructed position-specific scoring matrix and employs an RNN classifier to perform classification based on extracted features. Extensive experiments show that the RRN-SIFT obtained average accuracy of 94.34% and 97.12% on the and dataset, respectively. We also compared our performance with the back propagation neural network (BPNN), the state-of-the-art support vector machine (SVM), and other existing methods. By comparing with experimental results, the performance of RNN-SIFT is significantly better than that of the BPNN, SVM, and other previous methods in the domain. Therefore, we conclude that the proposed RNN-SIFT model is a useful tool for predicting SIPs, as well to solve other bioinformatics tasks. To facilitate widely studies and encourage future proteomics research, a freely available web server called RNN-SIFT-SIPs was developed at http://219.219.62.123:8888/RNNSIFT/ including the source code and the SIP datasets.

Citing Articles

Robust and accurate prediction of self-interacting proteins from protein sequence information by exploiting weighted sparse representation based classifier.

Li Y, Hu X, You Z, Li L, Li P, Wang Y BMC Bioinformatics. 2022; 23(Suppl 7):518.

PMID: 36457083 PMC: 9713954. DOI: 10.1186/s12859-022-04880-y.

References
1.
Breuer K, Foroushani A, Laird M, Chen C, Sribnaia A, Lo R . InnateDB: systems biology of innate immunity and beyond--recent updates and continuing curation. Nucleic Acids Res. 2012; 41(Database issue):D1228-33. PMC: 3531080. DOI: 10.1093/nar/gks1147. View

2.
Katsamba P, Carroll K, Ahlsen G, Bahna F, Vendome J, Posy S . Linking molecular affinity and cellular specificity in cadherin-mediated adhesion. Proc Natl Acad Sci U S A. 2009; 106(28):11594-9. PMC: 2710653. DOI: 10.1073/pnas.0905349106. View

3.
Liu Z, Guo F, Zhang J, Wang J, Lu L, Li D . Proteome-wide prediction of self-interacting proteins based on multiple properties. Mol Cell Proteomics. 2013; 12(6):1689-700. PMC: 3675823. DOI: 10.1074/mcp.M112.021790. View

4.
Hattori T, Ohoka N, Inoue Y, Hayashi H, Onozaki K . C/EBP family transcription factors are degraded by the proteasome but stabilized by forming dimer. Oncogene. 2003; 22(9):1273-80. DOI: 10.1038/sj.onc.1206204. View

5.
Xenarios I, Rice D, Salwinski L, Baron M, Marcotte E, Eisenberg D . DIP: the database of interacting proteins. Nucleic Acids Res. 1999; 28(1):289-91. PMC: 102387. DOI: 10.1093/nar/28.1.289. View