Prediction of Protein-protein Interactions Between Viruses and Human by an SVM Model

Overview
Publisher BioMed Central
Specialty Biology
Date 2012 May 19
PMID 22595002
Citations 35
Abstract

Background: Several computational methods have been developed to predict protein-protein interactions from amino acid sequences, but most are intended for interactions within a single species rather than interactions across species. Methods designed for within-species interactions are not appropriate for finding cross-species interactions, since they do not distinguish interactions between proteins of the same species from those between proteins of different species.

Results: We developed a new method for representing a protein sequence of variable length as a frequency vector of fixed length, which encodes the relative frequency of each run of three consecutive amino acids in the sequence. We built a support vector machine (SVM) model to predict human proteins that interact with virus proteins. For two types of viruses, human papillomaviruses (HPV) and hepatitis C virus (HCV), our SVM model achieved an average accuracy above 80%, which is higher than that of another SVM model with a different representation scheme. Using the SVM model and Gene Ontology (GO) annotations of proteins, we predicted new interactions between virus proteins and human proteins.
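
The fixed-length encoding described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the 20 standard amino acids (giving 20^3 = 8000 features), overlapping triplets, and normalization by the total number of counted triplets. The abstract does not specify the alphabet, windowing, or normalization used in the paper, and the names `triplet_frequency_vector`, `TRIPLETS`, and `TRIPLET_INDEX` are hypothetical.

```python
from collections import Counter
from itertools import product

# Assumption: the 20 standard amino acids, one letter each.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Every possible triplet in a fixed order, so each feature has a stable position.
TRIPLETS = ["".join(t) for t in product(AMINO_ACIDS, repeat=3)]
TRIPLET_INDEX = {t: i for i, t in enumerate(TRIPLETS)}


def triplet_frequency_vector(sequence):
    """Map a protein sequence of any length to an 8000-dimensional vector
    of relative frequencies of overlapping amino acid triplets."""
    seq = sequence.upper()
    # Slide a window of width 3 along the sequence, one residue at a time.
    counts = Counter(seq[i:i + 3] for i in range(len(seq) - 2))

    vector = [0.0] * len(TRIPLETS)
    total = 0
    for triplet, n in counts.items():
        idx = TRIPLET_INDEX.get(triplet)
        if idx is None:
            # Skip triplets containing non-standard symbols (e.g. X or B).
            continue
        vector[idx] = float(n)
        total += n

    # Convert counts to relative frequencies; sequences with no valid
    # triplet (shorter than 3 residues) stay as the all-zero vector.
    if total:
        vector = [v / total for v in vector]
    return vector


short = triplet_frequency_vector("ACDACD")
long = triplet_frequency_vector("MKT" * 200)
print(len(short), len(long))  # both vectors have the same length
```

Because the vector length depends only on the alphabet, proteins of different lengths, whether viral or human, land in the same feature space, which is what allows a single classifier such as an SVM to be trained on them.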

Conclusions: Encoding the relative frequency of amino acid triplets of a protein sequence is a simple yet powerful representation method for predicting protein-protein interactions across different species. The representation method has several advantages: (1) it enables a prediction model to achieve better performance than with other representations, (2) it generates feature vectors of fixed length regardless of the sequence length, and (3) the same representation is applicable to different types of proteins.

Citing Articles

Prediction of influenza A virus-human protein-protein interactions using XGBoost with continuous and discontinuous amino acids information.

Li B, Li X, Li X, Wang L, Lu J, Wang J. PeerJ. 2025; 13:e18863.

PMID: 39897484 PMC: 11787804. DOI: 10.7717/peerj.18863.


HBFormer: a single-stream framework based on hybrid attention mechanism for identification of human-virus protein-protein interactions.

Zhang L, Wang S, Wang Y, Zhao T. Bioinformatics. 2024; 40(12).

PMID: 39673490 PMC: 11648999. DOI: 10.1093/bioinformatics/btae724.


Prediction of Human Papillomavirus-Host Oncoprotein Interactions Using Deep Learning.

Santa S, Kwofie S, Agyenkwa-Mawuli K, Quaye O, Brown C, Tagoe E. Bioinform Biol Insights. 2024; 18:11779322241304666.

PMID: 39664297 PMC: 11632871. DOI: 10.1177/11779322241304666.


Bioinformatic Resources for Exploring Human-virus Protein-protein Interactions Based on Binding Modes.

Chen H, Liu J, Tang G, Hao G, Yang G. Genomics Proteomics Bioinformatics. 2024; 22(5).

PMID: 39404802 PMC: 11658832. DOI: 10.1093/gpbjnl/qzae075.


Oral_voting_transfer: classification of oral microorganisms' function proteins with voting transfer model.

Bao W, Liu Y, Chen B. Front Microbiol. 2024; 14:1277121.

PMID: 38384719 PMC: 10879614. DOI: 10.3389/fmicb.2023.1277121.

