Learning to Predict Protein-protein Interactions from Protein Sequences

Overview
Journal: Bioinformatics
Specialty: Biology
Date: 2003 Oct 14
PMID: 14555619
Citations: 52
Abstract

In order to understand the molecular machinery of the cell, we need to know about the multitude of protein-protein interactions that allow the cell to function. High-throughput technologies provide some data about these interactions, but so far that data is fairly noisy. Therefore, computational techniques for predicting protein-protein interactions could be of significant value. One approach to predicting interactions in silico is to produce from first principles a detailed model of a candidate interaction. We take an alternative approach, employing a relatively simple model that learns dynamically from a large collection of data. In this work, we describe an attraction-repulsion model, in which the interaction between a pair of proteins is represented as the sum of attractive and repulsive forces associated with small, domain- or motif-sized features along the length of each protein. The model is discriminative, learning simultaneously from known interactions and from pairs of proteins that are known (or suspected) not to interact. The model is efficient to compute and scales well to very large collections of data. In a cross-validated comparison using known yeast interactions, the attraction-repulsion method performs better than several competing techniques.
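
The attraction-repulsion idea can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the estimation procedure, so the smoothed log-odds weighting below is an assumption chosen for simplicity, and the names `feature_pairs`, `train`, `score`, and the toy feature labels are hypothetical. Each protein is assumed to be represented as a set of domain- or motif-sized feature labels. A feature pair seen more often among interacting pairs receives a positive (attractive) weight, one seen more often among non-interacting pairs receives a negative (repulsive) weight, and a candidate pair is scored by summing the weights of its feature pairs.

```python
from collections import Counter
from itertools import product
from math import log


def feature_pairs(a, b):
    """Return the set of unordered feature pairs across proteins a and b."""
    return {tuple(sorted(p)) for p in product(a, b)}


def train(positives, negatives, pseudocount=1.0):
    """Estimate one weight per feature pair from labelled protein pairs.

    ASSUMPTION: weights are smoothed log-odds of a feature pair occurring
    in interacting versus non-interacting pairs. This stands in for the
    paper's learning rule, which the abstract does not specify.
    """
    pos, neg = Counter(), Counter()
    for a, b in positives:
        pos.update(feature_pairs(a, b))
    for a, b in negatives:
        neg.update(feature_pairs(a, b))

    n_pos, n_neg = len(positives), len(negatives)
    weights = {}
    for fp in set(pos) | set(neg):
        p = (pos[fp] + pseudocount) / (n_pos + 2 * pseudocount)
        q = (neg[fp] + pseudocount) / (n_neg + 2 * pseudocount)
        weights[fp] = log(p / q)  # > 0 attractive, < 0 repulsive
    return weights


def score(weights, a, b):
    """Sum attractive and repulsive weights over all feature pairs."""
    return sum(weights.get(fp, 0.0) for fp in feature_pairs(a, b))


# Toy data with made-up feature labels (hypothetical, for illustration only).
positives = [({"SH3"}, {"PRM"}), ({"SH3", "kinase"}, {"PRM"})]
negatives = [({"kinase"}, {"zinc"}), ({"SH3"}, {"zinc"})]

weights = train(positives, negatives)
attract = score(weights, {"SH3"}, {"PRM"})
repel = score(weights, {"kinase"}, {"zinc"})
print(f"SH3/PRM score: {attract:.3f}, kinase/zinc score: {repel:.3f}")
```

Because training only counts feature pairs once per protein pair, the cost grows linearly with the number of labelled pairs, which is in the spirit of the scalability claim in the abstract. Using both `positives` and `negatives` mirrors the discriminative setup described there.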

Citing Articles

Improved cytokine-receptor interaction prediction by exploiting the negative sample space.

Nath A, Leier A. BMC Bioinformatics. 2020; 21(1):493.

PMID: 33129275 PMC: 7603689. DOI: 10.1186/s12859-020-03835-5.


Classification in biological networks with hypergraphlet kernels.

Lugo-Martinez J, Zeiberg D, Gaudelet T, Malod-Dognin N, Przulj N, Radivojac P. Bioinformatics. 2020; 37(7):1000-1007.

PMID: 32886115 PMC: 8128478. DOI: 10.1093/bioinformatics/btaa768.


Protein-protein interaction sites prediction by ensemble random forests with synthetic minority oversampling technique.

Wang X, Yu B, Ma A, Chen C, Liu B, Ma Q. Bioinformatics. 2018; 35(14):2395-2402.

PMID: 30520961 PMC: 6612859. DOI: 10.1093/bioinformatics/bty995.


Evaluating the impact of topological protein features on the negative examples selection.

Boldi P, Frasca M, Malchiodi D. BMC Bioinformatics. 2018; 19(Suppl 14):417.

PMID: 30453879 PMC: 6245585. DOI: 10.1186/s12859-018-2385-x.


Predicting protein-protein interactions through sequence-based deep learning.

Hashemifar S, Neyshabur B, Khan A, Xu J. Bioinformatics. 2018; 34(17):i802-i810.

PMID: 30423091 PMC: 6129267. DOI: 10.1093/bioinformatics/bty573.