
Predicting Protein Complex in Protein Interaction Network - a Supervised Learning Based Method

Overview
Journal: BMC Syst Biol
Publisher: Biomed Central
Specialty: Biology
Date: 2014 Oct 29
PMID: 25349902
Citations: 10
Abstract

Background: Protein complexes are important for understanding the principles of cellular organization and function. High-throughput experimental techniques have produced a large number of protein interactions, making it possible to predict protein complexes from protein-protein interaction networks (PINs). However, most current methods are unsupervised and cannot exploit the information contained in the large number of known complexes.

Methods: We present a supervised learning-based method for predicting protein complexes in protein-protein interaction networks. The method extracts rich features from both the unweighted and weighted networks to train a regression model, which is then used for clique filtering, growth, and candidate complex filtering. The model utilizes additional "uncertainty" samples and is therefore more discriminative when used in the complex detection algorithm. In addition, our method uses the maximal cliques found by the Cliques algorithm as the initial cliques, an approach that has been shown to be more effective than expanding from seed proteins, as other methods do.
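The clique-seeded pipeline described above (enumerate maximal cliques, filter them, grow them, filter the candidates) can be sketched as follows. This is an illustrative sketch only, not the authors' implementation: the abstract does not specify the regression model, its features, or its thresholds, so subgraph density stands in for the learned score, and the names `build_adjacency`, `density_score`, `grow`, `detect_complexes`, `min_size`, and `min_score` are all hypothetical. Maximal cliques are enumerated here with a plain Bron-Kerbosch routine, which may differ from the Cliques algorithm the paper uses.

```python
from itertools import combinations


def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbours)} from an edge list."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def maximal_cliques(adj):
    """Enumerate all maximal cliques with Bron-Kerbosch and pivoting."""
    found = []

    def expand(r, p, x):
        if not p and not x:
            found.append(frozenset(r))
            return
        # Pivot on the vertex with most neighbours in p to prune branches.
        pivot = max(p | x, key=lambda v: len(adj[v] & p))
        for v in list(p - adj[pivot]):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adj), set())
    return found


def density_score(adj, nodes):
    """Stand-in for the trained regression model (assumption): edge density."""
    n = len(nodes)
    if n < 2:
        return 0.0
    present = sum(1 for u, v in combinations(nodes, 2) if v in adj[u])
    return present / (n * (n - 1) / 2)


def grow(adj, seed, score, min_score):
    """Greedily add the best-scoring neighbour while the score stays >= min_score."""
    cluster = set(seed)
    while True:
        frontier = set().union(*(adj[v] for v in cluster)) - cluster
        best = max(frontier,
                   key=lambda v: (score(adj, cluster | {v}), str(v)),
                   default=None)
        if best is None or score(adj, cluster | {best}) < min_score:
            return frozenset(cluster)
        cluster.add(best)


def detect_complexes(adj, score=density_score, min_size=3, min_score=0.7):
    """Clique filtering -> growth -> candidate filtering, returning sorted node lists."""
    seeds = [c for c in maximal_cliques(adj)
             if len(c) >= min_size and score(adj, c) >= min_score]
    grown = {grow(adj, c, score, min_score) for c in seeds}
    kept = [sorted(c) for c in grown if score(adj, c) >= min_score]
    return sorted(kept, key=lambda c: (-len(c), c))


# Toy network (made up for illustration, not data from the paper).
edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C"), ("B", "D"), ("C", "D"),
         ("A", "E"), ("B", "E"), ("C", "E"), ("E", "F"),
         ("F", "G"), ("F", "H"), ("G", "H")]
print(detect_complexes(build_adjacency(edges)))
```

In a supervised setting, `density_score` would be replaced by a function that computes a feature vector for the candidate subgraph and returns the prediction of a model trained on known complexes; the rest of the loop would stay the same.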

Results: Experimental results on several PIN datasets show that, in most cases, the performance of our method is superior to that of comparable state-of-the-art protein complex detection techniques.

Conclusions: The results demonstrate several advantages of our method over other state-of-the-art techniques. First, our method is supervised and can make full use of the information in the available known complexes instead of relying only on the topological structure of the PIN. This also means that, if more training samples are provided, our method can achieve better performance than unsupervised methods. Second, we design a rich feature set to describe the properties of known complexes, which includes features not only from the unweighted network but also from the weighted network built from Gene Ontology information. Third, our regression model utilizes additional "uncertainty" samples and therefore becomes more discriminative; our experimental results indicate its effectiveness for complex detection.
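To make the idea of combining features from the unweighted and weighted networks concrete, here is a minimal sketch of a feature extractor for one candidate subgraph. The abstract does not list the actual features, so the six shown here (size, density, degree statistics, mean edge weight, weighted density) are assumptions chosen for illustration, as are the function name `subgraph_features` and the representation of edge weights as a dictionary keyed by `frozenset` pairs. The weights are assumed to be precomputed, for example from a Gene Ontology similarity measure.

```python
from itertools import combinations
from statistics import mean


def subgraph_features(nodes, weights):
    """Describe a candidate complex by topological and weighted features.

    nodes:   iterable of protein identifiers in the candidate.
    weights: dict mapping frozenset({u, v}) to an edge weight; a pair that
             is absent from the dict is treated as a missing interaction.
    """
    nodes = sorted(nodes)
    pairs = [frozenset(p) for p in combinations(nodes, 2)]
    present = [p for p in pairs if p in weights]
    # Degree of each node counted inside the candidate subgraph only.
    degree = {v: sum(1 for p in present if v in p) for v in nodes}
    w = [weights[p] for p in present]
    return {
        # Features from the unweighted network
        "size": len(nodes),
        "density": len(present) / len(pairs) if pairs else 0.0,
        "mean_degree": mean(degree.values()) if nodes else 0.0,
        "max_degree": max(degree.values(), default=0),
        # Features from the weighted network
        "mean_weight": mean(w) if w else 0.0,
        "weighted_density": sum(w) / len(pairs) if pairs else 0.0,
    }


# Toy weights (made up for illustration); the C-D edge lies outside the candidate.
toy_weights = {
    frozenset({"A", "B"}): 0.8,
    frozenset({"B", "C"}): 0.6,
    frozenset({"C", "D"}): 0.9,
}
features = subgraph_features({"A", "B", "C"}, toy_weights)
print(features)
```

A vector like this, computed for known complexes and for negative or uncertain samples, is the kind of input a regression model could be trained on.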

Citing Articles

Heterogeneous network approaches to protein pathway prediction.

Nayar G, Altman R. Comput Struct Biotechnol J. 2024; 23:2727-2739.

PMID: 39035835 PMC: 11260399. DOI: 10.1016/j.csbj.2024.06.022.


Integration of protein sequence and protein-protein interaction data by hypergraph learning to identify novel protein complexes.

Xia S, Li D, Deng X, Liu Z, Zhu H, Liu Y. Brief Bioinform. 2024; 25(4).

PMID: 38851299 PMC: 11162299. DOI: 10.1093/bib/bbae274.


A supervised protein complex prediction method with network representation learning and gene ontology knowledge.

Wang X, Zhang Y, Zhou P, Liu X. BMC Bioinformatics. 2022; 23(1):300.

PMID: 35879648 PMC: 9317086. DOI: 10.1186/s12859-022-04850-4.


An Ensemble Learning Framework for Detecting Protein Complexes From PPI Networks.

Wang R, Ma H, Wang C. Front Genet. 2022; 13:839949.

PMID: 35281831 PMC: 8908451. DOI: 10.3389/fgene.2022.839949.


Super.Complex: A supervised machine learning pipeline for molecular complex detection in protein-interaction networks.

Palukuri M, Marcotte E. PLoS One. 2021; 16(12):e0262056.

PMID: 34972161 PMC: 8719692. DOI: 10.1371/journal.pone.0262056.

