» Articles » PMID: 24267980

Protein Function Prediction by Collective Classification with Explicit and Implicit Edges in Protein-protein Interaction Networks

Overview
Publisher Biomed Central
Specialty Biology
Date 2013 Nov 26
PMID 24267980
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Protein function prediction is an important problem in the post-genomic era. Recent advances in experimental biology have enabled the production of vast amounts of protein-protein interaction (PPI) data. Thus, using PPI data to functionally annotate proteins has been extensively studied. However, most existing network-based approaches do not work well when annotation and interaction information is inadequate in the networks.

Results: In this paper, we proposed a new method that combines PPI information and protein sequence information to boost the prediction performance based on collective classification. Our method divides function prediction into two phases: First, the original PPI network is enriched by adding a number of edges that are inferred from protein sequence information. We call the added edges implicit edges, and the existing ones explicit edges correspondingly. Second, a collective classification algorithm is employed on the new network to predict protein function.

Conclusions: We conducted extensive experiments on two real, publicly available PPI datasets. Compared to four existing protein function prediction approaches, our method performs better in many situations, which shows that adding implicit edges can indeed improve the prediction performance. Furthermore, the experimental results also indicate that our method is significantly better than the compared approaches in sparsely-labeled networks, and it is robust to the change of the proportion of annotated proteins.

Citing Articles

Predicting the pro-longevity or anti-longevity effect of model organism genes with enhanced Gaussian noise augmentation-based contrastive learning on protein-protein interaction networks.

Alsaggaf I, Freitas A, Wan C NAR Genom Bioinform. 2024; 6(4):lqae153.

PMID: 39633720 PMC: 11616696. DOI: 10.1093/nargab/lqae153.


Protein Function Prediction Based on PPI Networks: Network Reconstruction vs Edge Enrichment.

Zhou J, Xiong W, Wang Y, Guan J Front Genet. 2021; 12:758131.

PMID: 34970299 PMC: 8712557. DOI: 10.3389/fgene.2021.758131.


FunPred 3.0: improved protein function prediction using protein interaction network.

Saha S, Chatterjee P, Basu S, Nasipuri M, Plewczynski D PeerJ. 2019; 7:e6830.

PMID: 31198622 PMC: 6535044. DOI: 10.7717/peerj.6830.


Predicting biomedical metadata in CEDAR: A study of Gene Expression Omnibus (GEO).

Panahiazar M, Dumontier M, Gevaert O J Biomed Inform. 2017; 72:132-139.

PMID: 28625880 PMC: 5643580. DOI: 10.1016/j.jbi.2017.06.017.


A survey of computational intelligence techniques in protein function prediction.

Tiwari A, Srivastava R Int J Proteomics. 2015; 2014:845479.

PMID: 25574395 PMC: 4276698. DOI: 10.1155/2014/845479.


References
1.
Nabieva E, Jim K, Agarwal A, Chazelle B, Singh M . Whole-proteome prediction of protein function via graph-theoretic analysis of interaction maps. Bioinformatics. 2005; 21 Suppl 1:i302-10. DOI: 10.1093/bioinformatics/bti1054. View

2.
Chua H, Sung W, Wong L . Exploiting indirect neighbours and topological weight to predict protein function from protein-protein interactions. Bioinformatics. 2006; 22(13):1623-30. DOI: 10.1093/bioinformatics/btl145. View

3.
Watson J, Laskowski R, Thornton J . Predicting protein function from sequence and structural data. Curr Opin Struct Biol. 2005; 15(3):275-84. DOI: 10.1016/j.sbi.2005.04.003. View

4.
Sharan R, Ulitsky I, Shamir R . Network-based prediction of protein function. Mol Syst Biol. 2007; 3:88. PMC: 1847944. DOI: 10.1038/msb4100129. View

5.
Dunn R, Dudbridge F, Sanderson C . The use of edge-betweenness clustering to investigate biological function in protein interaction networks. BMC Bioinformatics. 2005; 6:39. PMC: 555937. DOI: 10.1186/1471-2105-6-39. View