
A Thorough Analysis of the Contribution of Experimental, Derived and Sequence-based Predicted Protein-protein Interactions for Functional Annotation of Proteins

Overview
Journal PLoS One
Date 2020 Nov 25
PMID 33237964
Citations 3
Abstract

Physical interaction between two proteins is strong evidence that the proteins are involved in the same biological process, making Protein-Protein Interaction (PPI) networks a valuable data resource for predicting the cellular functions of proteins. However, PPI networks are largely incomplete for non-model species. Here, we tested to what extent these incomplete networks are still useful for genome-wide function prediction. We used two network-based classifiers to predict Biological Process Gene Ontology terms from protein interaction data in four species: Saccharomyces cerevisiae, Escherichia coli, Arabidopsis thaliana and Solanum lycopersicum (tomato). The classifiers had reasonable performance in the well-studied yeast, but performed poorly in the other species. We showed that this poor performance can be considerably improved by adding edges predicted from various data sources, such as text mining, and that associations from the STRING database are more useful than interactions predicted by a neural network from sequence-based features.
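The abstract does not name the two network-based classifiers, so the following is only a minimal sketch of the general guilt-by-association idea, assuming a simple neighbor-voting rule: a Gene Ontology term is scored for a protein by the fraction of its network neighbors annotated with that term. The function name `neighbor_vote`, the protein identifiers, and the toy annotations are hypothetical and not taken from the paper. The last call illustrates why adding predicted edges can help a sparsely connected protein.

```python
from collections import defaultdict


def neighbor_vote(edges, annotations, protein):
    """Score GO terms for `protein` by the fraction of its neighbors that carry each term.

    Illustrative guilt-by-association rule; not the classifiers used in the paper.
    """
    # Build an undirected adjacency map from the edge list.
    neighbors = defaultdict(set)
    for a, b in edges:
        neighbors[a].add(b)
        neighbors[b].add(a)

    nbrs = neighbors[protein]
    if not nbrs:
        return {}  # an isolated protein gets no predictions

    # Count how many neighbors are annotated with each term.
    counts = defaultdict(int)
    for n in nbrs:
        for term in annotations.get(n, ()):
            counts[term] += 1
    return {term: c / len(nbrs) for term, c in counts.items()}


# Toy network (hypothetical identifiers and annotations).
edges = [("P1", "P2"), ("P1", "P3"), ("P1", "P4"), ("P5", "P6")]
annotations = {
    "P2": {"GO:0006412"},
    "P3": {"GO:0006412", "GO:0006096"},
}

scores = neighbor_vote(edges, annotations, "P1")  # three neighbors, two annotated
sparse = neighbor_vote(edges, annotations, "P5")  # only neighbor is unannotated

# Adding one predicted edge (for example, from text mining) gives P5 an annotated neighbor.
augmented = neighbor_vote(edges + [("P5", "P2")], annotations, "P5")
```

In this toy case `P5` receives no predictions from the original network because its only neighbor is unannotated, while a single added edge to an annotated protein yields a nonzero score. This mirrors, in a very simplified way, the abstract's finding that incomplete networks benefit from predicted associations.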

Citing Articles

SAFPred: synteny-aware gene function prediction for bacteria using protein embeddings.

Urhan A, Cosma B, Earl A, Manson A, Abeel T. Bioinformatics. 2024; 40(6).

PMID: 38775729 PMC: 11147799. DOI: 10.1093/bioinformatics/btae328.


SAP: Synteny-aware gene function prediction for bacteria using protein embeddings.

Urhan A, Cosma B, Earl A, Manson A, Abeel T. bioRxiv. 2023.

PMID: 37205418 PMC: 10187222. DOI: 10.1101/2023.05.02.539034.


In silico and gene expression analysis of the acute inflammatory response of gilthead seabream (Sparus aurata) after subcutaneous administration of carrageenin.

Campos-Sanchez J, Mayor-Lafuente J, Guardiola F, Esteban M. Fish Physiol Biochem. 2021; 47(5):1623-1643.

PMID: 34448108 PMC: 8478728. DOI: 10.1007/s10695-021-00999-6.
