
A Combined Approach for Genome Wide Protein Function Annotation/prediction

Overview
Journal Proteome Sci
Publisher BioMed Central
Date 2014 Feb 26
PMID 24564915
Citations 9
Abstract

Background: Large-scale genome sequencing technologies are uncovering a growing number of new genes and proteins, many of which remain uncharacterized. Experimental procedures for determining protein function are inherently low throughput and therefore cannot keep up with the rate at which new proteins are discovered. At the same time, proteins are key players in almost all biological processes, so knowing their functions precisely is essential for understanding the underlying biological mechanisms. The challenge of annotating uncharacterized proteins, in functional genomics and in biology in general, motivates the use of well-orchestrated computational techniques to predict their functions accurately.

Methods: We propose a computational flow for functional annotation that assigns the most probable functions to a protein by aggregating heterogeneous information. The information considered includes protein motifs, protein sequence similarity, and protein homology data gathered from interacting proteins, combined with data from highly similar non-interacting proteins (hereinafter called Similactors). Moreover, to increase the predictive power of our model, we also compute and integrate term-specific relationships among functional terms based on the Gene Ontology (GO).
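The aggregation idea described in the Methods can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no scoring formula, so the weighted sum, the equal source weights, the relationship strengths, and all function names (`aggregate_scores`, `apply_go_relationships`, `rank_terms`) are assumptions made only for illustration, and the scores attached to the GO identifiers are toy values.

```python
from collections import defaultdict

# Hypothetical, equal weights per evidence source (not given in the abstract).
SOURCE_WEIGHTS = {
    "motif": 1.0,
    "sequence_similarity": 1.0,
    "interactor": 1.0,
    "similactor": 1.0,
}


def aggregate_scores(evidence, weights=SOURCE_WEIGHTS):
    """Sum the weighted evidence for each GO term across all sources.

    evidence maps a source name to a dict of {go_term: score}.
    Sources without a weight are ignored.
    """
    totals = defaultdict(float)
    for source, term_scores in evidence.items():
        weight = weights.get(source, 0.0)
        for term, score in term_scores.items():
            totals[term] += weight * score
    return dict(totals)


def apply_go_relationships(scores, relations):
    """Add support to a term from related terms (assumed relation strengths).

    relations maps a term to a dict of {related_term: strength}; each related
    term contributes strength * its aggregated score.
    """
    adjusted = dict(scores)
    for term, neighbours in relations.items():
        for other, strength in neighbours.items():
            if other in scores:
                adjusted[term] = adjusted.get(term, 0.0) + strength * scores[other]
    return adjusted


def rank_terms(scores, top_k=3):
    """Return the top_k (term, score) pairs, highest score first."""
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]


# Toy evidence for one protein; all scores are made up.
evidence = {
    "motif": {"GO:0003677": 0.5},
    "sequence_similarity": {"GO:0003677": 0.25, "GO:0005524": 0.5},
    "interactor": {"GO:0005524": 0.25},
    "similactor": {"GO:0003677": 0.25},
}
# Toy relationship: a child term lends half of its score to a parent term.
relations = {"GO:0003676": {"GO:0003677": 0.5}}

ranked = rank_terms(apply_go_relationships(aggregate_scores(evidence), relations))
print(ranked)
```

In this sketch a term supported by several independent sources ranks above a term supported by one, and the relationship step lets a term with no direct evidence still receive a score from a related term. How the published method actually weights and combines these signals is not stated in the abstract.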

Results: We tested our method on Saccharomyces cerevisiae and Homo sapiens proteins. Aggregating different structural and functional evidence with GO relationships outperforms the other methods reported in the literature in terms of precision and accuracy of prediction. Precision and accuracy are 100% for more than half of the input set for both species; overall, we obtained 85.38% precision and 81.95% accuracy for Homo sapiens and 79.73% precision and 80.06% accuracy for Saccharomyces cerevisiae proteins.

Citing Articles

An NLP-based method to mine gene and function relationships from published articles.

Kumar N, Mukhtar M Sci Rep. 2025; 15(1):7503.

PMID: 40033048 PMC: 11876572. DOI: 10.1038/s41598-025-91809-z.


Self-assembled peptide and protein nanostructures for anti-cancer therapy: Targeted delivery, stimuli-responsive devices and immunotherapy.

Delfi M, Sartorius R, Ashrafizadeh M, Sharifi E, Zhang Y, De Berardinis P Nano Today. 2021; 38.

PMID: 34267794 PMC: 8276870. DOI: 10.1016/j.nantod.2021.101119.


A three-way approach for protein function classification.

Ur Rehman H, Azam N, Yao J, Benso A PLoS One. 2017; 12(2):e0171702.

PMID: 28234929 PMC: 5325230. DOI: 10.1371/journal.pone.0171702.


Interspecies gene function prediction using semantic similarity.

Yu G, Luo W, Fu G, Wang J BMC Syst Biol. 2017; 10(Suppl 4):121.

PMID: 28155711 PMC: 5260010. DOI: 10.1186/s12918-016-0361-5.


Large-scale identification of human protein function using topological features of interaction network.

Li Z, Liu Z, Zhong W, Huang M, Wu N, Xie Y Sci Rep. 2016; 6:37179.

PMID: 27849060 PMC: 5111120. DOI: 10.1038/srep37179.

