» Articles » PMID: 32302512

Predicting Protein Functions Based on Differential Co-expression and Neighborhood Analysis

Overview
Journal J Comput Biol
Date 2020 Apr 18
PMID 32302512
Authors
Affiliations
Soon will be listed here.
Abstract

Proteins are polypeptides essential in biological processes. Protein physical interactions are complemented by other types of functional relationship data including genetic interactions, knowledge about co-expression, and evolutionary pathways. Existing algorithms integrate protein interaction and gene expression data to retrieve context-specific subnetworks composed of genes/proteins with known and unknown functions. However, most protein function prediction algorithms fail to exploit diverse intrinsic information in feature and label spaces. We develop a novel integrative method based on differential Co-expression analysis and Neighbor-voting algorithm for Protein Function Prediction, namely CNPFP. The method integrates heterogeneous data and exploits intrinsic and latent linkages via global iterative approach and genomic features. CNPFP performs three tasks: clustering, differential co-expression analysis, and predicts protein functions. Our aim is to identify yeast cell cycle-specific proteins linked to differentially expressed proteins in the protein-protein interaction network. To capture intrinsic information, CNPFP selects the most relevant feature subset based on global iterative neighbor-voting algorithm. We identify eight condition-specific modules. The most relevant subnetwork has 87 genes highly enriched with cyclin-dependent kinases, a protein kinase relevant for cell cycle regulation. We present comprehensive annotations for 3538 proteins. Our method achieves an AUROC of 0.9862, accuracy of 0.9710, and -score of 0.9691. From the results, we can summarize that exploiting intrinsic nature of protein relationships improves the quality of function prediction. Thus, the proposed method is useful in functional genomics studies.

Citing Articles

A hybrid machine learning framework for functional annotation of mitochondrial glutathione transport and metabolism proteins in cancers.

Kennedy L, Sandhu J, Harper M, Cuperlovic-Culf M BMC Bioinformatics. 2025; 26(1):48.

PMID: 39934670 PMC: 11817629. DOI: 10.1186/s12859-025-06051-1.

References
1.
OMeara M, Ballouz S, Shoichet B, Gillis J . Ligand Similarity Complements Sequence, Physical Interaction, and Co-Expression for Gene Function Prediction. PLoS One. 2016; 11(7):e0160098. PMC: 4965129. DOI: 10.1371/journal.pone.0160098. View

2.
Meng J, Wekesa J, Shi G, Luan Y . Protein function prediction based on data fusion and functional interrelationship. Math Biosci. 2016; 274:25-32. DOI: 10.1016/j.mbs.2016.02.001. View

3.
Zhao B, Hu S, Li X, Zhang F, Tian Q, Ni W . An efficient method for protein function annotation based on multilayer protein networks. Hum Genomics. 2016; 10(1):33. PMC: 5039885. DOI: 10.1186/s40246-016-0087-x. View

4.
Hu W, Lin X, Chen K . Integrated analysis of differential gene expression profiles in hippocampi to identify candidate genes involved in Alzheimer's disease. Mol Med Rep. 2015; 12(5):6679-87. PMC: 4626122. DOI: 10.3892/mmr.2015.4271. View

5.
Dong J, Horvath S . Understanding network concepts in modules. BMC Syst Biol. 2007; 1:24. PMC: 3238286. DOI: 10.1186/1752-0509-1-24. View