» Articles » PMID: 35140599

An Iterative Method for Predicting Essential Proteins Based on Multifeature Fusion and Linear Neighborhood Similarity

Overview
Specialty Geriatrics
Date 2022 Feb 10
PMID 35140599
Authors
Affiliations
Soon will be listed here.
Abstract

Growing evidence have demonstrated that many biological processes are inseparable from the participation of key proteins. In this paper, a novel iterative method called linear neighborhood similarity-based protein multifeatures fusion (LNSPF) is proposed to identify potential key proteins based on multifeature fusion. In LNSPF, an original protein-protein interaction (PPI) network will be constructed first based on known protein-protein interaction data downloaded from benchmark databases, based on which, topological features will be further extracted. Next, gene expression data of proteins will be adopted to transfer the original PPI network to a weighted PPI network based on the linear neighborhood similarity. After that, subcellular localization and homologous information of proteins will be integrated to extract functional features for proteins, and based on both functional and topological features obtained above. And then, an iterative method will be designed and carried out to predict potential key proteins. At last, for evaluating the predictive performance of LNSPF, extensive experiments have been done, and compare results between LNPSF and 15 state-of-the-art competitive methods have demonstrated that LNSPF can achieve satisfactory recognition accuracy, which is markedly better than that achieved by each competing method.

Citing Articles

A seed expansion-based method to identify essential proteins by integrating protein-protein interaction sub-networks and multiple biological characteristics.

Zhao H, Liu G, Cao X BMC Bioinformatics. 2023; 24(1):452.

PMID: 38036960 PMC: 10688502. DOI: 10.1186/s12859-023-05583-8.


MILNP: Plant lncRNA-miRNA Interaction Prediction Based on Improved Linear Neighborhood Similarity and Label Propagation.

Cai L, Gao M, Ren X, Fu X, Xu J, Wang P Front Plant Sci. 2022; 13:861886.

PMID: 35401586 PMC: 8990282. DOI: 10.3389/fpls.2022.861886.

References
1.
Wuchty S, Stadler P . Centers of complex networks. J Theor Biol. 2003; 223(1):45-53. DOI: 10.1016/s0022-5193(03)00071-7. View

2.
Zhang W, Xue X, Xie C, Li Y, Liu J, Chen H . CEGSO: Boosting Essential Proteins Prediction by Integrating Protein Complex, Gene Expression, Gene Ontology, Subcellular Localization and Orthology Information. Interdiscip Sci. 2021; 13(3):349-361. DOI: 10.1007/s12539-021-00426-7. View

3.
Zhang R, Lin Y . DEG 5.0, a database of essential genes in both prokaryotes and eukaryotes. Nucleic Acids Res. 2008; 37(Database issue):D455-8. PMC: 2686491. DOI: 10.1093/nar/gkn858. View

4.
Hahn M, Kern A . Comparative genomics of centrality and essentiality in three eukaryotic protein-interaction networks. Mol Biol Evol. 2004; 22(4):803-6. DOI: 10.1093/molbev/msi072. View

5.
Zhang F, Peng W, Yang Y, Dai W, Song J . A Novel Method for Identifying Essential Genes by Fusing Dynamic Protein⁻Protein Interactive Networks. Genes (Basel). 2019; 10(1). PMC: 6356314. DOI: 10.3390/genes10010031. View