» Articles » PMID: 31283472

HerGePred: Heterogeneous Network Embedding Representation for Disease Gene Prediction

Overview
Date 2019 Jul 9
PMID 31283472
Citations 22
Authors
Affiliations
Soon will be listed here.
Abstract

The discovery of disease-causing genes is a critical step towards understanding the nature of a disease and determining a possible cure for it. In recent years, many computational methods to identify disease genes have been proposed. However, making full use of disease-related (e.g., symptoms) and gene-related (e.g., gene ontology and protein-protein interactions) information to improve the performance of disease gene prediction is still an issue. Here, we develop a heterogeneous disease-gene-related network (HDGN) embedding representation framework for disease gene prediction (called HerGePred). Based on this framework, a low-dimensional vector representation (LVR) of the nodes in the HDGN can be obtained. Then, we propose two specific algorithms, namely, an LVR-based similarity prediction and a random walk with restart on a reconstructed heterogeneous disease-gene network (RW-RDGN), to predict disease genes with high performance. First, to validate the rationality of the framework, we analyze the similarity-based overlap distribution of disease pairs and design an experiment for disease-gene association recovery, the results of which revealed that the LVR of nodes performs well at preserving the local and global network structure of the HDGN. Then, we apply tenfold cross validation and external validation to compare our methods with other well-known disease gene prediction algorithms. The experimental results show that the RW-RDGN performs better than the state-of-the-art algorithm. The prediction results of disease candidate genes are essential for molecular mechanism investigation and experimental validation. The source codes of HerGePred and experimental data are available at https://github.com/yangkuoone/HerGePred.

Citing Articles

Simplicity within biological complexity.

Przulj N, Malod-Dognin N Bioinform Adv. 2025; 5(1):vbae164.

PMID: 39927291 PMC: 11805345. DOI: 10.1093/bioadv/vbae164.


Global-local aware Heterogeneous Graph Contrastive Learning for multifaceted association prediction in miRNA-gene-disease networks.

Si Y, Huang Z, Fang Z, Yuan Z, Huang Z, Li Y Brief Bioinform. 2024; 25(5).

PMID: 39256197 PMC: 11387071. DOI: 10.1093/bib/bbae443.


Function-Genes and Disease-Genes Prediction Based on Network Embedding and One-Class Classification.

Shi W, Zhang Y, Sun Y, Lin Z Interdiscip Sci. 2024; 16(4):781-801.

PMID: 39230798 DOI: 10.1007/s12539-024-00638-7.


Heterogeneous biomedical entity representation learning for gene-disease association prediction.

Meng Z, Liu S, Liang S, Jani B, Meng Z Brief Bioinform. 2024; 25(5).

PMID: 39154194 PMC: 11330343. DOI: 10.1093/bib/bbae380.


Research on Artificial-Intelligence-Assisted Medicine: A Survey on Medical Artificial Intelligence.

Gou F, Liu J, Xiao C, Wu J Diagnostics (Basel). 2024; 14(14).

PMID: 39061610 PMC: 11275417. DOI: 10.3390/diagnostics14141472.