» Articles » PMID: 27283949

A Knowledge-based Approach for Predicting Gene-disease Associations

Overview
Journal Bioinformatics
Specialty Biology
Date 2016 Jun 11
PMID 27283949
Citations 21
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Recent advances of next-generation sequence technologies have made it possible to rapidly and inexpensively identify gene variations. Knowing the disease association of these gene variations is important for early intervention to treat deadly diseases and provide possible targets to cure these diseases. Genome-wide association studies (GWAS) have identified many individual genes associated with common diseases. To exploit the large amount of data obtained from GWAS studies and leverage our understanding of common as well as rare diseases, we have developed a knowledge-based approach to predict gene-disease associations. We first derive gene-gene mutual information by utilizing the cooccurrence of genes in known gene-disease association data. Subsequently, the mutual information is combined with known protein-protein interaction networks by a boosted tree regression method.

Results: The method called Know-GENE is compared with the method of random walking on the heterogeneous network using the same input data. For a set of 960 diseases, using the same training data in testing in 3-fold cross-validation, the average recall rate within the top ranked 100 genes by Know-GENE is 65.0% compared with 37.9% by the state of the art random walking on heterogeneous network. This significant improvement is mostly due to the inclusion of knowledge-based mutual information.

Availability And Implementation: Predictions for genes associated with the 960 diseases are available at http://cssb2.biology.gatech.edu/knowgene

Contact: : skolnick@gatech.edu.

Citing Articles

Graph Artificial Intelligence in Medicine.

Johnson R, Li M, Noori A, Queen O, Zitnik M Annu Rev Biomed Data Sci. 2024; 7(1):345-368.

PMID: 38749465 PMC: 11344018. DOI: 10.1146/annurev-biodatasci-110723-024625.


KDGene: knowledge graph completion for disease gene prediction using interactional tensor decomposition.

Wang X, Yang K, Jia T, Gu F, Wang C, Xu K Brief Bioinform. 2024; 25(3).

PMID: 38605639 PMC: 11009469. DOI: 10.1093/bib/bbae161.


A Comprehensive Bioinformatics Approach to Identify Molecular Signatures and Key Pathways for the Huntington Disease.

Meem T, Khan U, Mredul M, Awal M, Rahman M, Khan M Bioinform Biol Insights. 2023; 17:11779322231210098.

PMID: 38033382 PMC: 10683407. DOI: 10.1177/11779322231210098.


HetIG-PreDiG: A Heterogeneous Integrated Graph Model for Predicting Human Disease Genes based on gene expression.

Jagodnik K, Shvili Y, Bartal A PLoS One. 2023; 18(2):e0280839.

PMID: 36791052 PMC: 9931161. DOI: 10.1371/journal.pone.0280839.


Artificial Intelligence, Healthcare, Clinical Genomics, and Pharmacogenomics Approaches in Precision Medicine.

Abdelhalim H, Berber A, Lodi M, Jain R, Nair A, Pappu A Front Genet. 2022; 13:929736.

PMID: 35873469 PMC: 9299079. DOI: 10.3389/fgene.2022.929736.


References
1.
Ogutu J, Piepho H, Schulz-Streeck T . A comparison of random forests, boosting and support vector machines for genomic selection. BMC Proc. 2011; 5 Suppl 3:S11. PMC: 3103196. DOI: 10.1186/1753-6561-5-S3-S11. View

2.
Thusberg J, Olatubosun A, Vihinen M . Performance of mutation pathogenicity prediction methods on missense variants. Hum Mutat. 2011; 32(4):358-68. DOI: 10.1002/humu.21445. View

3.
Goh K, Cusick M, Valle D, Childs B, Vidal M, Barabasi A . The human disease network. Proc Natl Acad Sci U S A. 2007; 104(21):8685-90. PMC: 1885563. DOI: 10.1073/pnas.0701361104. View

4.
Natarajan N, Dhillon I . Inductive matrix completion for predicting gene-disease associations. Bioinformatics. 2014; 30(12):i60-68. PMC: 4058925. DOI: 10.1093/bioinformatics/btu269. View

5.
Barkic M, Crnomarkovic S, Grabusic K, Bogetic I, Panic L, Tamarut S . The p53 tumor suppressor causes congenital malformations in Rpl24-deficient mice and promotes their survival. Mol Cell Biol. 2009; 29(10):2489-504. PMC: 2682053. DOI: 10.1128/MCB.01588-08. View