
GenNet Framework: Interpretable Deep Learning for Predicting Phenotypes from Genetic Data

Overview
Journal Commun Biol
Specialty Biology
Date 2021 Sep 18
PMID 34535759
Citations 20
Abstract

Applying deep learning in population genomics is challenging because of computational issues and lack of interpretable models. Here, we propose GenNet, a novel open-source deep learning framework for predicting phenotypes from genetic variants. In this framework, interpretable and memory-efficient neural network architectures are constructed by embedding biological knowledge from public databases, resulting in neural networks that contain only biologically plausible connections. We applied the framework to seventeen phenotypes and found well-replicated genes such as HERC2 and OCA2 for hair and eye color, and novel genes such as ZNF773 and PCNT for schizophrenia. Additionally, the framework identified ubiquitin-mediated proteolysis, endocrine system and viral infectious diseases as the most predictive biological pathways for schizophrenia. GenNet is a freely available, end-to-end deep learning framework that allows researchers to develop and use interpretable neural networks to obtain novel insights into the genetic architecture of complex traits and diseases.
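The idea of a network that contains only biologically plausible connections can be illustrated with a small masked layer. The following is a minimal sketch in NumPy, not GenNet's actual implementation or API: the annotation array `snp_to_gene`, the function `gene_layer`, the toy genotypes and the tanh activation are all hypothetical choices made for illustration. It assumes each SNP is annotated to exactly one gene and that genotypes are coded as 0/1/2 allele counts.

```python
import numpy as np

# Hypothetical annotation (toy data): index of the gene each SNP maps to.
snp_to_gene = np.array([0, 0, 1, 1, 1, 2])
n_snps = snp_to_gene.size
n_genes = int(snp_to_gene.max()) + 1

# Binary mask: mask[i, j] is 1 only if SNP i is annotated to gene j.
mask = np.zeros((n_snps, n_genes))
mask[np.arange(n_snps), snp_to_gene] = 1.0

rng = np.random.default_rng(0)
weights = rng.normal(size=(n_snps, n_genes))


def gene_layer(genotypes, weights, mask):
    """Masked dense layer: only annotated SNP-to-gene connections contribute."""
    return np.tanh(genotypes @ (weights * mask))


# Toy genotypes: 4 individuals x 6 SNPs, coded as 0/1/2 allele counts.
genotypes = rng.integers(0, 3, size=(4, n_snps)).astype(float)
gene_activations = gene_layer(genotypes, weights, mask)

print(gene_activations.shape)
print(int(mask.sum()), "of", mask.size, "possible connections are used")
```

Because every gene node receives input only from its own SNPs, each learned weight can be read as the contribution of one SNP to one gene. A dense mask is used here only for readability; a memory-efficient version would store just the annotated connections, for example in a sparse matrix.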

Citing Articles

Learning genotype-phenotype associations from gaps in multi-species sequence alignments.

Islam U, Campelo Dos Santos A, Kanjilal R, Assis R Brief Bioinform. 2025; 26(1).

PMID: 39976386 PMC: 11840556. DOI: 10.1093/bib/bbaf022.


A mechanism-informed deep neural network enables prioritization of regulators that drive cell state transitions.

Xi X, Li J, Jia J, Meng Q, Li C, Wang X Nat Commun. 2025; 16(1):1284.

PMID: 39900922 PMC: 11790924. DOI: 10.1038/s41467-025-56475-9.


Genome-wide association neural networks identify genes linked to family history of Alzheimer's disease.

Ghose U, Sproviero W, Winchester L, Amin N, Zhu T, Newby D Brief Bioinform. 2025; 26(1).

PMID: 39775791 PMC: 11707606. DOI: 10.1093/bib/bbae704.


DNA promoter task-oriented dictionary mining and prediction model based on natural language technology.

Zeng R, Li Z, Li J, Zhang Q Sci Rep. 2025; 15(1):153.

PMID: 39747934 PMC: 11697570. DOI: 10.1038/s41598-024-84105-9.


Designing interpretable deep learning applications for functional genomics: a quantitative analysis.

van Hilten A, Katz S, Saccenti E, Niessen W, Roshchupkin G Brief Bioinform. 2024; 25(5).

PMID: 39293804 PMC: 11410376. DOI: 10.1093/bib/bbae449.

