» Articles » PMID: 30649185

A Simple Convolutional Neural Network for Prediction of Enhancer-promoter Interactions with DNA Sequence Data

Overview
Journal Bioinformatics
Specialty Biology
Date 2019 Jan 17
PMID 30649185
Citations 26
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Enhancer-promoter interactions (EPIs) in the genome play an important role in transcriptional regulation. EPIs can be useful in boosting statistical power and enhancing mechanistic interpretation for disease- or trait-associated genetic variants in genome-wide association studies. Instead of expensive and time-consuming biological experiments, computational prediction of EPIs with DNA sequence and other genomic data is a fast and viable alternative. In particular, deep learning and other machine learning methods have been demonstrated with promising performance.

Results: First, using a published human cell line dataset, we demonstrate that a simple convolutional neural network (CNN) performs as well as, if no better than, a more complicated and state-of-the-art architecture, a hybrid of a CNN and a recurrent neural network. More importantly, in spite of the well-known cell line-specific EPIs (and corresponding gene expression), in contrast to the standard practice of training and predicting for each cell line separately, we propose two transfer learning approaches to training a model using all cell lines to various extents, leading to substantially improved predictive performance.

Availability And Implementation: Computer code is available at https://github.com/zzUMN/Combine-CNN-Enhancer-and-Promoters.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

Inferring protein from transcript abundances using convolutional neural networks.

Schwehn P, Falter-Braun P BioData Min. 2025; 18(1):18.

PMID: 40016737 PMC: 11866710. DOI: 10.1186/s13040-025-00434-z.


GATv2EPI: Predicting Enhancer-Promoter Interactions with a Dynamic Graph Attention Network.

Zhang T, Zhao X, Sun H, Gao B, Liu X Genes (Basel). 2025; 15(12.

PMID: 39766779 PMC: 11675151. DOI: 10.3390/genes15121511.


RAEPI: Predicting Enhancer-Promoter Interactions Based on Restricted Attention Mechanism.

Zhang W, Zhang M, Zhu M Interdiscip Sci. 2024; 17(1):153-165.

PMID: 39546160 DOI: 10.1007/s12539-024-00669-0.


Machine and Deep Learning Methods for Predicting 3D Genome Organization.

Wall B, Nguyen M, Harrell J, Dozmorov M Methods Mol Biol. 2024; 2856:357-400.

PMID: 39283464 DOI: 10.1007/978-1-0716-4136-1_22.


Transfer Learning in Cancer Genetics, Mutation Detection, Gene Expression Analysis, and Syndrome Recognition.

Ashayeri H, Sobhi N, Plawiak P, Pedrammehr S, Alizadehsani R, Jafarizadeh A Cancers (Basel). 2024; 16(11).

PMID: 38893257 PMC: 11171544. DOI: 10.3390/cancers16112138.


References
1.
Whalen S, Truty R, Pollard K . Enhancer-promoter interactions are encoded by complex genomic signatures on looping chromatin. Nat Genet. 2016; 48(5):488-96. PMC: 4910881. DOI: 10.1038/ng.3539. View

2.
Rao S, Huntley M, Durand N, Stamenova E, Bochkov I, Robinson J . A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014; 159(7):1665-80. PMC: 5635824. DOI: 10.1016/j.cell.2014.11.021. View

3.
LeCun Y, Bengio Y, Hinton G . Deep learning. Nature. 2015; 521(7553):436-44. DOI: 10.1038/nature14539. View

4.
Cao Q, Anyansi C, Hu X, Xu L, Xiong L, Tang W . Reconstruction of enhancer-target networks in 935 samples of human primary cells, tissues and cell lines. Nat Genet. 2017; 49(10):1428-1436. DOI: 10.1038/ng.3950. View

5.
Yang Y, Zhang R, Singh S, Ma J . Exploiting sequence-based features for predicting enhancer-promoter interactions. Bioinformatics. 2017; 33(14):i252-i260. PMC: 5870728. DOI: 10.1093/bioinformatics/btx257. View