A Simple Convolutional Neural Network for Prediction of Enhancer-promoter Interactions with DNA Sequence Data
Overview
Authors
Affiliations
Motivation: Enhancer-promoter interactions (EPIs) in the genome play an important role in transcriptional regulation. EPIs can be useful in boosting statistical power and enhancing mechanistic interpretation for disease- or trait-associated genetic variants in genome-wide association studies. Instead of expensive and time-consuming biological experiments, computational prediction of EPIs with DNA sequence and other genomic data is a fast and viable alternative. In particular, deep learning and other machine learning methods have been demonstrated with promising performance.
Results: First, using a published human cell line dataset, we demonstrate that a simple convolutional neural network (CNN) performs as well as, if no better than, a more complicated and state-of-the-art architecture, a hybrid of a CNN and a recurrent neural network. More importantly, in spite of the well-known cell line-specific EPIs (and corresponding gene expression), in contrast to the standard practice of training and predicting for each cell line separately, we propose two transfer learning approaches to training a model using all cell lines to various extents, leading to substantially improved predictive performance.
Availability And Implementation: Computer code is available at https://github.com/zzUMN/Combine-CNN-Enhancer-and-Promoters.
Supplementary Information: Supplementary data are available at Bioinformatics online.
Inferring protein from transcript abundances using convolutional neural networks.
Schwehn P, Falter-Braun P BioData Min. 2025; 18(1):18.
PMID: 40016737 PMC: 11866710. DOI: 10.1186/s13040-025-00434-z.
GATv2EPI: Predicting Enhancer-Promoter Interactions with a Dynamic Graph Attention Network.
Zhang T, Zhao X, Sun H, Gao B, Liu X Genes (Basel). 2025; 15(12.
PMID: 39766779 PMC: 11675151. DOI: 10.3390/genes15121511.
RAEPI: Predicting Enhancer-Promoter Interactions Based on Restricted Attention Mechanism.
Zhang W, Zhang M, Zhu M Interdiscip Sci. 2024; 17(1):153-165.
PMID: 39546160 DOI: 10.1007/s12539-024-00669-0.
Machine and Deep Learning Methods for Predicting 3D Genome Organization.
Wall B, Nguyen M, Harrell J, Dozmorov M Methods Mol Biol. 2024; 2856:357-400.
PMID: 39283464 DOI: 10.1007/978-1-0716-4136-1_22.
Ashayeri H, Sobhi N, Plawiak P, Pedrammehr S, Alizadehsani R, Jafarizadeh A Cancers (Basel). 2024; 16(11).
PMID: 38893257 PMC: 11171544. DOI: 10.3390/cancers16112138.