» Articles » PMID: 38859587

DeepCBA: A Deep Learning Framework for Gene Expression Prediction in Maize Based on DNA Sequences and Chromatin Interactions

Overview
Journal Plant Commun
Specialty Biology
Date 2024 Jun 11
PMID 38859587
Authors
Affiliations
Soon will be listed here.
Abstract

Chromatin interactions create spatial proximity between distal regulatory elements and target genes in the genome, which has an important impact on gene expression, transcriptional regulation, and phenotypic traits. To date, several methods have been developed for predicting gene expression. However, existing methods do not take into consideration the effect of chromatin interactions on target gene expression, thus potentially reducing the accuracy of gene expression prediction and mining of important regulatory elements. In this study, we developed a highly accurate deep learning-based gene expression prediction model (DeepCBA) based on maize chromatin interaction data. Compared with existing models, DeepCBA exhibits higher accuracy in expression classification and expression value prediction. The average Pearson correlation coefficients (PCCs) for predicting gene expression using gene promoter proximal interactions, proximal-distal interactions, and both proximal and distal interactions were 0.818, 0.625, and 0.929, respectively, representing an increase of 0.357, 0.16, and 0.469 over the PCCs obtained with traditional methods that use only gene proximal sequences. Some important motifs were identified through DeepCBA; they were enriched in open chromatin regions and expression quantitative trait loci and showed clear tissue specificity. Importantly, experimental results for the maize flowering-related gene ZmRap2.7 and the tillering-related gene ZmTb1 demonstrated the feasibility of DeepCBA for exploration of regulatory elements that affect gene expression. Moreover, promoter editing and verification of two reported genes (ZmCLE7 and ZmVTE4) demonstrated the utility of DeepCBA for the precise design of gene expression and even for future intelligent breeding. DeepCBA is available at http://www.deepcba.com/ or http://124.220.197.196/.

References
1.
Grant C, Bailey T, Noble W . FIMO: scanning for occurrences of a given motif. Bioinformatics. 2011; 27(7):1017-8. PMC: 3065696. DOI: 10.1093/bioinformatics/btr064. View

2.
Tasaki S, Gaiteri C, Mostafavi S, Wang Y . Deep learning decodes the principles of differential gene expression. Nat Mach Intell. 2020; 2(7):376-386. PMC: 7363043. DOI: 10.1038/s42256-020-0201-6. View

3.
Woodhouse M, Cannon E, Portwood 2nd J, Harper L, Gardiner J, Schaeffer M . A pan-genomic approach to genome databases using maize as a model system. BMC Plant Biol. 2021; 21(1):385. PMC: 8377966. DOI: 10.1186/s12870-021-03173-5. View

4.
Ricci W, Lu Z, Ji L, Marand A, Ethridge C, Murphy N . Widespread long-range cis-regulatory elements in the maize genome. Nat Plants. 2019; 5(12):1237-1249. PMC: 6904520. DOI: 10.1038/s41477-019-0547-0. View

5.
Schmidt F, Gasparoni N, Gasparoni G, Gianmoena K, Cadenas C, Polansky J . Combining transcription factor binding affinities with open-chromatin data for accurate gene expression prediction. Nucleic Acids Res. 2016; 45(1):54-66. PMC: 5224477. DOI: 10.1093/nar/gkw1061. View