Sequence Determinants of Polyadenylation-mediated Regulation

Overview
Journal Genome Res
Specialty Genetics
Date 2019 Sep 19
PMID 31530582
Citations 12
Abstract

The cleavage and polyadenylation reaction is a crucial step in transcription termination and pre-mRNA maturation in human cells. Despite extensive research, the encoding of polyadenylation-mediated regulation of gene expression within the DNA sequence is not well understood. Here, we utilized a massively parallel reporter assay to inspect the effect of over 12,000 rationally designed polyadenylation sequences (PASs) on reporter gene expression and cleavage efficiency. We find that the PAS sequence can modulate gene expression by over five orders of magnitude. By using a uniquely designed scanning mutagenesis data set, we gain mechanistic insight into various modes of action by which the cleavage efficiency affects the sensitivity or robustness of the PAS to mutation. Furthermore, we employ motif discovery to identify both known and novel sequence motifs associated with PAS-mediated regulation. By leveraging the large scale of our data, we train a deep learning model for the highly accurate prediction of RNA levels from DNA sequence alone ( = 0.83). Moreover, we devise unique approaches for predicting exact cleavage sites for our reporter constructs and for endogenous transcripts. Taken together, our results expand our understanding of PAS-mediated regulation, and provide an unprecedented resource for analyzing and predicting PAS for regulatory genomics applications.

Citing Articles

Predicting RNA-seq coverage from DNA sequence as a unifying model of gene regulation.

Linder J, Srivastava D, Yuan H, Agarwal V, Kelley D Nat Genet. 2025.

PMID: 39779956 DOI: 10.1038/s41588-024-02053-6.


Decoding biology with massively parallel reporter assays and machine learning.

La Fleur A, Shi Y, Seelig G Genes Dev. 2024; 38(17-20):843-865.

PMID: 39362779 PMC: 11535156. DOI: 10.1101/gad.351800.124.


Delineating yeast cleavage and polyadenylation signals using deep learning.

Stroup E, Ji Z Genome Res. 2024; 34(7):1066-1080.

PMID: 38914436 PMC: 11368178. DOI: 10.1101/gr.278606.123.


Deep learning of human polyadenylation sites at nucleotide resolution reveals molecular determinants of site usage and relevance in disease.

Stroup E, Ji Z Nat Commun. 2023; 14(1):7378.

PMID: 37968271 PMC: 10651852. DOI: 10.1038/s41467-023-43266-3.


Stress responses of plants through transcriptome plasticity by mRNA alternative polyadenylation.

Zhou J, Li Q Mol Hortic. 2023; 3(1):19.

PMID: 37789388 PMC: 10536700. DOI: 10.1186/s43897-023-00066-z.

