» Articles » PMID: 28138077

Both Maintenance and Avoidance of RNA-Binding Protein Interactions Constrain Coding Sequence Evolution

Overview
Journal Mol Biol Evol
Specialty Biology
Date 2017 Feb 1
PMID 28138077
Citations 16
Authors
Affiliations
Soon will be listed here.
Abstract

While the principal force directing coding sequence (CDS) evolution is selection on protein function, to ensure correct gene expression CDSs must also maintain interactions with RNA-binding proteins (RBPs). Understanding how our genes are shaped by these RNA-level pressures is necessary for diagnostics and for improving transgenes. However, the evolutionary impact of the need to maintain RBP interactions remains unresolved. Are coding sequences constrained by the need to specify RBP binding motifs? If so, what proportion of mutations are affected? Might sequence evolution also be constrained by the need not to specify motifs that might attract unwanted binding, for instance because it would interfere with exon definition? Here, we have scanned human CDSs for motifs that have been experimentally determined to be recognized by RBPs. We observe two sets of motifs-those that are enriched over nucleotide-controlled null and those that are depleted. Importantly, the depleted set is enriched for motifs recognized by non-CDS binding RBPs. Supporting the functional relevance of our observations, we find that motifs that are more enriched are also slower-evolving. The net effect of this selection to preserve is a reduction in the over-all rate of synonymous evolution of 2-3% in both primates and rodents. Stronger motif depletion, on the other hand, is associated with stronger selection against motif gain in evolution. The challenge faced by our CDSs is therefore not only one of attracting the right RBPs but also of avoiding the wrong ones, all while also evolving under selection pressures related to protein structure.

Citing Articles

Interpreting mammalian synonymous site conservation in light of the unwanted transcript hypothesis.

Christmas M, Dong M, Meadows J, Kozyrev S, Lindblad-Toh K Nat Commun. 2025; 16(1):2007.

PMID: 40011430 PMC: 11865589. DOI: 10.1038/s41467-025-57179-w.


Selection on synonymous sites: the unwanted transcript hypothesis.

Radrizzani S, Kudla G, Izsvak Z, Hurst L Nat Rev Genet. 2024; 25(6):431-448.

PMID: 38297070 DOI: 10.1038/s41576-023-00686-7.


Molecular characteristics and clinical implications of serine/arginine-rich splicing factors in human cancer.

Zhang J, Fang Z, Song C Aging (Albany NY). 2023; 15(22):13287-13311.

PMID: 38015716 PMC: 10713412. DOI: 10.18632/aging.205241.


Protein Binding to Cis-Motifs in mRNAs Coding Sequence Is Common and Regulates Transcript Stability and the Rate of Translation.

Grzybowska E, Wakula M Cells. 2021; 10(11).

PMID: 34831133 PMC: 8616275. DOI: 10.3390/cells10112910.


Synonymous variants that disrupt messenger RNA structure are significantly constrained in the human population.

Gaither J, Lammi G, Li J, Gordon D, Kuck H, Kelly B Gigascience. 2021; 10(4).

PMID: 33822938 PMC: 8023685. DOI: 10.1093/gigascience/giab023.


References
1.
Kao D, Aldridge G, Weiler I, Greenough W . Altered mRNA transport, docking, and protein translation in neurons lacking fragile X mental retardation protein. Proc Natl Acad Sci U S A. 2010; 107(35):15601-6. PMC: 2932564. DOI: 10.1073/pnas.1010564107. View

2.
Zhou Z, Fu X . Regulation of splicing by SR proteins and SR protein-specific kinases. Chromosoma. 2013; 122(3):191-207. PMC: 3660409. DOI: 10.1007/s00412-013-0407-z. View

3.
Li X, Kazan H, Lipshitz H, Morris Q . Finding the target sites of RNA-binding proteins. Wiley Interdiscip Rev RNA. 2013; 5(1):111-30. PMC: 4253089. DOI: 10.1002/wrna.1201. View

4.
Muller-McNicoll M, Neugebauer K . How cells get the message: dynamic assembly and function of mRNA-protein complexes. Nat Rev Genet. 2013; 14(4):275-87. DOI: 10.1038/nrg3434. View

5.
Pedersen J, Bejerano G, Siepel A, Rosenbloom K, Lindblad-Toh K, Lander E . Identification and classification of conserved RNA secondary structures in the human genome. PLoS Comput Biol. 2006; 2(4):e33. PMC: 1440920. DOI: 10.1371/journal.pcbi.0020033. View