Widespread Non-modular Overlapping Codes in the Coding Regions
Messenger RNAs (mRNAs) consist of a coding region, the open reading frame (ORF), flanked by two untranslated regions (UTRs): the 5'UTR and the 3'UTR. Ribosomes travel along the coding region, translating nucleotide triplets (codons) into a chain of amino acids. The coding region was long believed to encode mainly the amino acid content of proteins, whereas regulatory signals were thought to reside in the UTRs and in other genomic regions. However, in recent years we have learned that the ORF is densely populated with various regulatory signals, or codes, which are related to all gene expression steps and to additional intracellular processes. In this paper, we review the current knowledge of overlapping codes inside the coding regions, such as the influence of synonymous codon usage on translation speed (and, in turn, the effect of translation speed on protein folding), ribosomal frameshifting, mRNA stability, methylation, splicing, transcription and more. All these codes overlap in the ORF sequence, ensuring production of the right protein at the right time.
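To illustrate why the ORF has room for codes beyond the amino acid sequence, the following minimal Python sketch translates two different coding sequences that yield the same peptide. It is an illustration only, not taken from the paper: the function name `translate` and the two toy sequences are hypothetical, the sequences are written in the DNA alphabet (T in place of U), and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (the third base varies fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(orf):
    """Translate an ORF codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(orf) - 2, 3):
        aa = CODON_TABLE[orf[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Two hypothetical toy ORFs that differ only in synonymous codons
# (and in which stop codon they use).
orf_a = "ATGGCTCTGAAATAA"
orf_b = "ATGGCCTTAAAGTGA"

print(translate(orf_a))
print(translate(orf_b))
```

Both toy sequences encode the same peptide although four of their five codons differ. That freedom in synonymous codon choice is the degree of freedom that overlapping signals, such as those affecting translation speed or mRNA stability, can occupy without changing the protein.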
Modeling coding sequence design for virus-based expression in tobacco.
Burghardt M, Tuller T Synth Syst Biotechnol. 2025; 10(2):337-345.
PMID: 39802156 PMC: 11718241. DOI: 10.1016/j.synbio.2024.12.002.
Predicting gene sequences with AI to study codon usage patterns.
Sidi T, Bahiri-Elitzur S, Tuller T, Kolodny R Proc Natl Acad Sci U S A. 2024; 122(1):e2410003121.
PMID: 39739812 PMC: 11725940. DOI: 10.1073/pnas.2410003121.
Codon usage and expression-based features significantly improve prediction of CRISPR efficiency.
Bergman S, Tuller T NPJ Syst Biol Appl. 2024; 10(1):100.
PMID: 39227603 PMC: 11372048. DOI: 10.1038/s41540-024-00431-8.
Lynn N, Tuller T NPJ Syst Biol Appl. 2024; 10(1):25.
PMID: 38453965 PMC: 10920900. DOI: 10.1038/s41540-024-00351-7.
The Effects of Codon Usage on Protein Structure and Folding.
Moss M, Chamness L, Clark P Annu Rev Biophys. 2023; 53(1):87-108.
PMID: 38134335 PMC: 11227313. DOI: 10.1146/annurev-biophys-030722-020555.