Differential Requirements for MRNA Folding Partially Explain Why Highly Expressed Proteins Evolve Slowly
Overview
Affiliations
The cause of the tremendous among-protein variation in the rate of sequence evolution is a central subject of molecular evolution. Expression level has been identified as a leading determinant of this variation among genes encoded in the same genome, but the underlying mechanisms are not fully understood. We here propose and demonstrate that a requirement for stronger folding of more abundant mRNAs results in slower evolution of more highly expressed genes and proteins. Specifically, we show that: (i) the higher the expression level of a gene, the greater the selective pressure for its mRNA to fold; (ii) random mutations are more likely to decrease mRNA folding when occurring in highly expressed genes than in lowly expressed genes; and (iii) amino acid substitution rate is negatively correlated with mRNA folding strength, with or without the control of expression level. Furthermore, synonymous (d(S)) and nonsynonymous (d(N)) nucleotide substitution rates are both negatively correlated with mRNA folding strength. However, counterintuitively, d(S) and d(N) are differentially constrained by selection for mRNA folding, resulting in a significant correlation between mRNA folding strength and d(N)/d(S), even when gene expression level is controlled. The direction and magnitude of this correlation is determined primarily by the G+C frequency at third codon positions. Together, these findings explain why highly expressed genes evolve slowly, demonstrate a major role of natural selection at the mRNA level in constraining protein evolution, and reveal a previously unrecognized and unexpected form of nonprotein-level selection that impacts d(N)/d(S).
Hao J, Liang Y, Wang T, Su Y BMC Plant Biol. 2025; 25(1):134.
PMID: 39893444 PMC: 11786343. DOI: 10.1186/s12870-025-06157-x.
Synthetic rational design of live-attenuated Zika viruses based on a computational model.
Roopin M, Zafrir Z, Siridechadilok B, Suphatrakul A, Julander J, Tuller T Nucleic Acids Res. 2025; 53(2).
PMID: 39797731 PMC: 11724363. DOI: 10.1093/nar/gkae1313.
Further Evidence for Strong Nonneutrality of Yeast Synonymous Mutations.
Shen X, Song S, Li C, Zhang J Mol Biol Evol. 2024; 41(11).
PMID: 39467337 PMC: 11562845. DOI: 10.1093/molbev/msae224.
Codon Usage Bias: A Potential Factor Affecting VGLUT Developmental Expression and Protein Evolution.
Zhao Y, Zhang Y, Feng J, He Z, Li T Mol Neurobiol. 2024; 62(3):3508-3522.
PMID: 39305444 DOI: 10.1007/s12035-024-04426-8.
Patterns of Change in Nucleotide Diversity Over Gene Length.
Ali F Genome Biol Evol. 2024; 16(4).
PMID: 38608148 PMC: 11040516. DOI: 10.1093/gbe/evae078.