Thousands of Human Non-AUG Extended Proteoforms Lack Evidence of Evolutionary Selection Among Mammals
Overview
Affiliations
The synthesis of most proteins begins at AUG codons, yet a small number of non-AUG initiated proteoforms are also known. Here we analyse a large number of publicly available Ribo-seq datasets to identify novel, previously uncharacterised non-AUG proteoforms using Trips-Viz implementation of a novel algorithm for detecting translated ORFs. In parallel we analyse genomic alignment of 120 mammals to identify evidence of protein coding evolution in sequences encoding potential extensions. Unexpectedly we find that the number of non-AUG proteoforms identified with ribosome profiling data greatly exceeds those with strong phylogenetic support suggesting their recent evolution. Our study argues that the protein coding potential of human genome greatly exceeds that detectable through comparative genomics and exposes the existence of multiple proteins encoded by the same genomic loci.
Rodriguez J, Maquedano M, Cerdan-Velez D, Calvo E, Vazquez J, Tress M bioRxiv. 2024; .
PMID: 39605392 PMC: 11601488. DOI: 10.1101/2024.11.14.623419.
Lee P, Sun Y, Soares A, Fai C, Picciotto M, Guo J Mol Cell. 2024; 84(20):3967-3978.e8.
PMID: 39317199 PMC: 11490368. DOI: 10.1016/j.molcel.2024.08.032.
Evidence for widespread translation of 5' untranslated regions.
Rodriguez J, Abascal F, Cerdan-Velez D, Gomez L, Vazquez J, Tress M Nucleic Acids Res. 2024; 52(14):8112-8126.
PMID: 38953162 PMC: 11317171. DOI: 10.1093/nar/gkae571.
Upstream open reading frames: new players in the landscape of cancer gene regulation.
Dasgupta A, Prensner J NAR Cancer. 2024; 6(2):zcae023.
PMID: 38774471 PMC: 11106035. DOI: 10.1093/narcan/zcae023.
Ribosome decision graphs for the representation of eukaryotic RNA translation complexity.
Tierney J, Swirski M, Tjeldnes H, Mudge J, Kufel J, Whiffin N Genome Res. 2024; 34(4):530-538.
PMID: 38719470 PMC: 11146595. DOI: 10.1101/gr.278810.123.