Alternative ORFs and Small ORFs: Shedding Light on the Dark Proteome
Overview
Traditional annotation of protein-encoding genes relied on assumptions such as that each open reading frame (ORF) encodes a single protein and that translated proteins exceed a minimal length. With the serendipitous discoveries of translated ORFs encoded upstream and downstream of annotated ORFs, from alternative start sites nested within annotated ORFs, and from RNAs previously considered noncoding, it is becoming clear that these initial assumptions are incorrect. These findings have led to the realization that genetic information is more densely coded and that the proteome is more complex than previously anticipated. As a result, interest in identifying and characterizing the previously ignored 'dark proteome' is increasing, though we note that research in eukaryotes and bacteria has largely progressed in isolation. To bridge this gap and illustrate exciting findings emerging from studies of the dark proteome, we highlight recent advances in both eukaryotic and bacterial cells. We discuss progress in the detection of alternative ORFs and in the understanding of their functions and the regulation of their expression, and we pose questions for future work.
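To make the effect of a minimal-length assumption concrete, the following is a minimal sketch, not a method taken from the article. It scans the three forward reading frames of a made-up toy sequence for ATG-to-stop ORFs and shows how a length cutoff hides an upstream small ORF and an out-of-frame ORF nested inside the longer one. The function name `find_orfs`, the parameter `min_codons`, the toy sequence, and the cutoff of 5 codons are all illustrative assumptions. The sketch ignores the reverse strand, non-ATG start codons, and in-frame alternative starts.

```python
# Standard-code stop codons (DNA alphabet).
STOP_CODONS = {"TAA", "TAG", "TGA"}

def find_orfs(seq, min_codons=0):
    """Return sorted (start, end, frame, n_codons) tuples for forward-strand ORFs.

    An ORF here runs from the first ATG in a frame to the next in-frame stop.
    `end` is exclusive and includes the stop codon; `n_codons` excludes it.
    Hypothetical helper for illustration only.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        # Step through complete codons in this frame.
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                n_codons = (i - start) // 3
                if n_codons >= min_codons:
                    orfs.append((start, i + 3, frame, n_codons))
                start = None
    return sorted(orfs)

# Toy sequence (invented): a 2-codon upstream ORF, then a 7-codon "main" ORF
# that contains a 2-codon ORF in a different reading frame.
toy = "ATGAAATAA" + "CC" + "ATGGCATGCCTTAAGGCCGGGTAA"

all_orfs = find_orfs(toy)                 # no length assumption
long_only = find_orfs(toy, min_codons=5)  # arbitrary cutoff for illustration

print(all_orfs)
print(long_only)
```

Without a cutoff the scan reports three ORFs in three different frames. With the cutoff, only the longest one remains, which mirrors how a length threshold can leave small and overlapping ORFs unannotated.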