» Articles » PMID: 33495453

Pan-cancer Analysis of Transcripts Encoding Novel Open-reading Frames (nORFs) and Their Potential Biological Functions

Abstract

Uncharacterized and unannotated open-reading frames, which we refer to as novel open reading frames (nORFs), may sometimes encode peptides that remain unexplored for novel therapeutic opportunities. To our knowledge, no systematic identification and characterization of transcripts encoding nORFs or their translation products in cancer, or in any other physiological process has been performed. We use our curated nORFs database (nORFs.org), together with RNA-Seq data from The Cancer Genome Atlas (TCGA) and Genotype-Expression (GTEx) consortiums, to identify transcripts containing nORFs that are expressed frequently in cancer or matched normal tissue across 22 cancer types. We show nORFs are subject to extensive dysregulation at the transcript level in cancer tissue and that a small subset of nORFs are associated with overall patient survival, suggesting that nORFs may have prognostic value. We also show that nORF products can form protein-like structures with post-translational modifications. Finally, we perform in silico screening for inhibitors against nORF-encoded proteins that are disrupted in stomach and esophageal cancer, showing that they can potentially be targeted by inhibitors. We hope this work will guide and motivate future studies that perform in-depth characterization of nORF functions in cancer and other diseases.

Citing Articles

Exploring the Dark Matter of Human Proteome: The Emerging Role of Non-Canonical Open Reading Frame (ncORF) in Cancer Diagnosis, Biology, and Therapy.

Ge A, Chan C, Yang X Cancers (Basel). 2024; 16(15).

PMID: 39123386 PMC: 11311765. DOI: 10.3390/cancers16152660.


Noncanonical microprotein regulation of immunity.

Nichols C, Do-Thi V, Peltier D Mol Ther. 2024; 32(9):2905-2929.

PMID: 38734902 PMC: 11403233. DOI: 10.1016/j.ymthe.2024.05.021.


Leveraging a disulfidptosis‑related lncRNAs signature for predicting the prognosis and immunotherapy of glioma.

Chen D, Li Q, Xu Y, Wei Y, Li J, Zhu X Cancer Cell Int. 2023; 23(1):316.

PMID: 38066643 PMC: 10709922. DOI: 10.1186/s12935-023-03147-7.


Microproteins-Discovery, structure, and function.

Mohsen J, Martel A, Slavoff S Proteomics. 2023; 23(23-24):e2100211.

PMID: 37603371 PMC: 10841188. DOI: 10.1002/pmic.202100211.


What Can Ribo-Seq, Immunopeptidomics, and Proteomics Tell Us About the Noncanonical Proteome?.

Prensner J, Abelin J, Kok L, Clauser K, Mudge J, Ruiz-Orera J Mol Cell Proteomics. 2023; 22(9):100631.

PMID: 37572790 PMC: 10506109. DOI: 10.1016/j.mcpro.2023.100631.


References
1.
Dosztanyi Z, Meszaros B, Simon I . ANCHOR: web server for predicting protein binding regions in disordered proteins. Bioinformatics. 2009; 25(20):2745-6. PMC: 2759549. DOI: 10.1093/bioinformatics/btp518. View

2.
Jones P, Binns D, Chang H, Fraser M, Li W, McAnulla C . InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014; 30(9):1236-40. PMC: 3998142. DOI: 10.1093/bioinformatics/btu031. View

3.
Vivian J, Rao A, Nothaft F, Ketchum C, Armstrong J, Novak A . Toil enables reproducible, open source, big biomedical data analyses. Nat Biotechnol. 2017; 35(4):314-316. PMC: 5546205. DOI: 10.1038/nbt.3772. View

4.
Brunet M, Brunelle M, Lucier J, Delcourt V, Levesque M, Grenier F . OpenProt: a more comprehensive guide to explore eukaryotic coding potential and proteomes. Nucleic Acids Res. 2018; 47(D1):D403-D410. PMC: 6323990. DOI: 10.1093/nar/gky936. View

5.
Ji Z, Song R, Regev A, Struhl K . Many lncRNAs, 5'UTRs, and pseudogenes are translated and some are likely to express functional proteins. Elife. 2015; 4:e08890. PMC: 4739776. DOI: 10.7554/eLife.08890. View