
Landscape of the Dark Transcriptome Revealed Through Re-mining Massive RNA-Seq Data

Overview
Journal: Front Genet
Date: 2021 Sep 6
PMID: 34484307
Citations: 10
Abstract

The "dark transcriptome" can be considered the multitude of sequences that are transcribed but not annotated as genes. We evaluated expression of 6,692 annotated genes and 29,354 unannotated open reading frames (ORFs) in the yeast genome across diverse environmental, genetic and developmental conditions (3,457 RNA-Seq samples). Over 30% of the highly transcribed ORFs have translation evidence. Phylostratigraphic analysis infers that most of these transcribed ORFs would encode species-specific proteins ("orphan-ORFs"); hundreds have mean expression comparable to that of annotated genes. These data reveal the unannotated ORFs most likely to be protein-coding genes. We partitioned a co-expression matrix by Markov Chain Clustering; the resultant clusters contain 2,468 orphan-ORFs. We provide the aggregated RNA-Seq yeast data with extensive metadata as a project in MetaOmGraph (MOG), a tool designed for interactive analysis and visualization. This approach enables reuse of public RNA-Seq data for exploratory discovery, providing a rich context for experimentalists to make novel, experimentally testable hypotheses about candidate genes.
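The clustering step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the correlation metric, the edge threshold, or the clustering parameters, so everything below is an assumption made for illustration: Pearson correlation, a hypothetical cutoff of 0.8, an inflation value of 2.0, and a plain NumPy version of Markov clustering (expansion followed by inflation) rather than the authors' actual pipeline. The function names `coexpression_adjacency` and `markov_cluster` are hypothetical.

```python
import numpy as np


def coexpression_adjacency(expr, threshold=0.8):
    """Build a binary co-expression graph from a features x samples matrix.

    Assumption: an edge exists when the Pearson correlation between two
    rows is at least `threshold` (0.8 is an illustrative value only).
    """
    corr = np.corrcoef(expr)
    adj = (corr >= threshold).astype(float)
    np.fill_diagonal(adj, 1.0)  # self-loops, as is customary before MCL
    return adj


def markov_cluster(adj, inflation=2.0, max_iter=100, tol=1e-6):
    """Partition a graph by alternating expansion and inflation.

    Returns a sorted list of clusters, each a tuple of row indices.
    """
    # Column-normalise so each column is a probability distribution.
    m = adj / adj.sum(axis=0, keepdims=True)
    for _ in range(max_iter):
        prev = m
        m = m @ m                    # expansion: two-step random walk
        m = m ** inflation           # inflation: favour strong flows
        m = m / m.sum(axis=0, keepdims=True)
        if np.abs(m - prev).max() < tol:
            break
    # Attractor rows keep mass on the diagonal; the columns with mass in
    # an attractor row are the members of that attractor's cluster.
    clusters = set()
    for i in np.flatnonzero(m.diagonal() > 1e-6):
        clusters.add(tuple(np.flatnonzero(m[i] > 1e-6).tolist()))
    return sorted(clusters)
```

Given an expression matrix with one row per gene or ORF and one column per RNA-Seq sample, `markov_cluster(coexpression_adjacency(expr))` returns groups of row indices; unannotated ORFs that land in the same group as annotated genes would be the kind of candidates the abstract describes.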

Citing Articles

An Interplay between Transcription Factors and Recombinant Protein Synthesis in at Transcriptional and Functional Levels-The Global View.

Gorczyca M, Korpys-Wozniak P, Celinska E Int J Mol Sci. 2024; 25(17).

PMID: 39273402 PMC: 11395014. DOI: 10.3390/ijms25179450.


Massively integrated coexpression analysis reveals transcriptional regulation, evolution and cellular implications of the yeast noncanonical translatome.

Rich A, Acar O, Carvunis A Genome Biol. 2024; 25(1):183.

PMID: 38978079 PMC: 11232214. DOI: 10.1186/s13059-024-03287-7.


Promoter recruitment drives the emergence of proto-genes in a long-term evolution experiment with Escherichia coli.

Uz-Zaman M, Dalton S, Barrick J, Ochman H PLoS Biol. 2024; 22(5):e3002418.

PMID: 38713714 PMC: 11101190. DOI: 10.1371/journal.pbio.3002418.


Large-scale Pan Genomic Analysis of Reveals Key Insights Into Molecular Evolutionary Rate of Specific Processes and Functions.

Bundhoo E, Ghoorah A, Jaufeerally-Fakim Y Evol Bioinform Online. 2024; 20:11769343241239463.

PMID: 38532808 PMC: 10964447. DOI: 10.1177/11769343241239463.


Thousands of Pristionchus pacificus orphan genes were integrated into developmental networks that respond to diverse environmental microbiota.

Athanasouli M, Akduman N, Roseler W, Theam P, Rodelsperger C PLoS Genet. 2023; 19(7):e1010832.

PMID: 37399201 PMC: 10348561. DOI: 10.1371/journal.pgen.1010832.

