Non-coding RNA Prediction and Verification in Saccharomyces Cerevisiae
Overview
Authors
Affiliations
Non-coding RNA (ncRNA) play an important and varied role in cellular function. A significant amount of research has been devoted to computational prediction of these genes from genomic sequence, but the ability to do so has remained elusive due to a lack of apparent genomic features. In this work, thermodynamic stability of ncRNA structural elements, as summarized in a Z-score, is used to predict ncRNA in the yeast Saccharomyces cerevisiae. This analysis was coupled with comparative genomics to search for ncRNA genes on chromosome six of S. cerevisiae and S. bayanus. Sets of positive and negative control genes were evaluated to determine the efficacy of thermodynamic stability for discriminating ncRNA from background sequence. The effect of window sizes and step sizes on the sensitivity of ncRNA identification was also explored. Non-coding RNA gene candidates, common to both S. cerevisiae and S. bayanus, were verified using northern blot analysis, rapid amplification of cDNA ends (RACE), and publicly available cDNA library data. Four ncRNA transcripts are well supported by experimental data (RUF10, RUF11, RUF12, RUF13), while one additional putative ncRNA transcript is well supported but the data are not entirely conclusive. Six candidates appear to be structural elements in 5' or 3' untranslated regions of annotated protein-coding genes. This work shows that thermodynamic stability, coupled with comparative genomics, can be used to predict ncRNA with significant structural elements.
Mathur K, Singh B, Puria R, Nain V Arch Microbiol. 2024; 206(6):253.
PMID: 38727738 DOI: 10.1007/s00203-024-03969-7.
Discovery of 17 conserved structural RNAs in fungi.
Gao W, Jones T, Rivas E Nucleic Acids Res. 2021; 49(11):6128-6143.
PMID: 34086938 PMC: 8216456. DOI: 10.1093/nar/gkab355.
Large-scale profiling of noncoding RNA function in yeast.
Parker S, Fraczek M, Wu J, Shamsah S, Manousaki A, Dungrattanalert K PLoS Genet. 2018; 14(3):e1007253.
PMID: 29529031 PMC: 5864082. DOI: 10.1371/journal.pgen.1007253.
RNAStructuromeDB: A genome-wide database for RNA structural inference.
Andrews R, Baber L, Moss W Sci Rep. 2017; 7(1):17269.
PMID: 29222504 PMC: 5722888. DOI: 10.1038/s41598-017-17510-y.
Mycoplasma non-coding RNA: identification of small RNAs and targets.
Siqueira F, Loss de Morais G, Higashi S, Beier L, Breyer G, de Sa Godinho C BMC Genomics. 2016; 17(Suppl 8):743.
PMID: 27801290 PMC: 5088518. DOI: 10.1186/s12864-016-3061-z.