
Identification and Characterization of Novel Conserved RNA Structures in Drosophila

Overview
Journal BMC Genomics
Publisher BioMed Central
Specialty Genetics
Date 2018 Dec 13
PMID 30537930
Citations 5
Abstract

Background: Comparative genomics approaches have facilitated the discovery of many novel non-coding and structured RNAs (ncRNAs). The increasing availability of related genomes now makes it possible to systematically search for compensatory base changes - and thus for conserved secondary structures - even in genomic regions that are poorly alignable in the primary sequence. The wealth of available transcriptome data can add valuable insight into expression and possible function for new ncRNA candidates. Earlier work identifying ncRNAs in Drosophila melanogaster made use of sequence-based alignments and employed a sliding window approach, inevitably biasing identification toward RNAs encoded in the more conserved parts of the genome.
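As an illustration of what searching for compensatory base changes involves, the following minimal Python sketch scans the paired columns of a consensus structure and reports those where both positions vary across species while every observed pair remains a valid base pair. It is not the screening method used in the study; the function names, the toy alignment, and the dot-bracket consensus structure are hypothetical and chosen only for illustration.

```python
# Canonical and G-U wobble pairs that can form in an RNA helix.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def paired_columns(structure):
    """Return (i, j) column pairs from a dot-bracket consensus structure."""
    stack, pairs = [], []
    for idx, ch in enumerate(structure):
        if ch == "(":
            stack.append(idx)
        elif ch == ")":
            pairs.append((stack.pop(), idx))
    return pairs


def compensatory_columns(alignment, structure):
    """List paired columns where both sides vary but pairing is preserved."""
    hits = []
    for i, j in paired_columns(structure):
        # Collect the pairs seen at these two columns, skipping gapped rows.
        observed = {
            (seq[i], seq[j])
            for seq in alignment
            if seq[i] != "-" and seq[j] != "-"
        }
        left = {a for a, _ in observed}
        right = {b for _, b in observed}
        # Both columns must vary, and every observed pair must be able to pair.
        if observed and observed <= PAIRS and len(left) > 1 and len(right) > 1:
            hits.append((i, j))
    return hits


# Hypothetical toy alignment: three sequences sharing one hairpin.
toy_alignment = [
    "GGCAUUCGCC",
    "GACAUUCGUC",
    "GCCAUUCGGC",
]
toy_structure = "(((....)))"

print(compensatory_columns(toy_alignment, toy_structure))
```

In the toy alignment, columns 1 and 8 (0-based) read G-C, A-U, and C-G: the sequence differs in every row, yet the pair is maintained, which is the signal a structure-aware screen can exploit even when primary sequence conservation is low.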

Results: To search for conserved RNA structures (CRSs) that may not be highly conserved in sequence and to assess the expression of CRSs, we conducted a genome-wide structural alignment screen of 27 insect genomes including D. melanogaster and integrated this with an extensive set of tiling array data. The structural alignment screen revealed ∼30,000 novel candidate CRSs at an estimated false discovery rate of less than 10%. With more than one quarter of all individual CRS motifs showing sequence identities below 60%, the predicted CRSs largely complement the findings of sliding window approaches applied previously. While a sixth of the CRSs were ubiquitously expressed, we found that most were expressed in specific developmental stages or cell lines. Notably, the most statistically significant enrichment of CRSs was observed in pupae, mainly in exons of untranslated regions, promoters, enhancers, and long ncRNAs. Interestingly, cell lines were found to express a different set of CRSs than were found in vivo. Only a small fraction of intergenic CRSs were co-expressed with the adjacent protein-coding genes, which suggests that most intergenic CRSs are independent genetic units.
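To make the 60% sequence identity threshold concrete, the sketch below computes a mean pairwise identity for a small alignment. The abstract does not state how identity was calculated, so the gap handling here (columns with a gap in either row are ignored) is an assumption, and the function names and toy data are hypothetical.

```python
from itertools import combinations


def pairwise_identity(a, b):
    """Fraction of identical residues over columns where neither row has a gap."""
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not compared:
        return 0.0
    return sum(x == y for x, y in compared) / len(compared)


def mean_pairwise_identity(alignment):
    """Average identity over all unordered pairs of aligned sequences."""
    pairs = list(combinations(alignment, 2))
    return sum(pairwise_identity(a, b) for a, b in pairs) / len(pairs)


# Hypothetical toy alignment with one gapped row.
toy_alignment = [
    "GGCAUUCGCC",
    "GACAUUCGUC",
    "GCCA-UCGGC",
]

identity = mean_pairwise_identity(toy_alignment)
# Assumed cutoff taken from the 60% figure quoted in the abstract.
print(f"mean pairwise identity: {identity:.2f}, below 60%: {identity < 0.6}")
```

A motif flagged as below the cutoff by a measure like this would be the kind of low-identity candidate that sequence-based sliding window screens tend to miss.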

Conclusions: This study provides a more comprehensive view of the ncRNA transcriptome in fly as well as evidence for differential expression of CRSs during development and in cell lines.

Citing Articles

Tailored machine learning models for functional RNA detection in genome-wide screens.

Klapproth C, Zotzsche S, Kuhnl F, Fallmann J, Stadler P, Findeiss S NAR Genom Bioinform. 2023; 5(3):lqad072.

PMID: 37608800 PMC: 10440787. DOI: 10.1093/nargab/lqad072.


In silico methods for predicting functional synonymous variants.

Lin B, Katneni U, Jankowska K, Meyer D, Kimchi-Sarfaty C Genome Biol. 2023; 24(1):126.

PMID: 37217943 PMC: 10204308. DOI: 10.1186/s13059-023-02966-1.


Synonymous variants that disrupt messenger RNA structure are significantly constrained in the human population.

Gaither J, Lammi G, Li J, Gordon D, Kuck H, Kelly B Gigascience. 2021; 10(4).

PMID: 33822938 PMC: 8023685. DOI: 10.1093/gigascience/giab023.


SSS-test: a novel test for detecting positive selection on RNA secondary structure.

Walter Costa M, Honer Zu Siederdissen C, Dunjic M, Stadler P, Nowick K BMC Bioinformatics. 2019; 20(1):151.

PMID: 30898084 PMC: 6429701. DOI: 10.1186/s12859-019-2711-y.


Transcriptomic analyses reveal groups of co-expressed, syntenic lncRNAs in four species of the genus Caenorhabditis.

Pegueroles C, Iraola-Guzman S, Chorostecki U, Ksiezopolska E, Saus E, Gabaldon T RNA Biol. 2019; 16(3):320-329.

PMID: 30691342 PMC: 6380332. DOI: 10.1080/15476286.2019.1572438.
