Evolutionary Analysis Across Mammals Reveals Distinct Classes of Long Non-coding RNAs
Overview
Authors
Affiliations
Background: Recent advances in transcriptome sequencing have enabled the discovery of thousands of long non-coding RNAs (lncRNAs) across many species. Though several lncRNAs have been shown to play important roles in diverse biological processes, the functions and mechanisms of most lncRNAs remain unknown. Two significant obstacles lie between transcriptome sequencing and functional characterization of lncRNAs: identifying truly non-coding genes from de novo reconstructed transcriptomes, and prioritizing the hundreds of resulting putative lncRNAs for downstream experimental interrogation.
Results: We present slncky, a lncRNA discovery tool that produces a high-quality set of lncRNAs from RNA-sequencing data and further uses evolutionary constraint to prioritize lncRNAs that are likely to be functionally important. Our automated filtering pipeline is comparable to manual curation efforts and more sensitive than previously published computational approaches. Furthermore, we developed a sensitive alignment pipeline for aligning lncRNA loci and propose new evolutionary metrics relevant for analyzing sequence and transcript evolution. Our analysis reveals that evolutionary selection acts in several distinct patterns, and uncovers two notable classes of intergenic lncRNAs: one showing strong purifying selection on RNA sequence and another where constraint is restricted to the regulation but not the sequence of the transcript.
Conclusion: Our results highlight that lncRNAs are not a homogenous class of molecules but rather a mixture of multiple functional classes with distinct biological mechanism and/or roles. Our novel comparative methods for lncRNAs reveals 233 constrained lncRNAs out of tens of thousands of currently annotated transcripts, which we make available through the slncky Evolution Browser.
Epigenetic Role of Long Non-coding RNAs in Multiple Myeloma.
Mehra N, Sundaram S, Shah P, Rao A Curr Oncol Rep. 2025; 27(1):37-44.
PMID: 39776330 DOI: 10.1007/s11912-024-01623-5.
Computational Resources for lncRNA Functions and Targetome.
Thakur A, Kumar M Methods Mol Biol. 2024; 2883:299-323.
PMID: 39702714 DOI: 10.1007/978-1-0716-4290-0_13.
Brain multi-omic Mendelian randomisation to identify novel drug targets for gliomagenesis.
Thornton Z, Andrews L, Zhao H, Zheng J, Paternoster L, Robinson J Hum Mol Genet. 2024; 34(2):178-192.
PMID: 39565278 PMC: 11780873. DOI: 10.1093/hmg/ddae168.
Quest for Orthologs in the Era of Biodiversity Genomics.
Langschied F, Bordin N, Cosentino S, Fuentes-Palacios D, Glover N, Hiller M Genome Biol Evol. 2024; 16(10).
PMID: 39404012 PMC: 11523110. DOI: 10.1093/gbe/evae224.
Exploring the Utility of Long Non-Coding RNAs for Assessing the Health Consequences of Vaping.
Besaratinia A, Blumenfeld H, Tommasi S Int J Mol Sci. 2024; 25(15).
PMID: 39126120 PMC: 11313266. DOI: 10.3390/ijms25158554.