Computational Analysis of Functional Long Noncoding RNAs Reveals Lack of Peptide-coding Capacity and Parallels with 3' UTRs
Recent transcriptome analyses have indicated that a large part of mammalian genomes is transcribed into long non-protein-coding RNAs (lncRNAs). However, only a very small fraction of them have been individually studied, and whether the majority of lncRNAs found in large-scale studies have a cellular role is debated. To gain insight into the sequence features and genomic architecture of the subset of lncRNAs that have been proven to be functional, we created a database of studied lncRNAs manually culled from the literature, along with a parallel database containing all annotated protein-coding human RNAs. The Functional lncRNA Database, which contains 204 lncRNAs and their splicing variants, is available at valadkhanlab.org/database. Analysis of the lncRNAs and their comparison to protein-coding transcripts revealed sequence features, including paucity of introns and low GC content in lncRNAs, that could explain several biological characteristics of these transcripts, such as their nuclear localization and low expression level. The predicted ORFs in lncRNAs have poor start codon and ORF contexts, which would lead to activation of the nonsense-mediated decay pathway and thus make it unlikely for most lncRNAs to code for even short peptides. Interestingly, our analyses revealed significant similarities between the lncRNAs and the 3' untranslated regions (3' UTRs) of protein-coding RNAs in structural features and sequence composition. The presence of these intriguing parallels between the lncRNAs and 3' UTRs, which constitute the two main components of the RNA-mediated cellular regulatory system, indicates that highly similar evolutionary constraints govern the function of regulatory RNA sequences in the cell.
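Two of the sequence features named above, GC content and predicted ORFs, can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names `gc_content` and `longest_orf` are hypothetical, only the three forward reading frames are scanned, and start codon context and nonsense-mediated decay are not modeled.

```python
# Illustrative sketch only; names and simplifications are assumptions,
# not the method used in the study.

STOP_CODONS = {"TAA", "TAG", "TGA"}


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a sequence (0.0 if empty)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def longest_orf(seq: str) -> str:
    """Return the longest ATG-to-stop ORF in the three forward frames.

    The stop codon is included in the returned string. RNA input is
    accepted by converting U to T. Returns "" when no complete ORF exists.
    """
    seq = seq.upper().replace("U", "T")
    best = ""
    for frame in range(3):
        start = None  # index of the first ATG seen since the last stop
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                orf = seq[start:i + 3]
                if len(orf) > len(best):
                    best = orf
                start = None  # resume the search after this stop codon
    return best
```

Applied to every transcript in a lncRNA set and in a protein-coding set, functions like these would give per-transcript GC fractions and ORF lengths that could then be compared between the two groups.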