Classification and Clustering of RNA Crosslink-ligation Data Reveal Complex Structures and Homodimers
Overview
Authors
Affiliations
The recent development and application of methods based on the general principle of "crosslinking and proximity ligation" (crosslink-ligation) are revolutionizing RNA structure studies in living cells. However, extracting structure information from such data presents unique challenges. Here, we introduce a set of computational tools for the systematic analysis of data from a wide variety of crosslink-ligation methods, specifically focusing on read mapping, alignment classification, and clustering. We design a new strategy to map short reads with irregular gaps at high sensitivity and specificity. Analysis of previously published data reveals distinct properties and bias caused by the crosslinking reactions. We perform rigorous and exhaustive classification of alignments and discover eight types of arrangements that provide distinct information on RNA structures and interactions. To deconvolve the dense and intertwined gapped alignments, we develop a network/graph-based tool Crosslinked RNA Secondary Structure Analysis using Network Techniques (CRSSANT), which enables clustering of gapped alignments and discovery of new alternative and dynamic conformations. We discover that multiple crosslinking and ligation events can occur on the same RNA, generating multisegment alignments to report complex high-level RNA structures and multi-RNA interactions. We find that alignments with overlapped segments are produced from potential homodimers and develop a new method for their de novo identification. Analysis of overlapping alignments revealed potential new homodimers in cellular noncoding RNAs and RNA virus genomes in the family. Together, this suite of computational tools enables rapid and efficient analysis of RNA structure and interaction data in living cells.
snoRNA-facilitated protein secretion revealed by transcriptome-wide snoRNA target identification.
Liu B, Wu T, Miao B, Ji F, Liu S, Wang P Cell. 2024; 188(2):465-483.e22.
PMID: 39579764 PMC: 11761385. DOI: 10.1016/j.cell.2024.10.046.
Identification of RNA structures and their roles in RNA functions.
Cao X, Zhang Y, Ding Y, Wan Y Nat Rev Mol Cell Biol. 2024; 25(10):784-801.
PMID: 38926530 DOI: 10.1038/s41580-024-00748-6.
Price J, Ziv O, Pinckert M, Lim A, Miska E Bioinformatics. 2024; 40(4).
PMID: 38597883 PMC: 11060868. DOI: 10.1093/bioinformatics/btae193.
Chemical crosslinking and ligation methods for in vivo analysis of RNA structures and interactions.
Lee W, Li K, Lu Z Methods Enzymol. 2023; 691:253-281.
PMID: 37914449 PMC: 10994722. DOI: 10.1016/bs.mie.2023.02.020.
A snoRNA-tRNA modification network governs codon-biased cellular states.
Zhang M, Li K, Bai J, Van Damme R, Zhang W, Alba M Proc Natl Acad Sci U S A. 2023; 120(41):e2312126120.
PMID: 37792516 PMC: 10576143. DOI: 10.1073/pnas.2312126120.