RiboFSM: Frequent Subgraph Mining for the Discovery of RNA Structures and Interactions
Overview
Authors
Affiliations
Frequent subgraph mining is a useful method for extracting meaningful patterns from a set of graphs or a single large graph. Here, the graph represents all possible RNA structures and interactions. Patterns that are significantly more frequent in this graph over a random graph are extracted. We hypothesize that these patterns are most likely to represent biological mechanisms. The graph representation used is a directed dual graph, extended to handle intermolecular interactions. The graph is sampled for subgraphs, which are labeled using a canonical labeling method and counted. The resulting patterns are compared to those created from a randomized dataset and scored. The algorithm was applied to the mitochondrial genome of the kinetoplastid species Trypanosoma brucei, which has a unique RNA editing mechanism. The most significant patterns contain two stem-loops, indicative of gRNA, and represent interactions of these structures with target mRNA.
Fuzzy RNA recognition by the Trypanosoma brucei editosome.
Leeder W, Geyer F, Goringer H Nucleic Acids Res. 2022; 50(10):5818-5833.
PMID: 35580050 PMC: 9178004. DOI: 10.1093/nar/gkac357.
Mallik S, Zhao Z Brief Bioinform. 2019; 21(2):368-394.
PMID: 30649169 PMC: 7373185. DOI: 10.1093/bib/bby120.
RiboFSM: frequent subgraph mining for the discovery of RNA structures and interactions.
Gawronski A, Turcotte M BMC Bioinformatics. 2014; 15 Suppl 13:S2.
PMID: 25434643 PMC: 4248650. DOI: 10.1186/1471-2105-15-S13-S2.