Mining Frequent Stem Patterns from Unaligned RNA Sequences
Motivation: In the detection of non-coding RNAs, it is often necessary to identify secondary structure motifs in a set of putative RNA sequences. Most existing algorithms aim to report the single best motif or a few good motifs, but biologists often need to inspect all possible motifs thoroughly.
Results: Our method, RNAmine, employs a graph-theoretic representation of RNA sequences and detects all possible motifs exhaustively using a graph mining algorithm. The motif detection problem reduces to finding frequently appearing patterns in a set of directed, labeled graphs. In the tasks of common secondary structure prediction and local motif detection from long sequences, our method compared favorably with state-of-the-art methods such as CMFinder in both accuracy and efficiency.
Availability: The software is available upon request.
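The graph formulation described under Results can be illustrated with a small sketch. It is not the RNAmine implementation: the fixed stem length `k`, the minimum loop size, the relation labels `"N"` (nested) and `"P"` (downstream), and all function names are assumptions made for illustration. Each sequence becomes a directed, labeled graph whose nodes are candidate stems, and only the first level of frequent pattern mining (single labeled edges) is counted; a full graph miner would grow these into larger patterns.

```python
from collections import defaultdict
from itertools import combinations

# Watson-Crick and G-U wobble pairs (illustrative pairing rule)
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def find_stems(seq, k=3, min_loop=3):
    """Return candidate stems (i, j, label): seq[i:i+k] pairs with seq[j:j+k] read backwards."""
    stems = []
    n = len(seq)
    for i in range(n - 2 * k - min_loop + 1):
        for j in range(i + k + min_loop, n - k + 1):
            left, right = seq[i:i + k], seq[j:j + k]
            if all((a, b) in PAIRS for a, b in zip(left, reversed(right))):
                stems.append((i, j, left + "/" + right))
    return stems


def relation(s, t, k=3):
    """Label how stem t lies relative to stem s (s starts first), or None if they clash."""
    (i1, j1, _), (i2, j2, _) = s, t
    if i1 + k <= i2 and j2 + k <= j1:
        return "N"  # t is nested inside s
    if j1 + k <= i2:
        return "P"  # t lies entirely downstream of s
    return None  # overlapping or crossing stems are skipped in this sketch


def stem_graph(seq, k=3, min_loop=3):
    """Build the edge set of a directed, labeled stem graph for one sequence."""
    stems = sorted(find_stems(seq, k, min_loop))
    edges = set()
    for s, t in combinations(stems, 2):
        rel = relation(s, t, k)
        if rel is not None:
            edges.add((s[2], rel, t[2]))  # edge runs from the 5' stem to the 3' stem
    return edges


def frequent_edges(graphs, min_support):
    """Count, for each labeled edge, the number of graphs that contain it."""
    support = defaultdict(int)
    for edges in graphs:
        for edge in edges:  # edges is a set, so each graph counts a pattern once
            support[edge] += 1
    return {e: c for e, c in support.items() if c >= min_support}


if __name__ == "__main__":
    seqs = [
        "GGGAAAACCCAAAACACAAAAGUG",
        "UUGGGAUAUCCCAAACACUUUUGUG",
        "AAAAAAAAAAAA",
    ]
    graphs = [stem_graph(s) for s in seqs]
    for pattern, count in sorted(frequent_edges(graphs, min_support=2).items()):
        print(pattern, count)
```

Because the sequences are never aligned, a pattern is matched purely by stem labels and their nesting or ordering relation, which is what lets the same motif be recognized at different positions in different sequences.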