» Articles » PMID: 36705896

Linear-Time Algorithms for RNA Structure Prediction

Overview
Specialty Molecular Biology
Date 2023 Jan 27
PMID 36705896
Authors
Affiliations
Soon will be listed here.
Abstract

RNA secondary structure prediction is widely used to understand RNA function. Existing dynamic programming-based algorithms, both the classical minimum free energy (MFE) methods and partition function methods, suffer from a major limitation: their runtimes scale cubically with the RNA length, and this slowness limits their use in genome-wide applications. Inspired by incremental parsing for context-free grammars in computational linguistics, we designed linear-time heuristic algorithms, LinearFold and LinearPartition, to approximate the MFE structure, partition function and base pairing probabilities. These programs are orders of magnitude faster than Vienna RNAfold and CONTRAfold on long sequences. More interestingly, LinearFold and LinearPartition lead to more accurate predictions on the longest sequence families for which the structures are well established (16S and 23S Ribosomal RNAs), as well as improved accuracies for long-range base pairs (500 +  nucleotides apart). This chapter provides protocols for using LinearFold and LinearPartition for secondary structure prediction.

Citing Articles

SARS-CoV-2 CoCoPUTs: analyzing GISAID and NCBI data to obtain codon statistics, mutations, and free energy over a multiyear period.

Padhiar N, Ghazanchyan T, Fumagalli S, DiCuccio M, Cohen G, Ginzburg A Virus Evol. 2025; 11(1):veae115.

PMID: 39882309 PMC: 11776705. DOI: 10.1093/ve/veae115.


mRNA vaccine sequence and structure design and optimization: Advances and challenges.

Jin L, Zhou Y, Zhang S, Chen S J Biol Chem. 2024; 301(1):108015.

PMID: 39608721 PMC: 11728972. DOI: 10.1016/j.jbc.2024.108015.

References
1.
Bachellerie J, Cavaille J, Huttenhofer A . The expanding snoRNA world. Biochimie. 2002; 84(8):775-90. DOI: 10.1016/s0300-9084(02)01402-5. View

2.
Bellaousov S, Mathews D . ProbKnot: fast prediction of RNA secondary structure including pseudoknots. RNA. 2010; 16(10):1870-80. PMC: 2941096. DOI: 10.1261/rna.2125310. View

3.
Bernhart S, Hofacker I, Stadler P . Local RNA base pairing probabilities in large sequences. Bioinformatics. 2005; 22(5):614-5. DOI: 10.1093/bioinformatics/btk014. View

4.
Clote P, Ponty Y, Steyaert J . Expected distance between terminal nucleotides of RNA secondary structures. J Math Biol. 2011; 65(3):581-99. DOI: 10.1007/s00285-011-0467-8. View

5.
Ding Y, Lawrence C . A statistical sampling algorithm for RNA secondary structure prediction. Nucleic Acids Res. 2003; 31(24):7280-301. PMC: 297010. DOI: 10.1093/nar/gkg938. View