A Bayesian Statistical Algorithm for RNA Secondary Structure Prediction
Overview
Medical Informatics
Authors
Affiliations
A Bayesian approach for predicting RNA secondary structure that addresses the following three open issues is described: (1) the need for a representation of the full ensemble of probable structures; (2) the need to specify a fixed set of energy parameters; (3) the desire to make statistical inferences on all variables in the problem. It has recently been shown that Bayesian inference can be employed to relax or eliminate the need to specify the parameters of bioinformatics recursive algorithms and to give a statistical representation of the full ensemble of probable solutions with the incorporation of uncertainty in parameter values. In this paper, we make an initial exploration of these potential advantages of the Bayesian approach. We present a Bayesian algorithm that is based on stacking energy rules but relaxes the need to specify the parameters. The algorithm returns the exact posterior distribution of the number of destabilizing loops, stacking energy matrices, and secondary structures. The algorithm generates statistically representative structures from the full ensemble of probable secondary structures in exact proportion to the posterior probabilities. Once the forward recursions for the algorithm are completed, the backward recursive sampling executes in O(n) time, providing a very efficient approach for generating representative structures. We demonstrate the utility of the Bayesian approach with several tRNA sequences. The potential of the approach for predicting RNA secondary structures and presenting alternative structures is illustrated with applications to the Escherichia coli tRNA(Ala) sequence and the Xenopus laevis oocyte 5S rRNA sequence.
Davis E, Raman R, Byrne S, Ghanegolmohammadi F, Mathur C, Begley U bioRxiv. 2025; .
PMID: 39974974 PMC: 11838421. DOI: 10.1101/2025.02.03.636209.
Watters K, Yu A, Strobel E, Settle A, Lucks J Methods. 2016; 103:34-48.
PMID: 27064082 PMC: 4921265. DOI: 10.1016/j.ymeth.2016.04.002.
Joint modeling of RNase footprint sequencing profiles for genome-wide inference of RNA structure.
Zou C, Ouyang Z Nucleic Acids Res. 2015; 43(19):9187-97.
PMID: 26400167 PMC: 4627092. DOI: 10.1093/nar/gkv950.
Emmrich S, Wang W, John K, Li W, Putzer B Mol Cancer. 2009; 8:61.
PMID: 19671150 PMC: 2734544. DOI: 10.1186/1476-4598-8-61.
The evolutionary history of the structure of 5S ribosomal RNA.
Sun F, Caetano-Anolles G J Mol Evol. 2009; 69(5):430-43.
PMID: 19639237 DOI: 10.1007/s00239-009-9264-z.