» Articles » PMID: 35707511

MAP Segmentation in Bayesian Hidden Markov Models: a Case Study

Overview
Journal J Appl Stat
Specialty Public Health
Date 2022 Jun 16
PMID 35707511
Authors
Affiliations
Soon will be listed here.
Abstract

We consider the problem of estimating the maximum posterior probability (MAP) state sequence for a finite state and finite emission alphabet hidden Markov model (HMM) in the Bayesian setup, where both emission and transition matrices have Dirichlet priors. We study a training set consisting of thousands of protein alignment pairs. The training data is used to set the prior hyperparameters for Bayesian MAP segmentation. Since the Viterbi algorithm is not applicable any more, there is no simple procedure to find the MAP path, and several iterative algorithms are considered and compared. The main goal of the paper is to test the Bayesian setup against the frequentist one, where the parameters of HMM are estimated using the training data.

References
1.
Berman H, Westbrook J, Feng Z, Gilliland G, Bhat T, Weissig H . The Protein Data Bank. Nucleic Acids Res. 1999; 28(1):235-42. PMC: 102472. DOI: 10.1093/nar/28.1.235. View

2.
Boys R, Henderson D . A Bayesian approach to DNA sequence segmentation. Biometrics. 2004; 60(3):573-81. DOI: 10.1111/j.0006-341X.2004.00206.x. View

3.
Guha S, Li Y, Neuberg D . Bayesian Hidden Markov Modeling of Array CGH Data. J Am Stat Assoc. 2012; 103(482):485-497. PMC: 3286622. DOI: 10.1198/016214507000000923. View

4.
Salamov A, Solovyev V . Prediction of protein secondary structure by combining nearest-neighbor algorithms and multiple sequence alignments. J Mol Biol. 1995; 247(1):11-5. DOI: 10.1006/jmbi.1994.0116. View