» Articles » PMID: 7584370

Using Dirichlet Mixture Priors to Derive Hidden Markov Models for Protein Families

Overview
Date 1993 Jan 1
PMID 7584370
Citations 43
Authors
Affiliations
Soon will be listed here.
Abstract

A Bayesian method for estimating the amino acid distributions in the states of a hidden Markov model (HMM) for a protein family or the columns of a multiple alignment of that family is introduced. This method uses Dirichlet mixture densities as priors over amino acid distributions. These mixture densities are determined from examination of previously constructed HMMs or multiple alignments. It is shown that this Bayesian method can improve the quality of HMMs produced from small training sets. Specific experiments on the EF-hand motif are reported, for which these priors are shown to produce HMMs with higher likelihood on unseen data, and fewer false positives and false negatives in a database search task.

Citing Articles

learnMSA: learning and aligning large protein families.

Becker F, Stanke M Gigascience. 2022; 11.

PMID: 36399060 PMC: 9673500. DOI: 10.1093/gigascience/giac104.


Bridging the gaps in statistical models of protein alignment.

Sumanaweera D, Allison L, Konagurthu A Bioinformatics. 2022; 38(Suppl 1):i229-i237.

PMID: 35758809 PMC: 9235498. DOI: 10.1093/bioinformatics/btac246.


Predicting biological pathways of chemical compounds with a profile-inspired approach.

Lopez-Ibanez J, Pazos F, Chagoyen M BMC Bioinformatics. 2021; 22(1):320.

PMID: 34118870 PMC: 8199418. DOI: 10.1186/s12859-021-04252-y.


Machine Boss: rapid prototyping of bioinformatic automata.

Silvestre-Ryan J, Wang Y, Sharma M, Lin S, Shen Y, Dider S Bioinformatics. 2020; 37(1):29-35.

PMID: 32683444 PMC: 8034524. DOI: 10.1093/bioinformatics/btaa633.


Embracing Ambiguity in the Taxonomic Classification of Microbiome Sequencing Data.

Shah N, Meisel J, Pop M Front Genet. 2019; 10:1022.

PMID: 31681437 PMC: 6811648. DOI: 10.3389/fgene.2019.01022.