
Improving Hidden Markov Models for Classification of Human Immunodeficiency Virus-1 Subtypes Through Linear Classifier Learning

Overview
Date 2012 Apr 14
PMID 22499688
Citations 3
Abstract

Profile Hidden Markov Models (pHMMs) are widely used to model nucleotide or protein sequence families. In many applications, a sequence family classified into several subfamilies is given and each subfamily is modeled separately by one pHMM. A major drawback of this approach is the difficulty of coping with subfamilies composed of very few sequences.

Correct subtyping of human immunodeficiency virus-1 (HIV-1) sequences is one of the most crucial bioinformatic tasks affected by this problem of small subfamilies, i.e., HIV-1 subtypes with a small number of known sequences. To deal with small samples for particular subfamilies of HIV-1, we employ a machine learning approach. More precisely, we make use of an existing HMM architecture and its associated inference engine, while replacing the unsupervised estimation of emission probabilities by a supervised method. For that purpose, we use regularized linear discriminant learning together with a balancing scheme to account for the widely varying sample size. After training the multiclass linear discriminants, the corresponding weights are transformed to valid probabilities using a softmax function.

We apply this modified algorithm to classify HIV-1 sequence data (in the form of partial-length HIV-1 sequences and semi-artificial recombinants) and show that the performance of pHMMs can be significantly improved by the proposed technique.
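The two generic ingredients named in the abstract, a balancing scheme for unequal class sizes and a softmax that turns discriminant weights into valid probabilities, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the exact balancing formula or training procedure, so the inverse-class-size weighting, the function names, and the example weights below are all assumptions chosen for illustration.

```python
import math
from collections import Counter

ALPHABET = "ACGT"  # nucleotide emission alphabet (assumed for this sketch)


def softmax(weights):
    """Map real-valued weights to a probability vector (numerically stable)."""
    m = max(weights)
    exps = [math.exp(w - m) for w in weights]
    total = sum(exps)
    return [e / total for e in exps]


def balancing_weights(labels):
    """Assumed balancing scheme: per-sample weights inversely proportional
    to class size, so each class contributes the same total weight."""
    counts = Counter(labels)
    n_classes = len(counts)
    return [1.0 / (n_classes * counts[y]) for y in labels]


# Hypothetical discriminant weights for one match state of one subtype model.
state_weights = {"A": 2.0, "C": 0.0, "G": 1.0, "T": 0.0}
emissions = dict(zip(ALPHABET, softmax([state_weights[s] for s in ALPHABET])))

# Hypothetical training labels: one large and one small subtype.
labels = ["B"] * 8 + ["C"] * 2
sample_weights = balancing_weights(labels)
```

In this sketch `emissions` is a valid probability distribution over the four nucleotides that preserves the ordering of the underlying weights, and each sequence from the small subtype receives a larger sample weight than each sequence from the large one.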

Citing Articles

HIV-1 transmitted drug resistance mutations among antiretroviral therapy-Naïve individuals in Surabaya, Indonesia.

Kotaki T, Khairunisa S, Witaningrum A, Yunifiar M M, Sukartiningrum S, Noor Diansyah M. AIDS Res Ther. 2017; 12:5.

PMID: 28561075 PMC: 4336490. DOI: 10.1186/s12981-015-0046-y.


A model-based information sharing protocol for profile Hidden Markov Models used for HIV-1 recombination detection.

Bulla I, Schultz A, Chesneau C, Mark T, Serea F. BMC Bioinformatics. 2014; 15:205.

PMID: 24946781 PMC: 4230192. DOI: 10.1186/1471-2105-15-205.


HIV-1 subtypes B and C unique recombinant forms (URFs) and transmitted drug resistance identified in the Western Cape Province, South Africa.

Jacobs G, Wilkinson E, Isaacs S, Spies G, de Oliveira T, Seedat S. PLoS One. 2014; 9(6):e90845.

PMID: 24609015 PMC: 3946584. DOI: 10.1371/journal.pone.0090845.