» Articles » PMID: 25161663

A Hidden Markov Model for Haplotype Inference for Present-absent Data of Clustered Genes Using Identified Haplotypes and Haplotype Patterns

Overview
Journal Front Genet
Date 2014 Aug 28
PMID 25161663
Citations 1
Authors
Affiliations
Soon will be listed here.
Abstract

The majority of killer cell immunoglobin-like receptor (KIR) genes are detected as either present or absent using locus-specific genotyping technology. Ambiguity arises from the presence of a specific KIR gene since the exact copy number (one or two) of that gene is unknown. Therefore, haplotype inference for these genes is becoming more challenging due to such large portion of missing information. Meantime, many haplotypes and partial haplotype patterns have been previously identified due to tight linkage disequilibrium (LD) among these clustered genes thus can be incorporated to facilitate haplotype inference. In this paper, we developed a hidden Markov model (HMM) based method that can incorporate identified haplotypes or partial haplotype patterns for haplotype inference from present-absent data of clustered genes (e.g., KIR genes). We compared its performance with an expectation maximization (EM) based method previously developed in terms of haplotype assignments and haplotype frequency estimation through extensive simulations for KIR genes. The simulation results showed that the new HMM based method outperformed the previous method when some incorrect haplotypes were included as identified haplotypes and/or the standard deviation of haplotype frequencies were small. We also compared the performance of our method with two methods that do not use previously identified haplotypes and haplotype patterns, including an EM based method, HPALORE, and a HMM based method, MaCH. Our simulation results showed that the incorporation of identified haplotypes and partial haplotype patterns can improve accuracy for haplotype inference. The new software package HaploHMM is available and can be downloaded at http://www.soph.uab.edu/ssg/files/People/KZhang/HaploHMM/haplohmm-index.html.

Citing Articles

Estimating KIR Haplotype Frequencies on a Cohort of 10,000 Individuals: A Comprehensive Study on Population Variations, Typing Resolutions, and Reference Haplotypes.

Vierra-Green C, Roe D, Jayaraman J, Trowsdale J, Traherne J, Kuang R PLoS One. 2016; 11(10):e0163973.

PMID: 27723813 PMC: 5056762. DOI: 10.1371/journal.pone.0163973.

References
1.
Niu T, Qin Z, Xu X, Liu J . Bayesian haplotype inference for multiple linked single-nucleotide polymorphisms. Am J Hum Genet. 2001; 70(1):157-69. PMC: 448439. DOI: 10.1086/338446. View

2.
Yoo Y, Tang J, Kaslow R, Zhang K . Haplotype inference for present-absent genotype data using previously identified haplotypes and haplotype patterns. Bioinformatics. 2007; 23(18):2399-406. DOI: 10.1093/bioinformatics/btm371. View

3.
Hawley M, Kidd K . HAPLO: a program using the EM algorithm to estimate the frequencies of multi-site haplotypes. J Hered. 1995; 86(5):409-11. DOI: 10.1093/oxfordjournals.jhered.a111613. View

4.
Kitsios G, Zintzaras E . An NOS3 Haplotype is Protective against Hypertension in a Caucasian Population. Int J Hypertens. 2010; 2010:865031. PMC: 2958494. DOI: 10.4061/2010/865031. View

5.
Qin Z, Niu T, Liu J . Partition-ligation-expectation-maximization algorithm for haplotype inference with single-nucleotide polymorphisms. Am J Hum Genet. 2002; 71(5):1242-7. PMC: 385113. DOI: 10.1086/344207. View