» Articles » PMID: 16982009

How Well Do HapMap SNPs Capture the Untyped SNPs?

Overview
Journal BMC Genomics
Publisher Biomed Central
Specialty Genetics
Date 2006 Sep 20
PMID 16982009
Citations 14
Authors
Affiliations
Soon will be listed here.
Abstract

Background: The recent advancement in human genome sequencing and genotyping has revealed millions of single nucleotide polymorphisms (SNP) which determine the variation among human beings. One of the particular important projects is The International HapMap Project which provides the catalogue of human genetic variation for disease association studies. In this paper, we analyzed the genotype data in HapMap project by using National Institute of Environmental Health Sciences Environmental Genome Project (NIEHS EGP) SNPs. We first determine whether the HapMap data are transferable to the NIEHS data. Then, we study how well the HapMap SNPs capture the untyped SNPs in the region. Finally, we provide general guidelines for determining whether the SNPs chosen from HapMap may be able to capture most of the untyped SNPs.

Results: Our analysis shows that HapMap data are not robust enough to capture the untyped variants for most of the human genes. The performance of SNPs for European and Asian samples are marginal in capturing the untyped variants, i.e. approximately 55%. Expectedly, the SNPs from HapMap YRI panel can only capture approximately 30% of the variants. Although the overall performance is low, however, the SNPs for some genes perform very well and are able to capture most of the variants along the gene. This is observed in the European and Asian panel, but not in African panel. Through observation, we concluded that in order to have a well covered SNPs reference panel, the SNPs density and the association among reference SNPs are important to estimate the robustness of the chosen SNPs.

Conclusion: We have analyzed the coverage of HapMap SNPs using NIEHS EGP data. The results show that HapMap SNPs are transferable to the NIEHS SNPs. However, HapMap SNPs cannot capture some of the untyped SNPs and therefore resequencing may be needed to uncover more SNPs in the missing region.

Citing Articles

The limits of genome-wide methods for pharmacogenomic testing.

Gamazon E, Skol A, Perera M Pharmacogenet Genomics. 2012; 22(4):261-72.

PMID: 22344246 PMC: 3655533. DOI: 10.1097/FPC.0b013e328350ca5f.


Evaluating the transferability of Hapmap SNPs to a Singapore Chinese population.

Andiappan A, Anantharaman R, Nilkanth P, Wang D, Chew F BMC Genet. 2010; 11:36.

PMID: 20459637 PMC: 2877651. DOI: 10.1186/1471-2156-11-36.


Genomic and geographic distribution of private SNPs and pathways in human populations.

Baye T, Wilke R, Olivier M Per Med. 2010; 6(6):623-641.

PMID: 20352079 PMC: 2843937. DOI: 10.2217/pme.09.54.


Comprehensive survey of SNPs in the Affymetrix exon array using the 1000 Genomes dataset.

Gamazon E, Zhang W, Dolan M, Cox N PLoS One. 2010; 5(2):e9366.

PMID: 20186275 PMC: 2826392. DOI: 10.1371/journal.pone.0009366.


Beyond the HapMap Genotypic Data: Prospects of Deep Resequencing Projects.

Zhang W, Dolan M Curr Bioinform. 2011; 3(3):178.

PMID: 20151045 PMC: 2819736. DOI: 10.2174/157489308785909232.


References
1.
. The International HapMap Project. Nature. 2003; 426(6968):789-96. DOI: 10.1038/nature02168. View

2.
Carlson C, Eberle M, Rieder M, Yi Q, Kruglyak L, Nickerson D . Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium. Am J Hum Genet. 2003; 74(1):106-20. PMC: 1181897. DOI: 10.1086/381000. View

3.
Zhang K, Qin Z, Liu J, Chen T, Waterman M, Sun F . Haplotype block partitioning and tag SNP selection using genotype data and their applications to association studies. Genome Res. 2004; 14(5):908-16. PMC: 479119. DOI: 10.1101/gr.1837404. View

4.
Reich D, Cargill M, Bolk S, Ireland J, Sabeti P, Richter D . Linkage disequilibrium in the human genome. Nature. 2001; 411(6834):199-204. DOI: 10.1038/35075590. View

5.
Daly M, Rioux J, Schaffner S, Hudson T, Lander E . High-resolution haplotype structure in the human genome. Nat Genet. 2001; 29(2):229-32. DOI: 10.1038/ng1001-229. View