How Well Do HapMap SNPs Capture the Untyped SNPs?
Overview
Authors
Affiliations
Background: The recent advancement in human genome sequencing and genotyping has revealed millions of single nucleotide polymorphisms (SNP) which determine the variation among human beings. One of the particular important projects is The International HapMap Project which provides the catalogue of human genetic variation for disease association studies. In this paper, we analyzed the genotype data in HapMap project by using National Institute of Environmental Health Sciences Environmental Genome Project (NIEHS EGP) SNPs. We first determine whether the HapMap data are transferable to the NIEHS data. Then, we study how well the HapMap SNPs capture the untyped SNPs in the region. Finally, we provide general guidelines for determining whether the SNPs chosen from HapMap may be able to capture most of the untyped SNPs.
Results: Our analysis shows that HapMap data are not robust enough to capture the untyped variants for most of the human genes. The performance of SNPs for European and Asian samples are marginal in capturing the untyped variants, i.e. approximately 55%. Expectedly, the SNPs from HapMap YRI panel can only capture approximately 30% of the variants. Although the overall performance is low, however, the SNPs for some genes perform very well and are able to capture most of the variants along the gene. This is observed in the European and Asian panel, but not in African panel. Through observation, we concluded that in order to have a well covered SNPs reference panel, the SNPs density and the association among reference SNPs are important to estimate the robustness of the chosen SNPs.
Conclusion: We have analyzed the coverage of HapMap SNPs using NIEHS EGP data. The results show that HapMap SNPs are transferable to the NIEHS SNPs. However, HapMap SNPs cannot capture some of the untyped SNPs and therefore resequencing may be needed to uncover more SNPs in the missing region.
The limits of genome-wide methods for pharmacogenomic testing.
Gamazon E, Skol A, Perera M Pharmacogenet Genomics. 2012; 22(4):261-72.
PMID: 22344246 PMC: 3655533. DOI: 10.1097/FPC.0b013e328350ca5f.
Evaluating the transferability of Hapmap SNPs to a Singapore Chinese population.
Andiappan A, Anantharaman R, Nilkanth P, Wang D, Chew F BMC Genet. 2010; 11:36.
PMID: 20459637 PMC: 2877651. DOI: 10.1186/1471-2156-11-36.
Genomic and geographic distribution of private SNPs and pathways in human populations.
Baye T, Wilke R, Olivier M Per Med. 2010; 6(6):623-641.
PMID: 20352079 PMC: 2843937. DOI: 10.2217/pme.09.54.
Comprehensive survey of SNPs in the Affymetrix exon array using the 1000 Genomes dataset.
Gamazon E, Zhang W, Dolan M, Cox N PLoS One. 2010; 5(2):e9366.
PMID: 20186275 PMC: 2826392. DOI: 10.1371/journal.pone.0009366.
Beyond the HapMap Genotypic Data: Prospects of Deep Resequencing Projects.
Zhang W, Dolan M Curr Bioinform. 2011; 3(3):178.
PMID: 20151045 PMC: 2819736. DOI: 10.2174/157489308785909232.