» Articles » PMID: 19184111

Use of Weighted Reference Panels Based on Empirical Estimates of Ancestry for Capturing Untyped Variation

Overview
Journal Hum Genet
Specialty Genetics
Date 2009 Feb 3
PMID 19184111
Citations 17
Authors
Affiliations
Soon will be listed here.
Abstract

Many association methods use a subset of genotyped single nucleotide polymorphisms (SNPs) to capture or infer genotypes at other untyped SNPs. We and others previously showed that tag SNPs selected to capture common variation using data from The International HapMap Consortium (Nature 437:1299-1320, 2005), The International HapMap Consortium (Nature 449:851-861, 2007) could also capture variation in populations of similar ancestry to HapMap reference populations (de Bakker et al. in Nat Genet 38:1298-1303, 2006; González-Neira et al. in Genome Res 16:323-330, 2006; Montpetit et al. in PLoS Genet 2:282-290, 2006; Mueller et al. in Am J Hum Genet 76:387-398, 2005). To capture variation in admixed populations or populations less similar to HapMap panels, a "cosmopolitan approach," in which all samples from HapMap are used as a single reference panel, was proposed. Here we refine this suggestion and show that use of a "weighted reference panel," constructed based on empirical estimates of ancestry in the target population (relative to available reference panels), is more efficient than the cosmopolitan approach. Weighted reference panels capture, on average, only slightly fewer common variants (minor allele frequency > 5%) than the cosmopolitan approach (mean r (2) = 0.977 vs. 0.989, 94.5% variation captured vs. 96.8% at r (2) > 0.8), across the five populations of the Multiethnic Cohort, but entail approximately 25% fewer tag SNPs per panel (average 538 vs. 718). These results extend a recent study in two Indian populations (Pemberton et al. in Ann Hum Genet 72:535-546, 2008). Weighted reference panels are potentially useful for both the selection of tag SNPs in diverse populations and perhaps in the design of reference panels for imputation of untyped genotypes in genome-wide association studies in admixed populations.

Citing Articles

DISSCO: direct imputation of summary statistics allowing covariates.

Xu Z, Duan Q, Yan S, Chen W, Li M, Lange E Bioinformatics. 2015; 31(15):2434-42.

PMID: 25810429 PMC: 4514926. DOI: 10.1093/bioinformatics/btv168.


Functional significance of single nucleotide polymorphisms in the lactase gene in diverse US patients and evidence for a novel lactase persistence allele at -13909 in those of European ancestry.

Baffour-Awuah N, Fleet S, Montgomery R, Baker S, Butler J, Campbell C J Pediatr Gastroenterol Nutr. 2015; 60(2):182-91.

PMID: 25625576 PMC: 4308731. DOI: 10.1097/MPG.0000000000000595.


Assessment of genotype imputation performance using 1000 Genomes in African American studies.

Hancock D, Levy J, Gaddis N, Bierut L, Saccone N, Page G PLoS One. 2012; 7(11):e50610.

PMID: 23226329 PMC: 3511547. DOI: 10.1371/journal.pone.0050610.


Genotype imputation in a coalescent model with infinitely-many-sites mutation.

Huang L, Buzbas E, Rosenberg N Theor Popul Biol. 2012; 87:62-74.

PMID: 23079542 PMC: 3587719. DOI: 10.1016/j.tpb.2012.09.006.


MaCH-admix: genotype imputation for admixed populations.

Liu E, Li M, Wang W, Li Y Genet Epidemiol. 2012; 37(1):25-37.

PMID: 23074066 PMC: 3524415. DOI: 10.1002/gepi.21690.


References
1.
Haiman C, Patterson N, Freedman M, Myers S, Pike M, Waliszewska A . Multiple regions within 8q24 independently affect risk for prostate cancer. Nat Genet. 2007; 39(5):638-44. PMC: 2638766. DOI: 10.1038/ng2015. View

2.
Zeggini E, Scott L, Saxena R, Voight B, Marchini J, Hu T . Meta-analysis of genome-wide association data and large-scale replication identifies additional susceptibility loci for type 2 diabetes. Nat Genet. 2008; 40(5):638-45. PMC: 2672416. DOI: 10.1038/ng.120. View

3.
Mueller J, Lohmussaar E, Magi R, Remm M, Bettecken T, Lichtner P . Linkage disequilibrium patterns and tagSNP transferability among European populations. Am J Hum Genet. 2005; 76(3):387-98. PMC: 1196391. DOI: 10.1086/427925. View

4.
Cann H, De Toma C, Cazes L, Morel V, Piouffre L, Bodmer J . A human genome diversity cell line panel. Science. 2002; 296(5566):261-2. DOI: 10.1126/science.296.5566.261b. View

5.
. A haplotype map of the human genome. Nature. 2005; 437(7063):1299-320. PMC: 1880871. DOI: 10.1038/nature04226. View