» Articles » PMID: 39702642

Population-specific Reference Panel Improves Imputation Quality for Genome-wide Association Studies Conducted on the Japanese Population

Abstract

To improve imputation quality for genome-wide association studies (GWAS) conducted on the Japanese population, we developed and evaluated four Japanese population-specific reference panels. These panels were constructed through the augmentation of the 1000 Genomes Project (1KG) panel using Japanese whole genome sequencing (WGS) data, with sample sizes ranging from 1 K to 7 K individuals enrolled through the Biobank Japan (BBJ) project, and sequencing depths ranging from 3× to 30×. Among these panels, an augmented reference panel comprising 7472 WGS samples of mixed depth (1KG+7K) exhibit the greatest improvement in imputation quality relative to the Trans-Omics for Precision Medicine (TOPMed) reference panel. Notably, we observe these improvements primarily for rare variants with a minor allele frequency (MAF) <5%. To demonstrate the benefits of improved imputation quality in association analyses of complex traits, we conducted GWAS for serum uric acid and total cholesterol levels following imputation up to the 1KG+7K panel. The analysis reveals several loci reaching genome-wide significance (P < 5 × 10) in the 1KG+7K imputation output yet remaining undetected when the same sample set is imputed up to the TOPMed reference panel. In summary, the 1KG+7K panel demonstrates significant advantages in the discovery of trait-associated loci, particularly those influenced by low-frequency association signals.

Citing Articles

A variant in HMMR/HMMR-AS1 is associated with serum alanine aminotransferase levels in the Ryukyu population.

Ohyama N, Matsunami M, Imamura M, Yoshida A, Javed A, Liu X Sci Rep. 2025; 15(1):6494.

PMID: 39987337 PMC: 11846991. DOI: 10.1038/s41598-025-90195-w.

References
1.
Yoo S, Kim C, Kim H, Kim S, Shin J, Kim N . NARD: whole-genome reference panel of 1779 Northeast Asians improves imputation accuracy of rare and low-frequency variants. Genome Med. 2019; 11(1):64. PMC: 6805399. DOI: 10.1186/s13073-019-0677-z. View

2.
McCarthy S, Das S, Kretzschmar W, Delaneau O, Wood A, Teumer A . A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet. 2016; 48(10):1279-83. PMC: 5388176. DOI: 10.1038/ng.3643. View

3.
Byrska-Bishop M, Evani U, Zhao X, Basile A, Abel H, Regier A . High-coverage whole-genome sequencing of the expanded 1000 Genomes Project cohort including 602 trios. Cell. 2022; 185(18):3426-3440.e19. PMC: 9439720. DOI: 10.1016/j.cell.2022.08.004. View

4.
Schaid D, Chen W, Larson N . From genome-wide associations to candidate causal variants by statistical fine-mapping. Nat Rev Genet. 2018; 19(8):491-504. PMC: 6050137. DOI: 10.1038/s41576-018-0016-z. View

5.
Nakayama A, Nakatochi M, Kawamura Y, Yamamoto K, Nakaoka H, Shimizu S . Subtype-specific gout susceptibility loci and enrichment of selection pressure on and identified by subtype genome-wide meta-analyses of clinically defined gout patients. Ann Rheum Dis. 2020; 79(5):657-665. PMC: 7213308. DOI: 10.1136/annrheumdis-2019-216644. View