» Articles » PMID: 26226460

Population-specific Genotype Imputations Using Minimac or IMPUTE2

Abstract

In order to meaningfully analyze common and rare genetic variants, results from genome-wide association studies (GWASs) of multiple cohorts need to be combined in a meta-analysis in order to obtain enough power. This requires all cohorts to have the same single-nucleotide polymorphisms (SNPs) in their GWASs. To this end, genotypes that have not been measured in a given cohort can be imputed on the basis of a set of reference haplotypes. This protocol provides guidelines for performing imputations with two widely used tools: minimac and IMPUTE2. These guidelines were developed and used by the Genome of the Netherlands (GoNL) consortium, which has created a population-specific reference panel for genetic imputations and used this reference to impute various Dutch biobanks. We also describe several factors that might influence the final imputation quality. This protocol, which has been used by the largest Dutch biobanks, should take approximately several days, depending on the sample size of the biobank and the computer resources available.

Citing Articles

Proxy panels enable privacy-aware outsourcing of genotype imputation.

Zhi D, Jiang X, Harmanci A, Harmanci A Genome Res. 2025; 35(2):326-339.

PMID: 39794122 PMC: 11874966. DOI: 10.1101/gr.278934.124.


Genome-wide analysis in northern Chinese twins identifies twelve new susceptibility loci for pulmonary function.

Wang T, Wang W, Xu C, Tian X, Zhang D BMC Genomics. 2024; 25(1):1255.

PMID: 39736507 PMC: 11684132. DOI: 10.1186/s12864-024-11165-6.


Whole genome sequencing of three native chicken varieties (Common Deshi, Hilly and Naked Neck) of Bangladesh.

Rabbani M, Vallejo-Trujillo A, Wu Z, Miedzinska K, Faruque S, Watson K Sci Data. 2024; 11(1):1432.

PMID: 39719437 PMC: 11668823. DOI: 10.1038/s41597-024-04291-z.


A commonly inherited human PCSK9 germline variant drives breast cancer metastasis via LRP1 receptor.

Mei W, Faraj Tabrizi S, Godina C, Lovisa A, Isaksson K, Jernstrom H Cell. 2024; 188(2):371-389.e28.

PMID: 39657676 PMC: 11770377. DOI: 10.1016/j.cell.2024.11.009.


Polygenic score analyses on antidepressant response in late-life depression, results from the IRL-GRey study.

Elsheikh S, Marshe V, Men X, Islam F, Goncalves V, Pare G Pharmacogenomics J. 2024; 24(6):38.

PMID: 39578436 DOI: 10.1038/s41397-024-00351-0.


References
1.
Anderson C, Pettersson F, Clarke G, Cardon L, Morris A, Zondervan K . Data quality control in genetic case-control association studies. Nat Protoc. 2010; 5(9):1564-73. PMC: 3025522. DOI: 10.1038/nprot.2010.116. View

2.
Abecasis G, Auton A, Brooks L, DePristo M, Durbin R, Handsaker R . An integrated map of genetic variation from 1,092 human genomes. Nature. 2012; 491(7422):56-65. PMC: 3498066. DOI: 10.1038/nature11632. View

3.
Roshyara N, Scholz M . fcGENE: a versatile tool for processing and transforming SNP datasets. PLoS One. 2014; 9(7):e97589. PMC: 4106754. DOI: 10.1371/journal.pone.0097589. View

4.
. Whole-genome sequence variation, population structure and demographic history of the Dutch population. Nat Genet. 2014; 46(8):818-25. DOI: 10.1038/ng.3021. View

5.
van Leeuwen E, Karssen L, Deelen J, Isaacs A, Medina-Gomez C, Mbarek H . Genome of The Netherlands population-specific imputations identify an ABCA6 variant associated with cholesterol levels. Nat Commun. 2015; 6:6065. PMC: 4366498. DOI: 10.1038/ncomms7065. View