» Articles » PMID: 25629170

Imputation of the Rare HOXB13 G84E Mutation and Cancer Risk in a Large Population-based Cohort

Abstract

An efficient approach to characterizing the disease burden of rare genetic variants is to impute them into large well-phenotyped cohorts with existing genome-wide genotype data using large sequenced referenced panels. The success of this approach hinges on the accuracy of rare variant imputation, which remains controversial. For example, a recent study suggested that one cannot adequately impute the HOXB13 G84E mutation associated with prostate cancer risk (carrier frequency of 0.0034 in European ancestry participants in the 1000 Genomes Project). We show that by utilizing the 1000 Genomes Project data plus an enriched reference panel of mutation carriers we were able to accurately impute the G84E mutation into a large cohort of 83,285 non-Hispanic White participants from the Kaiser Permanente Research Program on Genes, Environment and Health Genetic Epidemiology Research on Adult Health and Aging cohort. Imputation authenticity was confirmed via a novel classification and regression tree method, and then empirically validated analyzing a subset of these subjects plus an additional 1,789 men from Kaiser specifically genotyped for the G84E mutation (r2 = 0.57, 95% CI = 0.37–0.77). We then show the value of this approach by using the imputed data to investigate the impact of the G84E mutation on age-specific prostate cancer risk and on risk of fourteen other cancers in the cohort. The age-specific risk of prostate cancer among G84E mutation carriers was higher than among non-carriers. Risk estimates from Kaplan-Meier curves were 36.7% versus 13.6% by age 72, and 64.2% versus 24.2% by age 80, for G84E mutation carriers and non-carriers, respectively (p = 3.4x10-12). The G84E mutation was also associated with an increase in risk for the fourteen other most common cancers considered collectively (p = 5.8x10-4) and more so in cases diagnosed with multiple cancer types, both those including and not including prostate cancer, strongly suggesting pleiotropic effects. [corrected].

Citing Articles

SEAD reference panel with 22,134 haplotypes boosts rare variant imputation and genome-wide association analysis in Asian populations.

Yang M, Zhong J, Li X, Tian G, Bai W, Fang Y Nat Commun. 2024; 15(1):10839.

PMID: 39738056 PMC: 11686012. DOI: 10.1038/s41467-024-55147-4.


Dissecting the Reduced Penetrance of Putative Loss-of-Function Variants in Population-Scale Biobanks.

Blair D, Risch N medRxiv. 2024; .

PMID: 39399029 PMC: 11469360. DOI: 10.1101/2024.09.23.24314008.


Susceptibility Genes Associated with Multiple Primary Cancers.

Lu M, Zhang X, Chu Q, Chen Y, Zhang P Cancers (Basel). 2023; 15(24).

PMID: 38136334 PMC: 10741435. DOI: 10.3390/cancers15245788.


Assessment of genetic susceptibility to multiple primary cancers through whole-exome sequencing in two large multi-ancestry studies.

Cavazos T, Kachuri L, Graff R, Nierenberg J, Thai K, Alexeeff S BMC Med. 2022; 20(1):332.

PMID: 36199081 PMC: 9535845. DOI: 10.1186/s12916-022-02535-6.


Germline HOXB13 mutation p.G84E do not confer an increased bladder or kidney cancer risk in polish population.

Zlowocka-Perlowska E, Toloczko-Grabarek A, Lubinski J Hered Cancer Clin Pract. 2022; 20(1):1.

PMID: 34983599 PMC: 8728939. DOI: 10.1186/s13053-021-00208-8.


References
1.
Wood A, Perry J, Tanaka T, Hernandez D, Zheng H, Melzer D . Imputation of variants from the 1000 Genomes Project modestly improves known associations and can identify low-frequency variant-phenotype associations undetected by HapMap based imputation. PLoS One. 2013; 8(5):e64343. PMC: 3655956. DOI: 10.1371/journal.pone.0064343. View

2.
Duan Q, Liu E, Auer P, Zhang G, Lange E, Jun G . Imputation of coding variants in African Americans: better performance using data from the exome sequencing project. Bioinformatics. 2013; 29(21):2744-9. PMC: 3799474. DOI: 10.1093/bioinformatics/btt477. View

3.
Price A, Patterson N, Plenge R, Weinblatt M, Shadick N, Reich D . Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet. 2006; 38(8):904-9. DOI: 10.1038/ng1847. View

4.
Howie B, Marchini J, Stephens M . Genotype imputation with thousands of genomes. G3 (Bethesda). 2012; 1(6):457-70. PMC: 3276165. DOI: 10.1534/g3.111.001198. View

5.
Bhattacharjee S, Rajaraman P, Jacobs K, Wheeler W, Melin B, Hartge P . A subset-based approach improves power and interpretation for the combined analysis of genetic association studies of heterogeneous traits. Am J Hum Genet. 2012; 90(5):821-35. PMC: 3376551. DOI: 10.1016/j.ajhg.2012.03.015. View