» Articles » PMID: 26776191

KNOWLEDGE DRIVEN BINNING AND PHEWAS ANALYSIS IN MARSHFIELD PERSONALIZED MEDICINE RESEARCH PROJECT USING BIOBIN

Overview
Publisher World Scientific
Specialty Biology
Date 2016 Jan 19
PMID 26776191
Citations 12
Authors
Affiliations
Soon will be listed here.
Abstract

Next-generation sequencing technology has presented an opportunity for rare variant discovery and association of these variants with disease. To address the challenges of rare variant analysis, multiple statistical methods have been developed for combining rare variants to increase statistical power for detecting associations. BioBin is an automated tool that expands on collapsing/binning methods by performing multi-level variant aggregation with a flexible, biologically informed binning strategy using an internal biorepository, the Library of Knowledge (LOKI). The databases within LOKI provide variant details, regional annotations and pathway interactions which can be used to generate bins of biologically-related variants, thereby increasing the power of any subsequent statistical test. In this study, we expand the framework of BioBin to incorporate statistical tests, including a dispersion-based test, SKAT, thereby providing the option of performing a unified collapsing and statistical rare variant analysis in one tool. Extensive simulation studies performed on gene-coding regions showed a Bin-KAT analysis to have greater power than BioBin-regression in all simulated conditions, including variants influencing the phenotype in the same direction, a scenario where burden tests often retain greater power. The use of Madsen- Browning variant weighting increased power in the burden analysis to that equitable with Bin-KAT; but overall Bin-KAT retained equivalent or higher power under all conditions. Bin-KAT was applied to a study of 82 pharmacogenes sequenced in the Marshfield Personalized Medicine Research Project (PMRP). We looked for association of these genes with 9 different phenotypes extracted from the electronic health record. This study demonstrates that Bin-KAT is a powerful tool for the identification of genes harboring low frequency variants for complex phenotypes.

Citing Articles

Impact of natural selection on global patterns of genetic variation and association with clinical phenotypes at genes involved in SARS-CoV-2 infection.

Zhang C, Verma A, Feng Y, Melo M, McQuillan M, Hansen M Proc Natl Acad Sci U S A. 2022; 119(21):e2123000119.

PMID: 35580180 PMC: 9173769. DOI: 10.1073/pnas.2123000119.


Maturation and application of phenome-wide association studies.

Liu S, Crawford D Trends Genet. 2022; 38(4):353-363.

PMID: 34991903 PMC: 8930498. DOI: 10.1016/j.tig.2021.12.002.


Impact of natural selection on global patterns of genetic variation, and association with clinical phenotypes, at genes involved in SARS-CoV-2 infection.

Zhang C, Verma A, Feng Y, Dos Reis Melo M, McQuillan M, Hansen M Res Sq. 2021; .

PMID: 34341784 PMC: 8328070. DOI: 10.21203/rs.3.rs-673011/v1.


Impact of natural selection on global patterns of genetic variation, and association with clinical phenotypes, at genes involved in SARS-CoV-2 infection.

Zhang C, Verma A, Feng Y, Melo M, McQuillan M, Hansen M medRxiv. 2021; .

PMID: 34230933 PMC: 8259910. DOI: 10.1101/2021.06.28.21259529.


Genetic Analysis Reveals Rare Variants in T-Cell Response Gene MR1 Associated with Poor Overall Survival after Urothelial Cancer Diagnosis.

Bang L, Shivakumar M, Garg T, Kim D Cancers (Basel). 2021; 13(8).

PMID: 33919687 PMC: 8069815. DOI: 10.3390/cancers13081864.


References
1.
Rasmussen-Torvik L, Stallings S, Gordon A, Almoguera B, Basford M, Bielinski S . Design and anticipated outcomes of the eMERGE-PGx project: a multicenter pilot for preemptive pharmacogenomics in electronic health record systems. Clin Pharmacol Ther. 2014; 96(4):482-9. PMC: 4169732. DOI: 10.1038/clpt.2014.137. View

2.
Lee S, Wu M, Lin X . Optimal tests for rare variant effects in sequencing association studies. Biostatistics. 2012; 13(4):762-75. PMC: 3440237. DOI: 10.1093/biostatistics/kxs014. View

3.
Maher B . Personal genomes: The case of the missing heritability. Nature. 2008; 456(7218):18-21. DOI: 10.1038/456018a. View

4.
Conneely K, Boehnke M . So many correlated tests, so little time! Rapid adjustment of P values for multiple correlated tests. Am J Hum Genet. 2007; 81(6):1158-68. PMC: 2276357. DOI: 10.1086/522036. View

5.
Asimit J, Day-Williams A, Morris A, Zeggini E . ARIEL and AMELIA: testing for an accumulation of rare variants using next-generation sequencing data. Hum Hered. 2012; 73(2):84-94. PMC: 3477640. DOI: 10.1159/000336982. View