» Articles » PMID: 35001978

A Computationally Efficient Bayesian Seemingly Unrelated Regressions Model for High-dimensional Quantitative Trait Loci Discovery

Overview
Specialty Public Health
Date 2022 Jan 10
PMID 35001978
Authors
Affiliations
Soon will be listed here.
Abstract

Our work is motivated by the search for metabolite quantitative trait loci (QTL) in a cohort of more than 5000 people. There are 158 metabolites measured by NMR spectroscopy in the 31-year follow-up of the Northern Finland Birth Cohort 1966 (NFBC66). These metabolites, as with many multivariate phenotypes produced by high-throughput biomarker technology, exhibit strong correlation structures. Existing approaches for combining such data with genetic variants for multivariate QTL analysis generally ignore phenotypic correlations or make restrictive assumptions about the associations between phenotypes and genetic loci. We present a computationally efficient Bayesian seemingly unrelated regressions model for high-dimensional data, with cell-sparse variable selection and sparse graphical structure for covariance selection. Cell sparsity allows different phenotype responses to be associated with different genetic predictors and the graphical structure is used to represent the conditional dependencies between phenotype variables. To achieve feasible computation of the large model space, we exploit a factorisation of the covariance matrix. Applying the model to the NFBC66 data with 9000 directly genotyped single nucleotide polymorphisms, we are able to simultaneously estimate genotype-phenotype associations and the residual dependence structure among the metabolites. The R package BayesSUR with full documentation is available at https://cran.r-project.org/web/packages/BayesSUR/.

Citing Articles

A Bayesian multivariate hierarchical model for developing a treatment benefit index using mixed types of outcomes.

Wu D, Goldfeld K, Petkova E, Park H BMC Med Res Methodol. 2024; 24(1):218.

PMID: 39333874 PMC: 11437666. DOI: 10.1186/s12874-024-02333-z.


Inferring personal intake recommendations of phosphorous and potassium for end-stage renal failure patients by simulating with Bayesian hierarchical multivariate model.

Turkia J, Schwab U, Hautamaki V PLoS One. 2024; 19(2):e0291153.

PMID: 38319948 PMC: 10846746. DOI: 10.1371/journal.pone.0291153.


Improving Individualized Treatment Decisions: A Bayesian Multivariate Hierarchical Model for Developing a Treatment Benefit Index using Mixed Types of Outcomes.

Wu D, Goldfeld K, Petkova E, Park H medRxiv. 2023; .

PMID: 38014277 PMC: 10680905. DOI: 10.1101/2023.11.17.23298711.


Fast and flexible joint fine-mapping of multiple traits via the Sum of Single Effects model.

Zou Y, Carbonetto P, Xie D, Wang G, Stephens M bioRxiv. 2023; .

PMID: 37425935 PMC: 10327118. DOI: 10.1101/2023.04.14.536893.

References
1.
Ruffieux H, Davison A, Hager J, Inshaw J, Fairfax B, Richardson S . A Global-Local Approach for Detecting Hotspots in Multiple-Response Regression. Ann Appl Stat. 2022; 14(2):905-928. PMC: 7612176. DOI: 10.1214/20-AOAS1332. View

2.
Ruffieux H, Fairfax B, Nassiri I, Vigorito E, Wallace C, Richardson S . EPISPOT: An epigenome-driven approach for detecting and interpreting hotspots in molecular QTL studies. Am J Hum Genet. 2021; 108(6):983-1000. PMC: 8206410. DOI: 10.1016/j.ajhg.2021.04.010. View

3.
Thomas A, Green P . Enumerating the junction trees of a decomposable graph. J Comput Graph Stat. 2010; 18(4):930-940. PMC: 2963453. DOI: 10.1198/jcgs.2009.07129. View

4.
Shabalin A . Matrix eQTL: ultra fast eQTL analysis via large matrix operations. Bioinformatics. 2012; 28(10):1353-8. PMC: 3348564. DOI: 10.1093/bioinformatics/bts163. View

5.
Wurtz P, Kangas A, Soininen P, Lawlor D, Davey Smith G, Ala-Korpela M . Quantitative Serum Nuclear Magnetic Resonance Metabolomics in Large-Scale Epidemiology: A Primer on -Omic Technologies. Am J Epidemiol. 2017; 186(9):1084-1096. PMC: 5860146. DOI: 10.1093/aje/kwx016. View