» Articles » PMID: 39282458

Genotype Inference from Aggregated Chromatin Accessibility Data Reveals Genetic Regulatory Mechanisms

Overview
Journal bioRxiv
Date 2024 Sep 16
PMID 39282458
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Understanding the genetic causes for variability in chromatin accessibility can shed light on the molecular mechanisms through which genetic variants may affect complex traits. Thousands of ATAC-seq samples have been collected that hold information about chromatin accessibility across diverse cell types and contexts, but most of these are not paired with genetic information and come from diverse distinct projects and laboratories.

Results: We report here joint genotyping, chromatin accessibility peak calling, and discovery of quantitative trait loci which influence chromatin accessibility (caQTLs), demonstrating the capability of performing caQTL analysis on a large scale in a diverse sample set without pre-existing genotype information. Using 10,293 profiling samples representing 1,454 unique donor individuals across 653 studies from public databases, we catalog 23,381 caQTLs in total. After joint discovery analysis, we cluster samples based on accessible chromatin profiles to identify context-specific caQTLs. We find that caQTLs are strongly enriched for annotations of gene regulatory elements across diverse cell types and tissues and are often strongly linked with genetic variation associated with changes in expression (eQTLs), indicating that caQTLs can mediate genetic effects on gene expression. We demonstrate sharing of causal variants for chromatin accessibility and diverse complex human traits, enabling a more complete picture of the genetic mechanisms underlying complex human phenotypes.

Conclusions: Our work provides a proof of principle for caQTL calling from previously ungenotyped samples, and represents one of the largest, most diverse caQTL resources currently available, informing mechanisms of genetic regulation of gene expression and contribution to disease.

References
1.
Zhu Z, Zhang F, Hu H, Bakshi A, Robinson M, Powell J . Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat Genet. 2016; 48(5):481-7. DOI: 10.1038/ng.3538. View

2.
Pandey G, Vadlamudi S, Currin K, Moxley A, Nicholas J, McAfee J . Liver regulatory mechanisms of noncoding variants at lipid and metabolic trait loci. HGG Adv. 2024; 5(2):100275. PMC: 10881423. DOI: 10.1016/j.xhgg.2024.100275. View

3.
Turner A, Hu S, Mosquera J, Ma W, Hodonsky C, Wong D . Single-nucleus chromatin accessibility profiling highlights regulatory mechanisms of coronary artery disease risk. Nat Genet. 2022; 54(6):804-816. PMC: 9203933. DOI: 10.1038/s41588-022-01069-0. View

4.
Bailey T, Krajewski P, Ladunga I, Lefebvre C, Li Q, Liu T . Practical guidelines for the comprehensive analysis of ChIP-seq data. PLoS Comput Biol. 2013; 9(11):e1003326. PMC: 3828144. DOI: 10.1371/journal.pcbi.1003326. View

5.
Maurano M, Humbert R, Rynes E, Thurman R, Haugen E, Wang H . Systematic localization of common disease-associated variation in regulatory DNA. Science. 2012; 337(6099):1190-5. PMC: 3771521. DOI: 10.1126/science.1222794. View