» Articles » PMID: 34567058

Systematic Evaluation of DNA Sequence Variations on Transcription Factor Binding Affinity

Overview
Journal Front Genet
Date 2021 Sep 27
PMID 34567058
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

The majority of the single nucleotide variants (SNVs) identified by genome-wide association studies (GWAS) fall outside of the protein-coding regions. Elucidating the functional implications of these variants has been a major challenge. A possible mechanism for functional non-coding variants is that they disrupted the canonical transcription factor (TF) binding sites that affect the binding of the TF. However, their impact varies since many positions within a TF binding motif are not well conserved. Therefore, simply annotating all variants located in putative TF binding sites may overestimate the functional impact of these SNVs. We conducted a comprehensive survey to study the effect of SNVs on the TF binding affinity. A sequence-based machine learning method was used to estimate the change in binding affinity for each SNV located inside a putative motif site. From the results obtained on 18 TF binding motifs, we found that there is a substantial variation in terms of a SNV's impact on TF binding affinity. We found that only about 20% of SNVs located inside putative TF binding sites would likely to have significant impact on the TF-DNA binding.

Citing Articles

Molecular mechanism of HNF-1A-mediated HNF4A gene regulation and promoter-driven HNF4A-MODY diabetes.

Kind L, Molnes J, Tjora E, Raasakka A, Myllykoski M, Colclough K JCI Insight. 2024; 9(11).

PMID: 38855865 PMC: 11382887. DOI: 10.1172/jci.insight.175278.


Comparative analysis of models in predicting the effects of SNPs on TF-DNA binding using large-scale in vitro and in vivo data.

Han D, Li Y, Wang L, Liang X, Miao Y, Li W Brief Bioinform. 2024; 25(2).

PMID: 38517697 PMC: 10959158. DOI: 10.1093/bib/bbae110.


SNPs in 3'UTR miRNA Target Sequences Associated with Individual Drug Susceptibility.

Rykova E, Ershov N, Damarov I, Merkulova T Int J Mol Sci. 2022; 23(22).

PMID: 36430200 PMC: 9692299. DOI: 10.3390/ijms232213725.


Interrogating the Human Diplome: Computational Methods, Emerging Applications, and Challenges.

Chan A, Choi Y, Rangan A, Zhang G, Podder A, Berens M Methods Mol Biol. 2022; 2590:1-30.

PMID: 36335489 DOI: 10.1007/978-1-0716-2819-5_1.

References
1.
Ionita-Laza I, McCallum K, Xu B, Buxbaum J . A spectral approach integrating functional genomic annotations for coding and noncoding variants. Nat Genet. 2016; 48(2):214-20. PMC: 4731313. DOI: 10.1038/ng.3477. View

2.
Xu T, Li B, Zhao M, Szulwach K, Street R, Lin L . Base-resolution methylation patterns accurately predict transcription factor bindings in vivo. Nucleic Acids Res. 2015; 43(5):2757-66. PMC: 4357735. DOI: 10.1093/nar/gkv151. View

3.
Cookson W, Liang L, Abecasis G, Moffatt M, Lathrop M . Mapping complex disease traits with global gene expression. Nat Rev Genet. 2009; 10(3):184-94. PMC: 4550035. DOI: 10.1038/nrg2537. View

4.
Pasquali L, Gaulton K, Rodriguez-Segui S, Mularoni L, Miguel-Escalada I, Akerman I . Pancreatic islet enhancer clusters enriched in type 2 diabetes risk-associated variants. Nat Genet. 2014; 46(2):136-143. PMC: 3935450. DOI: 10.1038/ng.2870. View

5.
Vinga S . Information theory applications for biological sequence analysis. Brief Bioinform. 2013; 15(3):376-89. PMC: 7109941. DOI: 10.1093/bib/bbt068. View