Systematic Evaluation of DNA Sequence Variations on Transcription Factor Binding Affinity
Overview
Affiliations
The majority of the single nucleotide variants (SNVs) identified by genome-wide association studies (GWAS) fall outside of the protein-coding regions. Elucidating the functional implications of these variants has been a major challenge. A possible mechanism for functional non-coding variants is that they disrupted the canonical transcription factor (TF) binding sites that affect the binding of the TF. However, their impact varies since many positions within a TF binding motif are not well conserved. Therefore, simply annotating all variants located in putative TF binding sites may overestimate the functional impact of these SNVs. We conducted a comprehensive survey to study the effect of SNVs on the TF binding affinity. A sequence-based machine learning method was used to estimate the change in binding affinity for each SNV located inside a putative motif site. From the results obtained on 18 TF binding motifs, we found that there is a substantial variation in terms of a SNV's impact on TF binding affinity. We found that only about 20% of SNVs located inside putative TF binding sites would likely to have significant impact on the TF-DNA binding.
Kind L, Molnes J, Tjora E, Raasakka A, Myllykoski M, Colclough K JCI Insight. 2024; 9(11).
PMID: 38855865 PMC: 11382887. DOI: 10.1172/jci.insight.175278.
Han D, Li Y, Wang L, Liang X, Miao Y, Li W Brief Bioinform. 2024; 25(2).
PMID: 38517697 PMC: 10959158. DOI: 10.1093/bib/bbae110.
SNPs in 3'UTR miRNA Target Sequences Associated with Individual Drug Susceptibility.
Rykova E, Ershov N, Damarov I, Merkulova T Int J Mol Sci. 2022; 23(22).
PMID: 36430200 PMC: 9692299. DOI: 10.3390/ijms232213725.
Interrogating the Human Diplome: Computational Methods, Emerging Applications, and Challenges.
Chan A, Choi Y, Rangan A, Zhang G, Podder A, Berens M Methods Mol Biol. 2022; 2590:1-30.
PMID: 36335489 DOI: 10.1007/978-1-0716-2819-5_1.