
Base-resolution Methylation Patterns Accurately Predict Transcription Factor Bindings in Vivo

Overview
Specialty: Biochemistry
Date: 2015 Feb 28
PMID: 25722376
Citations: 29
Abstract

Detecting in vivo transcription factor (TF) binding is important for understanding gene regulatory circuitries. ChIP-seq is a powerful technique to empirically define TF binding in vivo. However, the multitude of distinct TFs makes genome-wide profiling for them all labor-intensive and costly. Algorithms for in silico prediction of TF binding have been developed, based mostly on histone modification or DNase I hypersensitivity data in conjunction with DNA motif and other genomic features. However, technical limitations of these methods prevent them from being applied broadly, especially in clinical settings. We conducted a comprehensive survey involving multiple cell lines, TFs, and methylation types and found that there are intimate relationships between TF binding and methylation level changes around the binding sites. Exploiting the connection between DNA methylation and TF binding, we proposed a novel supervised learning approach to predict TF-DNA interaction using data from base-resolution whole-genome methylation sequencing experiments. We devised beta-binomial models to characterize methylation data around TF binding sites and the background. Along with other static genomic features, we adopted a random forest framework to predict TF-DNA interaction. After conducting comprehensive tests, we saw that the proposed method accurately predicts TF binding and performs favorably versus competing methods.
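The beta-binomial idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the shape parameters, the function names, and the use of a summed log-likelihood ratio as a single feature are all assumptions made for illustration. The sketch scores the per-CpG read counts in a window under a hypothetical "bound" model and a hypothetical "background" model and reports how much better the bound model fits.

```python
from math import lgamma


def log_beta(a, b):
    """Log of the Beta function B(a, b), computed via log-gamma."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def betabinom_logpmf(k, n, alpha, beta):
    """Log-probability of k methylated reads out of n under Beta-Binomial(alpha, beta)."""
    log_choose = lgamma(n + 1) - lgamma(k + 1) - lgamma(n - k + 1)
    return log_choose + log_beta(k + alpha, n - k + beta) - log_beta(alpha, beta)


def log_likelihood_ratio(counts, bound_params, background_params):
    """Sum over CpGs of log P(counts | bound) - log P(counts | background).

    counts: list of (methylated_reads, total_reads) pairs, one per CpG.
    Positive values favor the bound model; negative values favor background.
    """
    return sum(
        betabinom_logpmf(k, n, *bound_params)
        - betabinom_logpmf(k, n, *background_params)
        for k, n in counts
    )


# Hypothetical shape parameters (assumed, not taken from the paper):
# low methylation near bound sites, high methylation in the background.
BOUND = (1.0, 9.0)        # mean methylation level 0.1
BACKGROUND = (8.0, 2.0)   # mean methylation level 0.8

hypomethylated = [(1, 20), (0, 15), (2, 30)]
hypermethylated = [(18, 20), (14, 15), (27, 30)]

print(log_likelihood_ratio(hypomethylated, BOUND, BACKGROUND))   # positive
print(log_likelihood_ratio(hypermethylated, BOUND, BACKGROUND))  # negative
```

In a pipeline like the one the abstract describes, a score of this kind could be passed to a random forest classifier as one feature alongside static genomic features. In practice the shape parameters would be estimated from training data rather than fixed by hand.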

Citing Articles

SIGRN: Inferring Gene Regulatory Network with Soft Introspective Variational Autoencoders.

Li R, Wu J, Li G, Liu J, Liu J, Xuan J. Int J Mol Sci. 2024; 25(23).

PMID: 39684451 PMC: 11641499. DOI: 10.3390/ijms252312741.


Using methylation data to improve transcription factor binding prediction.

Morgan D, DeMeo D, Glass K. Epigenetics. 2024; 19(1):2309826.

PMID: 38300850 PMC: 10841018. DOI: 10.1080/15592294.2024.2309826.


Modeling methyl-sensitive transcription factor motifs with an expanded epigenetic alphabet.

Viner C, Ishak C, Johnson J, Walker N, Shi H, Sjoberg-Herrera M. Genome Biol. 2024; 25(1):11.

PMID: 38191487 PMC: 10773111. DOI: 10.1186/s13059-023-03070-0.


Databases and prospects of dynamic gene regulation in eukaryotes: A mini review.

Chow C, Yang C, Chang W. Comput Struct Biotechnol J. 2023; 21:2147-2159.

PMID: 37013004 PMC: 10066511. DOI: 10.1016/j.csbj.2023.03.032.


Toward a base-resolution panorama of the in vivo impact of cytosine methylation on transcription factor binding.

Hernandez-Corchado A, Najafabadi H. Genome Biol. 2022; 23(1):151.

PMID: 35799193 PMC: 9264634. DOI: 10.1186/s13059-022-02713-y.

