» Articles » PMID: 29617876

A Novel Method for Improved Accuracy of Transcription Factor Binding Site Prediction

Overview
Specialty Biochemistry
Date 2018 Apr 5
PMID 29617876
Citations 17
Authors
Affiliations
Soon will be listed here.
Abstract

Identifying transcription factor (TF) binding sites (TFBSs) is important in the computational inference of gene regulation. Widely used computational methods of TFBS prediction based on position weight matrices (PWMs) usually have high false positive rates. Moreover, computational studies of transcription regulation in eukaryotes frequently require numerous PWM models of TFBSs due to a large number of TFs involved. To overcome these problems we developed DRAF, a novel method for TFBS prediction that requires only 14 prediction models for 232 human TFs, while at the same time significantly improves prediction accuracy. DRAF models use more features than PWM models, as they combine information from TFBS sequences and physicochemical properties of TF DNA-binding domains into machine learning models. Evaluation of DRAF on 98 human ChIP-seq datasets shows on average 1.54-, 1.96- and 5.19-fold reduction of false positives at the same sensitivities compared to models from HOCOMOCO, TRANSFAC and DeepBind, respectively. This observation suggests that one can efficiently replace the PWM models for TFBS prediction by a small number of DRAF models that significantly improve prediction accuracy. The DRAF method is implemented in a web tool and in a stand-alone software freely available at http://cbrc.kaust.edu.sa/DRAF.

Citing Articles

Enhancer reprogramming: critical roles in cancer and promising therapeutic strategies.

Yang J, Zhou F, Luo X, Fang Y, Wang X, Liu X Cell Death Discov. 2025; 11(1):84.

PMID: 40032852 PMC: 11876437. DOI: 10.1038/s41420-025-02366-3.


Profiling conserved transcription factor binding motifs in Phaseolus vulgaris through comparative genomics.

Kondratova L, Vallejos C, Conesa A BMC Genomics. 2025; 26(1):169.

PMID: 39979816 PMC: 11841308. DOI: 10.1186/s12864-025-11309-2.


Abundant repressor binding sites in human enhancers are associated with the fine-tuning of gene regulation.

Song W, Ovcharenko I iScience. 2025; 28(1):111658.

PMID: 39868043 PMC: 11761325. DOI: 10.1016/j.isci.2024.111658.


The evaluation of transcription factor binding site prediction tools in human and Arabidopsis genomes.

Wanniarachchi D, Viswakula S, Wickramasuriya A BMC Bioinformatics. 2024; 25(1):371.

PMID: 39623329 PMC: 11613939. DOI: 10.1186/s12859-024-05995-0.


Recent advances in exploring transcriptional regulatory landscape of crops.

Huo Q, Song R, Ma Z Front Plant Sci. 2024; 15:1421503.

PMID: 38903438 PMC: 11188431. DOI: 10.3389/fpls.2024.1421503.


References
1.
Grant C, Bailey T, Noble W . FIMO: scanning for occurrences of a given motif. Bioinformatics. 2011; 27(7):1017-8. PMC: 3065696. DOI: 10.1093/bioinformatics/btr064. View

2.
Lefebvre C, Rieckhof G, Califano A . Reverse-engineering human regulatory networks. Wiley Interdiscip Rev Syst Biol Med. 2012; 4(4):311-25. PMC: 4128340. DOI: 10.1002/wsbm.1159. View

3.
Meysman P, Dang T, Laukens K, De Smet R, Wu Y, Marchal K . Use of structural DNA properties for the prediction of transcription-factor binding sites in Escherichia coli. Nucleic Acids Res. 2010; 39(2):e6. PMC: 3025552. DOI: 10.1093/nar/gkq1071. View

4.
Roulet E, Fisch I, Junier T, Bucher P, Mermod N . Evaluation of computer tools for the prediction of transcription factor binding sites on genomic DNA. In Silico Biol. 2001; 1(1):21-8. View

5.
Chen C, Chien T, Lin C, Lin C, Weng Y, Tien-Hao Chang D . Predicting target DNA sequences of DNA-binding proteins based on unbound structures. PLoS One. 2012; 7(2):e30446. PMC: 3270014. DOI: 10.1371/journal.pone.0030446. View