» Articles » PMID: 19997485

Inferring Binding Energies from Selected Binding Sites

Overview
Specialty Biology
Date 2009 Dec 10
PMID 19997485
Citations 122
Authors
Affiliations
Soon will be listed here.
Abstract

We employ a biophysical model that accounts for the non-linear relationship between binding energy and the statistics of selected binding sites. The model includes the chemical potential of the transcription factor, non-specific binding affinity of the protein for DNA, as well as sequence-specific parameters that may include non-independent contributions of bases to the interaction. We obtain maximum likelihood estimates for all of the parameters and compare the results to standard probabilistic methods of parameter estimation. On simulated data, where the true energy model is known and samples are generated with a variety of parameter values, we show that our method returns much more accurate estimates of the true parameters and much better predictions of the selected binding site distributions. We also introduce a new high-throughput SELEX (HT-SELEX) procedure to determine the binding specificity of a transcription factor in which the initial randomized library and the selected sites are sequenced with next generation methods that return hundreds of thousands of sites. We show that after a single round of selection our method can estimate binding parameters that give very good fits to the selected site distributions, much better than standard motif identification algorithms.

Citing Articles

ShapeME: A tool and web front-end for de novo discovery of structural motifs underpinning protein-DNA interactions.

Schroeder J, Wolfe M, Freddolino L bioRxiv. 2025; .

PMID: 39975017 PMC: 11838363. DOI: 10.1101/2025.01.28.635290.


Active learning of enhancers and silencers in the developing neural retina.

Friedman R, Ramu A, Lichtarge S, Wu Y, Tripp L, Lyon D Cell Syst. 2025; 16(1):101163.

PMID: 39778579 PMC: 11827711. DOI: 10.1016/j.cels.2024.12.004.


Geometric deep learning of protein-DNA binding specificity.

Mitra R, Li J, Sagendorf J, Jiang Y, Cohen A, Chiu T Nat Methods. 2024; 21(9):1674-1683.

PMID: 39103447 PMC: 11399107. DOI: 10.1038/s41592-024-02372-w.


Active learning of enhancer and silencer regulatory grammar in photoreceptors.

Friedman R, Ramu A, Lichtarge S, Myers C, Granas D, Gause M bioRxiv. 2023; .

PMID: 37662358 PMC: 10473580. DOI: 10.1101/2023.08.21.554146.


Physicochemical models of protein-DNA binding with standard and modified base pairs.

Chiu T, Rao S, Rohs R Proc Natl Acad Sci U S A. 2023; 120(4):e2205796120.

PMID: 36656856 PMC: 9942898. DOI: 10.1073/pnas.2205796120.


References
1.
Zhang M, Marr T . A weight array method for splicing signal analysis. Comput Appl Biosci. 1993; 9(5):499-509. DOI: 10.1093/bioinformatics/9.5.499. View

2.
Linnell J, Mott R, Field S, Kwiatkowski D, Ragoussis J, Udalova I . Quantitative high-throughput analysis of transcription factor binding specificities. Nucleic Acids Res. 2004; 32(4):e44. PMC: 390317. DOI: 10.1093/nar/gnh042. View

3.
Benos P, Bulyk M, Stormo G . Additivity in protein-DNA interactions: how good an approximation is it?. Nucleic Acids Res. 2002; 30(20):4442-51. PMC: 137142. DOI: 10.1093/nar/gkf578. View

4.
Wright W, Binder M, Funk W . Cyclic amplification and selection of targets (CASTing) for the myogenin consensus binding site. Mol Cell Biol. 1991; 11(8):4104-10. PMC: 361222. DOI: 10.1128/mcb.11.8.4104-4110.1991. View

5.
Stormo G, Fields D . Specificity, free energy and information content in protein-DNA interactions. Trends Biochem Sci. 1998; 23(3):109-13. DOI: 10.1016/s0968-0004(98)01187-6. View