» Articles » PMID: 20378718

Multiplexed Massively Parallel SELEX for Characterization of Human Transcription Factor Binding Specificities

Abstract

The genetic code-the binding specificity of all transfer-RNAs--defines how protein primary structure is determined by DNA sequence. DNA also dictates when and where proteins are expressed, and this information is encoded in a pattern of specific sequence motifs that are recognized by transcription factors. However, the DNA-binding specificity is only known for a small fraction of the approximately 1400 human transcription factors (TFs). We describe here a high-throughput method for analyzing transcription factor binding specificity that is based on systematic evolution of ligands by exponential enrichment (SELEX) and massively parallel sequencing. The method is optimized for analysis of large numbers of TFs in parallel through the use of affinity-tagged proteins, barcoded selection oligonucleotides, and multiplexed sequencing. Data are analyzed by a new bioinformatic platform that uses the hundreds of thousands of sequencing reads obtained to control the quality of the experiments and to generate binding motifs for the TFs. The described technology allows higher throughput and identification of much longer binding profiles than current microarray-based methods. In addition, as our method is based on proteins expressed in mammalian cells, it can also be used to characterize DNA-binding preferences of full-length proteins or proteins requiring post-translational modifications. We validate the method by determining binding specificities of 14 different classes of TFs and by confirming the specificities for NFATC1 and RFX3 using ChIP-seq. Our results reveal unexpected dimeric modes of binding for several factors that were thought to preferentially bind DNA as monomers.

Citing Articles

Biological Switches: Past and Future Milestones of Transcription Factor-Based Biosensors.

De Paepe B, De Mey M ACS Synth Biol. 2024; 14(1):72-86.

PMID: 39709556 PMC: 11745168. DOI: 10.1021/acssynbio.4c00689.


Genome-wide single-cell and single-molecule footprinting of transcription factors with deaminase.

He R, Dong W, Wang Z, Xie C, Gao L, Ma W Proc Natl Acad Sci U S A. 2024; 121(52):e2423270121.

PMID: 39689177 PMC: 11670102. DOI: 10.1073/pnas.2423270121.


Cross-platform DNA motif discovery and benchmarking to explore binding specificities of poorly studied human transcription factors.

Vorontsov I, Kozin I, Abramov S, Boytsov A, Jolma A, Albu M bioRxiv. 2024; .

PMID: 39605530 PMC: 11601219. DOI: 10.1101/2024.11.11.619379.


GHT-SELEX demonstrates unexpectedly high intrinsic sequence specificity and complex DNA binding of many human transcription factors.

Jolma A, Hernandez-Corchado A, Yang A, Fathi A, Laverty K, Brechalov A bioRxiv. 2024; .

PMID: 39605368 PMC: 11601218. DOI: 10.1101/2024.11.11.618478.


Extensive binding of uncharacterized human transcription factors to genomic dark matter.

Razavi R, Fathi A, Yellan I, Brechalov A, Laverty K, Jolma A bioRxiv. 2024; .

PMID: 39605320 PMC: 11601254. DOI: 10.1101/2024.11.11.622123.


References
1.
Berger M, Philippakis A, Qureshi A, He F, Estep 3rd P, Bulyk M . Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities. Nat Biotechnol. 2006; 24(11):1429-35. PMC: 4419707. DOI: 10.1038/nbt1246. View

2.
Kroeger P, Morimoto R . Selection of new HSF1 and HSF2 DNA-binding sites reveals difference in trimer cooperativity. Mol Cell Biol. 1994; 14(11):7592-603. PMC: 359295. DOI: 10.1128/mcb.14.11.7592-7603.1994. View

3.
Roulet E, Busso S, Camargo A, Simpson A, Mermod N, Bucher P . High-throughput SELEX SAGE method for quantitative modeling of transcription-factor binding sites. Nat Biotechnol. 2002; 20(8):831-5. DOI: 10.1038/nbt718. View

4.
Berger M, Badis G, Gehrke A, Talukder S, Philippakis A, Pena-Castillo L . Variation in homeodomain DNA binding revealed by high-resolution analysis of sequence preferences. Cell. 2008; 133(7):1266-76. PMC: 2531161. DOI: 10.1016/j.cell.2008.05.024. View

5.
Bryne J, Valen E, Tang M, Marstrand T, Winther O, da Piedade I . JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update. Nucleic Acids Res. 2007; 36(Database issue):D102-6. PMC: 2238834. DOI: 10.1093/nar/gkm955. View