ScerTF: a Comprehensive Database of Benchmarked Position Weight Matrices for Saccharomyces Species

Overview
Specialty Biochemistry
Date 2011 Dec 6
PMID 22140105
Citations 52
Abstract

Saccharomyces cerevisiae is a primary model for studies of transcriptional control, and the specificities of most yeast transcription factors (TFs) have been determined by multiple methods. However, it is unclear which position weight matrices (PWMs) are most useful; for the roughly 200 TFs in yeast, there are over 1200 PWMs in the literature. To address this issue, we created ScerTF, a comprehensive database of 1226 motifs from 11 different sources. We identified a single matrix for each TF that best predicts in vivo data by benchmarking matrices against chromatin immunoprecipitation and TF deletion experiments. We also used in vivo data to optimize thresholds for identifying regulatory sites with each matrix. To correct for biases from different methods, we developed a strategy to combine matrices. These aligned matrices outperform the best available matrix for several TFs. We used the matrices to predict co-occurring regulatory elements in the genome and identified many known TF combinations. In addition, we predict new combinations and provide evidence of combinatorial regulation from gene expression data. The database is available through a web interface at http://ural.wustl.edu/ScerTF. The site allows users to search the database with a regulatory site or matrix to identify the TFs most likely to bind the input sequence.
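The abstract describes identifying regulatory sites by scoring sequence against a position weight matrix and applying a per-matrix threshold. As a rough illustration of that general idea, the sketch below converts base counts to log-odds scores and reports windows that meet a cutoff. It is not ScerTF's own procedure: the toy counts, the pseudocount, the uniform background frequency, the threshold value, and all function names are assumptions made for this example, and it scans only one strand of an uppercase A/C/G/T sequence.

```python
import math

BASES = "ACGT"

def counts_to_log_odds(counts, pseudocount=1.0, background=0.25):
    """Turn per-position base counts into log2-odds scores.

    The pseudocount and uniform background are illustrative assumptions.
    """
    pwm = []
    for column in counts:
        total = sum(column[b] for b in BASES) + 4 * pseudocount
        pwm.append({
            b: math.log2(((column[b] + pseudocount) / total) / background)
            for b in BASES
        })
    return pwm

def score_window(pwm, window):
    """Sum the per-position log-odds scores for one window."""
    return sum(col[base] for col, base in zip(pwm, window))

def scan(pwm, sequence, threshold):
    """Return (start, window, score) for every window scoring >= threshold."""
    width = len(pwm)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        score = score_window(pwm, window)
        if score >= threshold:
            hits.append((start, window, score))
    return hits

# Hypothetical 4-column motif that strongly favors "TGAC" (not a real ScerTF matrix).
TOY_COUNTS = [
    {"A": 0, "C": 0, "G": 0, "T": 10},
    {"A": 0, "C": 0, "G": 10, "T": 0},
    {"A": 10, "C": 0, "G": 0, "T": 0},
    {"A": 0, "C": 10, "G": 0, "T": 0},
]

if __name__ == "__main__":
    pwm = counts_to_log_odds(TOY_COUNTS)
    for start, window, score in scan(pwm, "AATGACGG", threshold=4.0):
        print(start, window, round(score, 2))
```

In practice the threshold would be tuned per matrix against experimental binding data, as the abstract indicates; here it is simply a fixed number chosen so that only exact matches to the toy motif pass.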

Citing Articles

Identification of DNA motif pairs on paired sequences based on composite heterogeneous graph.

Wu Q, Li Y, Wang Q, Zhao X, Sun D, Liu B. Front Genet. 2024; 15:1424085.

PMID: 38952710 PMC: 11215013. DOI: 10.3389/fgene.2024.1424085.


Investigating pioneer factor activity and its coordination with chromatin remodelers using integrated synthetic oligo assay.

Chen H, Yan C, Dhasarathy A, Kladde M, Bai L. STAR Protoc. 2023; 4(2):102279.

PMID: 37289591 PMC: 10323128. DOI: 10.1016/j.xpro.2023.102279.


Differential Hsp90-dependent gene expression is strain-specific and common among yeast strains.

Hung P, Liao C, Ko F, Tsai H, Leu J. iScience. 2023; 26(5):106635.

PMID: 37138775 PMC: 10149407. DOI: 10.1016/j.isci.2023.106635.


Zinc cluster transcription factors frequently activate target genes using a non-canonical half-site binding mode.

Recio P, Mitra N, Shively C, Song D, Jaramillo G, Lewis K. Nucleic Acids Res. 2023; 51(10):5006-5021.

PMID: 37125648 PMC: 10250231. DOI: 10.1093/nar/gkad320.


Chance promoter activities illuminate the origins of eukaryotic intergenic transcriptions.

Xu H, Li C, Xu C, Zhang J. Nat Commun. 2023; 14(1):1826.

PMID: 37005399 PMC: 10067814. DOI: 10.1038/s41467-023-37610-w.

