» Articles » PMID: 14534166

Searching for Statistically Significant Regulatory Modules

Overview
Journal Bioinformatics
Specialty Biology
Date 2003 Oct 10
PMID 14534166
Citations 73
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: The regulatory machinery controlling gene expression is complex, frequently requiring multiple, simultaneous DNA-protein interactions. The rate at which a gene is transcribed may depend upon the presence or absence of a collection of transcription factors bound to the DNA near the gene. Locating transcription factor binding sites in genomic DNA is difficult because the individual sites are small and tend to occur frequently by chance. True binding sites may be identified by their tendency to occur in clusters, sometimes known as regulatory modules.

Results: We describe an algorithm for detecting occurrences of regulatory modules in genomic DNA. The algorithm, called mcast, takes as input a DNA database and a collection of binding site motifs that are known to operate in concert. mcast uses a motif-based hidden Markov model with several novel features. The model incorporates motif-specific p-values, thereby allowing scores from motifs of different widths and specificities to be compared directly. The p-value scoring also allows mcast to only accept motif occurrences with significance below a user-specified threshold, while still assigning better scores to motif occurrences with lower p-values. mcast can search long DNA sequences, modeling length distributions between motifs within a regulatory module, but ignoring length distributions between modules. The algorithm produces a list of predicted regulatory modules, ranked by E-value. We validate the algorithm using simulated data as well as real data sets from fruitfly and human.

Availability: http://meme.sdsc.edu/MCAST/paper

Citing Articles

BestCRM: An Exhaustive Search for Optimal Cis-Regulatory Modules in Promoters Accelerated by the Multidimensional Hash Function.

Deyneko I Int J Mol Sci. 2024; 25(3).

PMID: 38339181 PMC: 10856692. DOI: 10.3390/ijms25031903.


In-silico identification and comparison of transcription factor binding sites cluster in anterior-posterior patterning genes in Drosophila melanogaster and Tribolium castaneum.

Moudgil A, Sobti R, Kaur T PLoS One. 2023; 18(8):e0290035.

PMID: 37590227 PMC: 10434971. DOI: 10.1371/journal.pone.0290035.


A survey on algorithms to characterize transcription factor binding sites.

Tognon M, Giugno R, Pinello L Brief Bioinform. 2023; 24(3).

PMID: 37099664 PMC: 10422928. DOI: 10.1093/bib/bbad156.


Dimeric p53 Mutant Elicits Unique Tumor-Suppressive Activities through an Altered Metabolic Program.

Gencel-Augusto J, Su X, Qi Y, Whitley E, Pant V, Xiong S Cancer Discov. 2023; 13(5):1230-1249.

PMID: 37067911 PMC: 10164062. DOI: 10.1158/2159-8290.CD-22-0872.


Replication stress generates distinctive landscapes of DNA copy number alterations and chromosome scale losses.

Shaikh N, Mazzagatti A, De Angelis S, Johnson S, Bakker B, Spierings D Genome Biol. 2022; 23(1):223.

PMID: 36266663 PMC: 9583511. DOI: 10.1186/s13059-022-02781-0.