» Articles » PMID: 17963524

HeliCis: a DNA Motif Discovery Tool for Colocalized Motif Pairs with Periodic Spacing

Overview
Publisher Biomed Central
Specialty Biology
Date 2007 Oct 30
PMID 17963524
Citations 6
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Correct temporal and spatial gene expression during metazoan development relies on combinatorial interactions between different transcription factors. As a consequence, cis-regulatory elements often colocalize in clusters termed cis-regulatory modules. These may have requirements on organizational features such as spacing, order and helical phasing (periodic spacing) between binding sites. Due to the turning of the DNA helix, a small modification of the distance between a pair of sites may sometimes drastically disrupt function, while insertion of a full helical turn of DNA (10-11 bp) between cis elements may cause functionality to be restored. Recently, de novo motif discovery methods which incorporate organizational properties such as colocalization and order preferences have been developed, but there are no tools which incorporate periodic spacing into the model.

Results: We have developed a web based motif discovery tool, HeliCis, which features a flexible model which allows de novo detection of motifs with periodic spacing. Depending on the parameter settings it may also be used for discovering colocalized motifs without periodicity or motifs separated by a fixed gap of known or unknown length. We show on simulated data that it can efficiently capture the synergistic effects of colocalization and periodic spacing to improve detection of weak DNA motifs. It provides a simple to use web interface which interactively visualizes the current settings and thereby makes it easy to understand the parameters and the model structure.

Conclusion: HeliCis provides simple and efficient de novo discovery of colocalized DNA motif pairs, with or without periodic spacing. Our evaluations show that it can detect weak periodic patterns which are not easily discovered using a sequential approach, i.e. first finding the binding sites and second analyzing the properties of their pairwise distances.

Citing Articles

periodicDNA: an R/Bioconductor package to investigate k-mer periodicity in DNA.

Serizay J, Ahringer J F1000Res. 2021; 10:141.

PMID: 33953908 PMC: 8063535. DOI: 10.12688/f1000research.51143.1.


Unsupervised statistical discovery of spaced motifs in prokaryotic genomes.

Tong H, Schliekelman P, Mrazek J BMC Genomics. 2017; 18(1):27.

PMID: 28056763 PMC: 5217627. DOI: 10.1186/s12864-016-3400-0.


Identification of the RNA recognition element of the RBPMS family of RNA-binding proteins and their transcriptome-wide mRNA targets.

Farazi T, Leonhardt C, Mukherjee N, Mihailovic A, Li S, Max K RNA. 2014; 20(7):1090-102.

PMID: 24860013 PMC: 4114688. DOI: 10.1261/rna.045005.114.


Active motif finder - a bio-tool based on mutational structures in DNA sequences.

Udayakumar M, Shanmuga-Priya P, Hemavathi K, Seenivasagam R J Biomed Res. 2013; 25(6):444-8.

PMID: 23554723 PMC: 3596725. DOI: 10.1016/S1674-8301(11)60059-6.


HOCOMOCO: a comprehensive collection of human transcription factor binding sites models.

Kulakovskiy I, Medvedeva Y, Schaefer U, Kasianov A, Vorontsov I, Bajic V Nucleic Acids Res. 2012; 41(Database issue):D195-202.

PMID: 23175603 PMC: 3531053. DOI: 10.1093/nar/gks1089.


References
1.
Makeev V, Lifanov A, Nazina A, Papatsenko D . Distance preferences in the arrangement of binding motifs and hierarchical levels in organization of transcription regulatory information. Nucleic Acids Res. 2003; 31(20):6016-26. PMC: 219477. DOI: 10.1093/nar/gkg799. View

2.
Dryer R, Covey L . A novel NF-kappa B-regulated site within the human I gamma 1 promoter requires p300 for optimal transcriptional activity. J Immunol. 2005; 175(7):4499-507. DOI: 10.4049/jimmunol.175.7.4499. View

3.
Zhou Q, Wong W . CisModule: de novo discovery of cis-regulatory modules by hierarchical mixture modeling. Proc Natl Acad Sci U S A. 2004; 101(33):12114-9. PMC: 514443. DOI: 10.1073/pnas.0402858101. View

4.
Bailey T, Elkan C . Fitting a mixture model by expectation maximization to discover motifs in biopolymers. Proc Int Conf Intell Syst Mol Biol. 1994; 2:28-36. View

5.
Wang Z, Wang D, Hockemeyer D, McAnally J, Nordheim A, Olson E . Myocardin and ternary complex factors compete for SRF to control smooth muscle gene expression. Nature. 2004; 428(6979):185-9. DOI: 10.1038/nature02382. View