» Articles » PMID: 17137509

Cis-motifs Upstream of the Transcription and Translation Initiation Sites Are Effectively Revealed by Their Positional Disequilibrium in Eukaryote Genomes Using Frequency Distribution Curves

Overview
Publisher Biomed Central
Specialty Biology
Date 2006 Dec 2
PMID 17137509
Citations 30
Authors
Affiliations
Soon will be listed here.
Abstract

Background: The discovery of cis-regulatory motifs still remains a challenging task even though the number of sequenced genomes is constantly growing. Computational analyses using pattern search algorithms have been valuable in phylogenetic footprinting approaches as have expression profile experiments to predict co-occurring motifs. Surprisingly little is known about the nature of cis-regulatory element (CRE) distribution in promoters.

Results: In this paper we used the Motif Mapper open-source collection of visual basic scripts for the analysis of motifs in any aligned set of DNA sequences. We focused on promoter motif distribution curves to identify positional over-representation of DNA motifs. Using differentially aligned datasets from the model species Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster and Saccharomyces cerevisiae, we convincingly demonstrated the importance of the position and orientation for motif discovery. Analysis with known CREs and all possible hexanucleotides showed that some functional elements gather close to the transcription and translation initiation sites and that elements other than the TATA-box motif are conserved between eukaryote promoters. While a high background frequency usually decreases the effectiveness of such an enumerative investigation, we improved our analysis by conducting motif distribution maps using large datasets.

Conclusion: This is the first study to reveal positional over-representation of CREs and promoter motifs in a cross-species approach. CREs and motifs shared between eukaryotic promoters support the observation that an eukaryotic promoter structure has been conserved throughout evolutionary time. Furthermore, with the information on positional enrichment of a motif or a known functional CRE, it is possible to get a more detailed insight into where an element appears to function. This in turn might accelerate the in depth examination of known and yet unknown cis-regulatory sequences in the laboratory.

Citing Articles

Comprehensive identification of GASA genes in sunflower and expression profiling in response to drought.

Asad Ullah M, Ahmed M, AlHusnain L, Zia M, AlKahtani M, Attia K BMC Genomics. 2024; 25(1):954.

PMID: 39402437 PMC: 11472593. DOI: 10.1186/s12864-024-10860-8.


Transcription factors organize into functional groups on the linear genome and in 3D chromatin.

Vadnala R, Hannenhalli S, Narlikar L, Siddharthan R Heliyon. 2023; 9(8):e18211.

PMID: 37520992 PMC: 10382302. DOI: 10.1016/j.heliyon.2023.e18211.


Prediction of Rice Transcription Start Sites Using TransPrise: A Novel Machine Learning Approach.

Pachganov S, Murtazalieva K, Zarubin A, Taran T, Chartier D, Tatarinova T Methods Mol Biol. 2021; 2238:261-274.

PMID: 33471337 DOI: 10.1007/978-1-0716-1068-8_17.


TransPrise: a novel machine learning approach for eukaryotic promoter prediction.

Pachganov S, Murtazalieva K, Zarubin A, Sokolov D, Chartier D, Tatarinova T PeerJ. 2019; 7:e7990.

PMID: 31695967 PMC: 6827441. DOI: 10.7717/peerj.7990.


Phylogenetic Analyses and GAGA-Motif Binding Studies of BBR/BPC Proteins Lend to Clues in GAGA-Motif Recognition and a Regulatory Role in Brassinosteroid Signaling.

Theune M, Bloss U, Brand L, Ladwig F, Wanke D Front Plant Sci. 2019; 10:466.

PMID: 31057577 PMC: 6477699. DOI: 10.3389/fpls.2019.00466.


References
1.
Ulmasov T, Murfett J, Hagen G, Guilfoyle T . Aux/IAA proteins repress expression of reporter genes containing natural and highly active synthetic auxin response elements. Plant Cell. 1997; 9(11):1963-71. PMC: 157050. DOI: 10.1105/tpc.9.11.1963. View

2.
Katti M, Sakharkar M, Ranjekar P, Gupta V . TRES: comparative promoter sequence analysis. Bioinformatics. 2000; 16(8):739-40. DOI: 10.1093/bioinformatics/16.8.739. View

3.
Hampsey M . Molecular genetics of the RNA polymerase II general transcriptional machinery. Microbiol Mol Biol Rev. 1998; 62(2):465-503. PMC: 98922. DOI: 10.1128/MMBR.62.2.465-503.1998. View

4.
Higo K, Ugawa Y, Iwamoto M, Korenaga T . Plant cis-acting regulatory DNA elements (PLACE) database: 1999. Nucleic Acids Res. 1998; 27(1):297-300. PMC: 148163. DOI: 10.1093/nar/27.1.297. View

5.
Zhu J, Zhang M . SCPD: a promoter database of the yeast Saccharomyces cerevisiae. Bioinformatics. 1999; 15(7-8):607-11. DOI: 10.1093/bioinformatics/15.7.607. View