» Articles » PMID: 25704908

Identifying Transcriptional Cis-regulatory Modules in Animal Genomes

Overview
Specialty Biology
Date 2015 Feb 24
PMID 25704908
Citations 37
Authors
Affiliations
Soon will be listed here.
Abstract

Unlabelled: Gene expression is regulated through the activity of transcription factors (TFs) and chromatin-modifying proteins acting on specific DNA sequences, referred to as cis-regulatory elements. These include promoters, located at the transcription initiation sites of genes, and a variety of distal cis-regulatory modules (CRMs), the most common of which are transcriptional enhancers. Because regulated gene expression is fundamental to cell differentiation and acquisition of new cell fates, identifying, characterizing, and understanding the mechanisms of action of CRMs is critical for understanding development. CRM discovery has historically been challenging, as CRMs can be located far from the genes they regulate, have few readily identifiable sequence characteristics, and for many years were not amenable to high-throughput discovery methods. However, the recent availability of complete genome sequences and the development of next-generation sequencing methods have led to an explosion of both computational and empirical methods for CRM discovery in model and nonmodel organisms alike. Experimentally, CRMs can be identified through chromatin immunoprecipitation directed against TFs or histone post-translational modifications, identification of nucleosome-depleted 'open' chromatin regions, or sequencing-based high-throughput functional screening. Computational methods include comparative genomics, clustering of known or predicted TF-binding sites, and supervised machine-learning approaches trained on known CRMs. All of these methods have proven effective for CRM discovery, but each has its own considerations and limitations, and each is subject to a greater or lesser number of false-positive identifications. Experimental confirmation of predictions is essential, although shortcomings in current methods suggest that additional means of validation need to be developed. For further resources related to this article, please visit the WIREs website.

Conflict Of Interest: The authors have declared no conflicts of interest for this article.

Citing Articles

SCRMshaw: Supervised cis-regulatory module prediction for insect genomes.

Asma H, Liu L, Halfon M PLoS One. 2024; 19(12):e0311752.

PMID: 39637210 PMC: 11620701. DOI: 10.1371/journal.pone.0311752.


Regulatory genome annotation of 33 insect species.

Asma H, Tieke E, Deem K, Rahmat J, Dong T, Huang X Elife. 2024; 13.

PMID: 39392676 PMC: 11469670. DOI: 10.7554/eLife.96738.


Evolution of Microglia.

Guffart E, Prinz M Adv Neurobiol. 2024; 37:39-51.

PMID: 39207685 DOI: 10.1007/978-3-031-55529-9_3.


Conserved and novel enhancers in the Aedes aegypti single-minded locus recapitulate embryonic ventral midline gene expression.

Schember I, Reid W, Sterling-Lentsch G, Halfon M PLoS Genet. 2024; 20(4):e1010891.

PMID: 38683842 PMC: 11081499. DOI: 10.1371/journal.pgen.1010891.


Towards a comprehensive regulatory map of Mammalian Genomes.

Mangetti Goncalves T, Stewart C, Baxley S, Xu J, Li D, Gabel H Res Sq. 2023; .

PMID: 37841836 PMC: 10571623. DOI: 10.21203/rs.3.rs-3294408/v1.


References
1.
Sanyal A, Lajoie B, Jain G, Dekker J . The long-range interaction landscape of gene promoters. Nature. 2012; 489(7414):109-13. PMC: 3555147. DOI: 10.1038/nature11279. View

2.
Berger M, Philippakis A, Qureshi A, He F, Estep 3rd P, Bulyk M . Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities. Nat Biotechnol. 2006; 24(11):1429-35. PMC: 4419707. DOI: 10.1038/nbt1246. View

3.
Cho K . Enhancers. Wiley Interdiscip Rev Dev Biol. 2013; 1(4):469-78. PMC: 3979590. DOI: 10.1002/wdev.53. View

4.
Rhee H, Pugh B . Comprehensive genome-wide protein-DNA interactions detected at single-nucleotide resolution. Cell. 2011; 147(6):1408-19. PMC: 3243364. DOI: 10.1016/j.cell.2011.11.013. View

5.
Kazemian M, Zhu Q, Halfon M, Sinha S . Improved accuracy of supervised CRM discovery with interpolated Markov models and cross-species comparison. Nucleic Acids Res. 2011; 39(22):9463-72. PMC: 3239187. DOI: 10.1093/nar/gkr621. View