» Articles » PMID: 26721389

ConBind: Motif-aware Cross-species Alignment for the Identification of Functional Transcription Factor Binding Sites

Overview
Specialty Biochemistry
Date 2016 Jan 2
PMID 26721389
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Eukaryotic gene expression is regulated by transcription factors (TFs) binding to promoter as well as distal enhancers. TFs recognize short, but specific binding sites (TFBSs) that are located within the promoter and enhancer regions. Functionally relevant TFBSs are often highly conserved during evolution leaving a strong phylogenetic signal. While multiple sequence alignment (MSA) is a potent tool to detect the phylogenetic signal, the current MSA implementations are optimized to align the maximum number of identical nucleotides. This approach might result in the omission of conserved motifs that contain interchangeable nucleotides such as the ETS motif (IUPAC code: GGAW). Here, we introduce ConBind, a novel method to enhance alignment of short motifs, even if their mutual sequence similarity is only partial. ConBind improves the identification of conserved TFBSs by improving the alignment accuracy of TFBS families within orthologous DNA sequences. Functional validation of the Gfi1b + 13 enhancer reveals that ConBind identifies additional functionally important ETS binding sites that were missed by all other tested alignment tools. In addition to the analysis of known regulatory regions, our web tool is useful for the analysis of TFBSs on so far unknown DNA regions identified through ChIP-sequencing.

Citing Articles

The Dynamic Changes of Transcription Factors During the Development Processes of Human Biparental and Uniparental Embryos.

Zhang C, Li C, Yang L, Leng L, Jovic D, Wang J Front Cell Dev Biol. 2021; 9:709498.

PMID: 34604214 PMC: 8484909. DOI: 10.3389/fcell.2021.709498.


Refining pairwise sequence alignments of membrane proteins by the incorporation of anchors.

Staritzbichler R, Sarti E, Yaklich E, Aleksandrova A, Stamm M, Khafizov K PLoS One. 2021; 16(4):e0239881.

PMID: 33930031 PMC: 8087094. DOI: 10.1371/journal.pone.0239881.


Tailor-made multiple sequence alignments using the PRALINE 2 alignment toolkit.

Dijkstra M, van der Ploeg A, Feenstra K, Fokkink W, Abeln S, Heringa J Bioinformatics. 2019; 35(24):5315-5317.

PMID: 31368486 PMC: 6954659. DOI: 10.1093/bioinformatics/btz572.


Motif-Aware PRALINE: Improving the alignment of motif regions.

Dijkstra M, Bawono P, Abeln S, Feenstra K, Fokkink W, Heringa J PLoS Comput Biol. 2018; 14(11):e1006547.

PMID: 30383764 PMC: 6233922. DOI: 10.1371/journal.pcbi.1006547.


An experimentally validated network of nine haematopoietic transcription factors reveals mechanisms of cell state stability.

Schutte J, Wang H, Antoniou S, Jarratt A, Wilson N, Riepsaame J Elife. 2016; 5:e11469.

PMID: 26901438 PMC: 4798972. DOI: 10.7554/eLife.11469.

References
1.
Soldaini E, John S, Moro S, Bollenbacher J, Schindler U, Leonard W . DNA binding site selection of dimeric and tetrameric Stat5 proteins reveals a large repertoire of divergent tetrameric Stat5a binding sites. Mol Cell Biol. 1999; 20(1):389-401. PMC: 85094. DOI: 10.1128/MCB.20.1.389-401.2000. View

2.
Bawono P, Heringa J . PRALINE: a versatile multiple sequence alignment toolkit. Methods Mol Biol. 2013; 1079:245-62. DOI: 10.1007/978-1-62703-646-7_16. View

3.
Notredame C, Higgins D, Heringa J . T-Coffee: A novel method for fast and accurate multiple sequence alignment. J Mol Biol. 2000; 302(1):205-17. DOI: 10.1006/jmbi.2000.4042. View

4.
Ikeda Y, Yamamoto J, Okamura M, Fujino T, Takahashi S, Takeuchi K . Transcriptional regulation of the murine acetyl-CoA synthetase 1 gene through multiple clustered binding sites for sterol regulatory element-binding proteins and a single neighboring site for Sp1. J Biol Chem. 2001; 276(36):34259-69. DOI: 10.1074/jbc.M103848200. View

5.
Kent W, Sugnet C, Furey T, Roskin K, Pringle T, Zahler A . The human genome browser at UCSC. Genome Res. 2002; 12(6):996-1006. PMC: 186604. DOI: 10.1101/gr.229102. View