» Articles » PMID: 17893074

Integration of Known Transcription Factor Binding Site Information and Gene Expression Data to Advance from Co-expression to Co-regulation

Overview
Specialty Biology
Date 2007 Sep 26
PMID 17893074
Citations 12
Authors
Affiliations
Soon will be listed here.
Abstract

The common approach to find co-regulated genes is to cluster genes based on gene expression. However, due to the limited information present in any dataset, genes in the same cluster might be co-expressed but not necessarily co-regulated. In this paper, we propose to integrate known transcription factor binding site information and gene expression data into a single clustering scheme. This scheme will find clusters of co-regulated genes that are not only expressed similarly under the measured conditions, but also share a regulatory structure that may explain their common regulation. We demonstrate the utility of this approach on a microarray dataset of yeast grown under different nutrient and oxygen limitations. Our integrated clustering method not only unravels many regulatory modules that are consistent with current biological knowledge, but also provides a more profound understanding of the underlying process. The added value of our approach, compared with the clustering solely based on gene expression, is its ability to uncover clusters of genes that are involved in more specific biological processes and are evidently regulated by a set of transcription factors.

Citing Articles

Transcriptional regulation of proanthocyanidin biosynthesis pathway genes and transcription factors in Indigofera stachyodes Lindl. roots.

Wang C, Li J, Zhou T, Zhang Y, Jin H, Liu X BMC Plant Biol. 2022; 22(1):438.

PMID: 36096752 PMC: 9469613. DOI: 10.1186/s12870-022-03794-4.


Identification of biological pathway and process regulators using sparse partial least squares and triple-gene mutual interaction.

Hong J, Gunasekara C, He C, Liu S, Huang J, Wei H Sci Rep. 2021; 11(1):13174.

PMID: 34162988 PMC: 8222328. DOI: 10.1038/s41598-021-92610-4.


scLM: Automatic Detection of Consensus Gene Clusters Across Multiple Single-cell Datasets.

Song Q, Su J, Miller L, Zhang W Genomics Proteomics Bioinformatics. 2020; 19(2):330-341.

PMID: 33359676 PMC: 8602751. DOI: 10.1016/j.gpb.2020.09.002.


Evidence of widespread, independent sequence signature for transcription factor cobinding.

Zhou M, Li H, Wang X, Guan Y Genome Res. 2020; 31(2):265-278.

PMID: 33303494 PMC: 7849410. DOI: 10.1101/gr.267310.120.


Transcription factor binding site clusters identify target genes with similar tissue-wide expression and buffer against mutations.

Lu R, Rogan P F1000Res. 2019; 7:1933.

PMID: 31001412 PMC: 6464064. DOI: 10.12688/f1000research.17363.2.


References
1.
Heyer L, Kruglyak S, Yooseph S . Exploring expression data: identification and analysis of coexpressed genes. Genome Res. 1999; 9(11):1106-15. PMC: 310826. DOI: 10.1101/gr.9.11.1106. View

2.
van Helden J, Andre B, Collado-Vides J . Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies. J Mol Biol. 1998; 281(5):827-42. DOI: 10.1006/jmbi.1998.1947. View

3.
Dhaeseleer P . How does gene expression clustering work?. Nat Biotechnol. 2005; 23(12):1499-501. DOI: 10.1038/nbt1205-1499. View

4.
Zhang Z, Gu J, Gu X . How much expression divergence after yeast gene duplication could be explained by regulatory motif evolution?. Trends Genet. 2004; 20(9):403-7. DOI: 10.1016/j.tig.2004.07.006. View

5.
Alon U, Barkai N, Notterman D, Gish K, Ybarra S, Mack D . Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proc Natl Acad Sci U S A. 1999; 96(12):6745-50. PMC: 21986. DOI: 10.1073/pnas.96.12.6745. View