Integration of Known Transcription Factor Binding Site Information and Gene Expression Data to Advance from Co-expression to Co-regulation
Overview
Affiliations
The common approach to find co-regulated genes is to cluster genes based on gene expression. However, due to the limited information present in any dataset, genes in the same cluster might be co-expressed but not necessarily co-regulated. In this paper, we propose to integrate known transcription factor binding site information and gene expression data into a single clustering scheme. This scheme will find clusters of co-regulated genes that are not only expressed similarly under the measured conditions, but also share a regulatory structure that may explain their common regulation. We demonstrate the utility of this approach on a microarray dataset of yeast grown under different nutrient and oxygen limitations. Our integrated clustering method not only unravels many regulatory modules that are consistent with current biological knowledge, but also provides a more profound understanding of the underlying process. The added value of our approach, compared with the clustering solely based on gene expression, is its ability to uncover clusters of genes that are involved in more specific biological processes and are evidently regulated by a set of transcription factors.
Wang C, Li J, Zhou T, Zhang Y, Jin H, Liu X BMC Plant Biol. 2022; 22(1):438.
PMID: 36096752 PMC: 9469613. DOI: 10.1186/s12870-022-03794-4.
Hong J, Gunasekara C, He C, Liu S, Huang J, Wei H Sci Rep. 2021; 11(1):13174.
PMID: 34162988 PMC: 8222328. DOI: 10.1038/s41598-021-92610-4.
scLM: Automatic Detection of Consensus Gene Clusters Across Multiple Single-cell Datasets.
Song Q, Su J, Miller L, Zhang W Genomics Proteomics Bioinformatics. 2020; 19(2):330-341.
PMID: 33359676 PMC: 8602751. DOI: 10.1016/j.gpb.2020.09.002.
Evidence of widespread, independent sequence signature for transcription factor cobinding.
Zhou M, Li H, Wang X, Guan Y Genome Res. 2020; 31(2):265-278.
PMID: 33303494 PMC: 7849410. DOI: 10.1101/gr.267310.120.
Lu R, Rogan P F1000Res. 2019; 7:1933.
PMID: 31001412 PMC: 6464064. DOI: 10.12688/f1000research.17363.2.