» Articles » PMID: 12582243

Evolution of Transcription Factors and the Gene Regulatory Network in Escherichia Coli

Overview
Specialty Biochemistry
Date 2003 Feb 13
PMID 12582243
Citations 124
Authors
Affiliations
Soon will be listed here.
Abstract

The most detailed information presently available for an organism's transcriptional regulation network is that for the prokaryote Escherichia coli. In order to gain insight into the evolution of the E.coli regulatory network, we analysed information obtainable for the domains and protein families of the transcription factors and regulated genes. About three-quarters of the 271 transcription factors we identified are two-domain proteins, consisting of a DNA-binding domain along with a regulatory domain. The regulatory domains mainly bind small molecules. Many groups of transcription factors have identical domain architectures, and this implies that roughly three-quarters of the transcription factors have arisen as a consequence of gene duplication. In contrast, there is little evidence of duplication of regulatory regions together with regulated genes or of transcription factors together with regulated genes. Thirty-eight, out of the 121 transcription factors for which one or more regulated genes are known, regulate other transcription factors. This amplification effect, as well as large differences between the numbers of genes directly regulated by transcription factors, means that there are about 10 global regulators which each control many more genes than the other transcription factors.

Citing Articles

Diversity of Transcriptional Regulatory Adaptation in E. coli.

Dalldorf C, Hefner Y, Szubin R, Johnsen J, Mohamed E, Li G Mol Biol Evol. 2024; 41(11).

PMID: 39531644 PMC: 11588850. DOI: 10.1093/molbev/msae240.


Structure classification of the proteins from Salmonella enterica pangenome revealed novel potential pathogenicity islands.

Medvedev K, Zhang J, Schaeffer R, Kinch L, Cong Q, Grishin N Sci Rep. 2024; 14(1):12260.

PMID: 38806511 PMC: 11133325. DOI: 10.1038/s41598-024-60991-x.


GNNMF: a multi-view graph neural network for ATAC-seq motif finding.

Zhang S, Wu X, Lian Z, Zuo C, Wang Y BMC Genomics. 2024; 25(1):300.

PMID: 38515040 PMC: 10956247. DOI: 10.1186/s12864-024-10218-0.


Synonymous edits in the genome have substantial and condition-dependent effects on fitness.

Yang D, Rusch L, Widney K, Morgenthaler A, Copley S Proc Natl Acad Sci U S A. 2024; 121(5):e2316834121.

PMID: 38252823 PMC: 10835057. DOI: 10.1073/pnas.2316834121.


Data mining and mathematical models in cancer prognosis and prediction.

Yu C, Wang J Med Rev (2021). 2023; 2(3):285-307.

PMID: 37724193 PMC: 10388766. DOI: 10.1515/mr-2021-0026.


References
1.
Madera M, Gough J . A comparison of profile hidden Markov model procedures for remote homology detection. Nucleic Acids Res. 2002; 30(19):4321-8. PMC: 140544. DOI: 10.1093/nar/gkf544. View

2.
Martin R, Gillette W, Martin N, Rosner J . Complex formation between activator and RNA polymerase as the basis for transcriptional activation by MarA and SoxS in Escherichia coli. Mol Microbiol. 2002; 43(2):355-70. DOI: 10.1046/j.1365-2958.2002.02748.x. View

3.
Perez-Rueda E, Collado-Vides J . The repertoire of DNA-binding transcriptional regulators in Escherichia coli K-12. Nucleic Acids Res. 2000; 28(8):1838-47. PMC: 102813. DOI: 10.1093/nar/28.8.1838. View

4.
Kwon H, Bennik M, Demple B, Ellenberger T . Crystal structure of the Escherichia coli Rob transcription factor in complex with DNA. Nat Struct Biol. 2000; 7(5):424-30. DOI: 10.1038/75213. View

5.
Tatusov R, Natale D, Garkavtsev I, Tatusova T, Shankavaram U, Rao B . The COG database: new developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 2000; 29(1):22-8. PMC: 29819. DOI: 10.1093/nar/29.1.22. View