» Articles » PMID: 27507169

An Integrative and Applicable Phylogenetic Footprinting Framework for Cis-regulatory Motifs Identification in Prokaryotic Genomes

Overview
Journal BMC Genomics
Publisher Biomed Central
Specialty Genetics
Date 2016 Aug 11
PMID 27507169
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction.

Results: Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes.

Conclusion: The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance progress in elucidating transcription regulation mechanism, thus provide benefit to the genomic research community and prokaryotic genome researchers in particular.

Citing Articles

The Transcriptomic Response of Cells of the Thermophilic Bacterium to Terahertz Irradiation.

Peltek S, Bannikova S, Khlebodarova T, Uvarova Y, Mukhin A, Vasiliev G Int J Mol Sci. 2024; 25(22).

PMID: 39596128 PMC: 11594194. DOI: 10.3390/ijms252212059.


Integrating genome sequence and structural data for statistical learning to predict transcription factor binding sites.

Long P, Zhang L, Huang B, Chen Q, Liu H Nucleic Acids Res. 2020; 48(22):12604-12617.

PMID: 33264415 PMC: 7736823. DOI: 10.1093/nar/gkaa1134.


RhizoBindingSites, a Database of DNA-Binding Motifs in Nitrogen-Fixing Bacteria Inferred Using a Footprint Discovery Approach.

Taboada-Castro H, Castro-Mondragon J, Aguilar-Vera A, Hernandez-Alvarez A, van Helden J, Encarnacion-Guevara S Front Microbiol. 2020; 11:567471.

PMID: 33250866 PMC: 7674921. DOI: 10.3389/fmicb.2020.567471.


Perspectives of CRISPR/Cas-mediated -engineering in horticulture: unlocking the neglected potential for crop improvement.

Li Q, Sapkota M, Knaap E Hortic Res. 2020; 7:36.

PMID: 32194972 PMC: 7072075. DOI: 10.1038/s41438-020-0258-8.


Genome-scale exploration of transcriptional regulation in the nisin Z producer Lactococcus lactis subsp. lactis IO-1.

Poorinmohammad N, Hamedi J, Masoudi-Nejad A Sci Rep. 2020; 10(1):3787.

PMID: 32123183 PMC: 7051946. DOI: 10.1038/s41598-020-59731-8.


References
1.
Gruber T, Gross C . Multiple sigma subunits and the partitioning of bacterial transcription space. Annu Rev Microbiol. 2003; 57:441-66. DOI: 10.1146/annurev.micro.57.030502.090913. View

2.
McCue L, Thompson W, Carmack C, Ryan M, Liu J, Derbyshire V . Phylogenetic footprinting of transcription factor binding sites in proteobacterial genomes. Nucleic Acids Res. 2001; 29(3):774-82. PMC: 30389. DOI: 10.1093/nar/29.3.774. View

3.
Tagle D, Koop B, Goodman M, Slightom J, Hess D, Jones R . Embryonic epsilon and gamma globin genes of a prosimian primate (Galago crassicaudatus). Nucleotide and amino acid sequences, developmental regulation and phylogenetic footprints. J Mol Biol. 1988; 203(2):439-55. DOI: 10.1016/0022-2836(88)90011-3. View

4.
Wang T, Stormo G . Combining phylogenetic data with co-regulated genes to identify regulatory motifs. Bioinformatics. 2003; 19(18):2369-80. DOI: 10.1093/bioinformatics/btg329. View

5.
Manson McGuire A, Church G . Predicting regulons and their cis-regulatory motifs by comparative genomics. Nucleic Acids Res. 2000; 28(22):4523-30. PMC: 113887. DOI: 10.1093/nar/28.22.4523. View