» Articles » PMID: 35671296

Identification of Upstream Transcription Factor Binding Sites in Orthologous Genes Using Mixed Student's T-test Statistics

Overview
Specialty Biology
Date 2022 Jun 7
PMID 35671296
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Transcription factor (TF) regulates the transcription of DNA to messenger RNA by binding to upstream sequence motifs. Identifying the locations of known motifs in whole genomes is computationally intensive.

Methodology/principal Findings: This study presents a computational tool, named "Grit", for screening TF-binding sites (TFBS) by coordinating transcription factors to their promoter sequences in orthologous genes. This tool employs a newly developed mixed Student's t-test statistical method that detects high-scoring binding sites utilizing conservation information among species. The program performs sequence scanning at a rate of 3.2 Mbp/s on a quad-core Amazon server and has been benchmarked by the well-established ChIP-Seq datasets, putting Grit amongst the top-ranked TFBS predictors. It significantly outperforms the well-known transcription factor motif scanning tools, Pscan (4.8%) and FIMO (17.8%), in analyzing well-documented ChIP-Atlas human genome Chip-Seq datasets.

Significance: Grit is a good alternative to current available motif scanning tools.

Citing Articles

Ornithine decarboxylase antizyme 2 (OAZ2) in human colon adenocarcinoma: a potent prognostic factor associated with immunity.

Liu Y, Zhang S, Liao W, Qian J, Lu C, Jin L Sci Rep. 2025; 15(1):7481.

PMID: 40032914 PMC: 11876682. DOI: 10.1038/s41598-025-90066-4.


Correlating gene expression levels with transcription factor binding sites facilitates identification of key transcription factors from transcriptome data.

Huang T, Niu S, Zhang F, Wang B, Wang J, Liu G Front Genet. 2024; 15:1511456.

PMID: 39678374 PMC: 11638204. DOI: 10.3389/fgene.2024.1511456.


Testing the Significance of Ranked Gene Sets in Genome-wide Transcriptome .

Yao M, He H, Wang B, Huang X, Zheng S, Wang J Curr Genomics. 2024; 25(3):202-211.

PMID: 39086999 PMC: 11288161. DOI: 10.2174/0113892029280470240306044159.

References
1.
Warner J, Philippakis A, Jaeger S, He F, Lin J, Bulyk M . Systematic identification of mammalian regulatory motifs' target genes and functions. Nat Methods. 2008; 5(4):347-53. PMC: 2708972. DOI: 10.1038/nmeth.1188. View

2.
Schneider T, Stormo G, Gold L, Ehrenfeucht A . Information content of binding sites on nucleotide sequences. J Mol Biol. 1986; 188(3):415-31. DOI: 10.1016/0022-2836(86)90165-8. View

3.
Fornes O, Castro-Mondragon J, Khan A, van der Lee R, Zhang X, Richmond P . JASPAR 2020: update of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 2019; 48(D1):D87-D92. PMC: 7145627. DOI: 10.1093/nar/gkz1001. View

4.
Elkon R, Linhart C, Sharan R, Shamir R, Shiloh Y . Genome-wide in silico identification of transcriptional regulators controlling the cell cycle in human cells. Genome Res. 2003; 13(5):773-80. PMC: 430898. DOI: 10.1101/gr.947203. View

5.
Oki S, Ohta T, Shioi G, Hatanaka H, Ogasawara O, Okuda Y . ChIP-Atlas: a data-mining suite powered by full integration of public ChIP-seq data. EMBO Rep. 2018; 19(12). PMC: 6280645. DOI: 10.15252/embr.201846255. View