Identification of Upstream Transcription Factor Binding Sites in Orthologous Genes Using Mixed Student's T-test Statistics
Overview
Affiliations
Background: Transcription factor (TF) regulates the transcription of DNA to messenger RNA by binding to upstream sequence motifs. Identifying the locations of known motifs in whole genomes is computationally intensive.
Methodology/principal Findings: This study presents a computational tool, named "Grit", for screening TF-binding sites (TFBS) by coordinating transcription factors to their promoter sequences in orthologous genes. This tool employs a newly developed mixed Student's t-test statistical method that detects high-scoring binding sites utilizing conservation information among species. The program performs sequence scanning at a rate of 3.2 Mbp/s on a quad-core Amazon server and has been benchmarked by the well-established ChIP-Seq datasets, putting Grit amongst the top-ranked TFBS predictors. It significantly outperforms the well-known transcription factor motif scanning tools, Pscan (4.8%) and FIMO (17.8%), in analyzing well-documented ChIP-Atlas human genome Chip-Seq datasets.
Significance: Grit is a good alternative to current available motif scanning tools.
Liu Y, Zhang S, Liao W, Qian J, Lu C, Jin L Sci Rep. 2025; 15(1):7481.
PMID: 40032914 PMC: 11876682. DOI: 10.1038/s41598-025-90066-4.
Huang T, Niu S, Zhang F, Wang B, Wang J, Liu G Front Genet. 2024; 15:1511456.
PMID: 39678374 PMC: 11638204. DOI: 10.3389/fgene.2024.1511456.
Testing the Significance of Ranked Gene Sets in Genome-wide Transcriptome .
Yao M, He H, Wang B, Huang X, Zheng S, Wang J Curr Genomics. 2024; 25(3):202-211.
PMID: 39086999 PMC: 11288161. DOI: 10.2174/0113892029280470240306044159.