» Articles » PMID: 34050351

TSSFinder-fast and Accurate Ab Initio Prediction of the Core Promoter in Eukaryotic Genomes

Overview
Journal Brief Bioinform
Specialty Biology
Date 2021 May 29
PMID 34050351
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Promoter annotation is an important task in the analysis of a genome. One of the main challenges for this task is locating the border between the promoter region and the transcribing region of the gene, the transcription start site (TSS). The TSS is the reference point to delimit the DNA sequence responsible for the assembly of the transcribing complex. As the same gene can have more than one TSS, so to delimit the promoter region, it is important to locate the closest TSS to the site of the beginning of the translation. This paper presents TSSFinder, a new software for the prediction of the TSS signal of eukaryotic genes that is significantly more accurate than other available software. We currently are the only application to offer pre-trained models for six different eukaryotic organisms: Arabidopsis thaliana, Drosophila melanogaster, Gallus gallus, Homo sapiens, Oryza sativa and Saccharomyces cerevisiae. Additionally, our software can be easily customized for specific organisms using only 125 DNA sequences with a validated TSS signal and corresponding genomic locations as a training set. TSSFinder is a valuable new tool for the annotation of genomes. TSSFinder source code and docker container can be downloaded from http://tssfinder.github.io. Alternatively, TSSFinder is also available as a web service at http://sucest-fun.org/wsapp/tssfinder/.

Citing Articles

Noncanonical transcription initiation is primarily tissue specific and epigenetically tuned in paleopolyploid plants.

Wang X, Duan J, Clark C, Feng W, Ma J Plant Cell. 2024; 37(1).

PMID: 39540911 PMC: 11663555. DOI: 10.1093/plcell/koae288.


Classification of Promoter Sequences from Human Genome.

Zaytsev K, Fedorov A, Korotkov E Int J Mol Sci. 2023; 24(16).

PMID: 37628742 PMC: 10454140. DOI: 10.3390/ijms241612561.


A mini-TGA protein modulates gene expression through heterogeneous association with transcription factors.

Tomaz S, Petek M, Lukan T, Pogacar K, Stare K, Teixeira Prates E Plant Physiol. 2022; 191(3):1934-1952.

PMID: 36517238 PMC: 10022624. DOI: 10.1093/plphys/kiac579.


Database of Potential Promoter Sequences in the Genome.

Rudenko V, Korotkov E Biology (Basel). 2022; 11(8).

PMID: 35892972 PMC: 9332048. DOI: 10.3390/biology11081117.


Sequence-based evaluation of promoter context for prediction of transcription start sites in Arabidopsis and rice.

Hiratsuka T, Makita Y, Y Yamamoto Y Sci Rep. 2022; 12(1):6976.

PMID: 35484393 PMC: 9050755. DOI: 10.1038/s41598-022-11169-w.


References
1.
Bernal A, Crammer K, Pereira F . Automated gene-model curation using global discriminative learning. Bioinformatics. 2012; 28(12):1571-8. DOI: 10.1093/bioinformatics/bts176. View

2.
Mejia-Guerra M, Li W, Galeano N, Vidal M, Gray J, Doseff A . Core Promoter Plasticity Between Maize Tissues and Genotypes Contrasts with Predominance of Sharp Transcription Initiation Sites. Plant Cell. 2015; 27(12):3309-20. PMC: 4707454. DOI: 10.1105/tpc.15.00630. View

3.
Li F, Chen J, Ge Z, Wen Y, Yue Y, Hayashida M . Computational prediction and interpretation of both general and specific types of promoters in Escherichia coli by exploiting a stacked ensemble-learning framework. Brief Bioinform. 2020; 22(2):2126-2140. PMC: 7986616. DOI: 10.1093/bib/bbaa049. View

4.
Morton T, Petricka J, Corcoran D, Li S, Winter C, Carda A . Paired-end analysis of transcription start sites in Arabidopsis reveals plant-specific promoter signatures. Plant Cell. 2014; 26(7):2746-60. PMC: 4145111. DOI: 10.1105/tpc.114.125617. View

5.
Lenhard B, Sandelin A, Carninci P . Metazoan promoters: emerging characteristics and insights into transcriptional regulation. Nat Rev Genet. 2012; 13(4):233-45. DOI: 10.1038/nrg3163. View