» Articles » PMID: 9283751

A Method for Identifying Splice Sites and Translational Start Sites in Eukaryotic MRNA

Overview
Date 1997 Aug 1
PMID 9283751
Citations 25
Authors
Affiliations
Soon will be listed here.
Abstract

This paper describes a new method for determining the consensus sequences that signal the start of translation and the boundaries between exons and introns (donor and acceptor sites) in eukaryotic mRNA. The method takes into account the dependencies between adjacent bases, in contrast to the usual technique of considering each position independently. When coupled with a dynamic program to compute the most likely sequence, new consensus sequences emerge. The consensus sequence information is summarized in conditional probability matrices which, when used to locate signals in uncharacterized genomic DNA, have greater sensitivity and specificity than conventional matrices. Species-specific versions of these matrices are especially effective at distinguishing true and false sites.

Citing Articles

From computational models of the splicing code to regulatory mechanisms and therapeutic implications.

Capitanchik C, Wilkins O, Wagner N, Gagneur J, Ule J Nat Rev Genet. 2024; 26(3):171-190.

PMID: 39358547 DOI: 10.1038/s41576-024-00774-2.


From shallow to deep: some lessons learned from application of machine learning for recognition of functional genomic elements in human genome.

Jankovic B, Gojobori T Hum Genomics. 2022; 16(1):7.

PMID: 35180894 PMC: 8855580. DOI: 10.1186/s40246-022-00376-1.


Global sequence features based translation initiation site prediction in human genomic sequences.

Goel N, Singh S, Aseri T Heliyon. 2020; 6(9):e04825.

PMID: 32964155 PMC: 7490824. DOI: 10.1016/j.heliyon.2020.e04825.


Discovery of human sORF-encoded polypeptides (SEPs) in cell lines and tissue.

Ma J, Ward C, Jungreis I, Slavoff S, Schwaid A, Neveu J J Proteome Res. 2014; 13(3):1757-65.

PMID: 24490786 PMC: 3993966. DOI: 10.1021/pr401280w.


A general approach for discriminative de novo motif discovery from high-throughput data.

Grau J, Posch S, Grosse I, Keilwagen J Nucleic Acids Res. 2013; 41(21):e197.

PMID: 24057214 PMC: 3834837. DOI: 10.1093/nar/gkt831.