» Articles » PMID: 16108712

A Class of Edit Kernels for SVMs to Predict Translation Initiation Sites in Eukaryotic MRNAs

Overview
Journal J Comput Biol
Date 2005 Aug 20
PMID 16108712
Citations 13
Authors
Affiliations
Soon will be listed here.
Abstract

The prediction of translation initiation sites (TISs) in eukaryotic mRNAs has been a challenging problem in computational molecular biology. In this paper, we present a new algorithm to recognize TISs with a very high accuracy. Our algorithm includes two novel ideas. First, we introduce a class of new sequence-similarity kernels based on string editing, called edit kernels, for use with support vector machines (SVMs) in a discriminative approach to predict TISs. The edit kernels are simple and have significant biological and probabilistic interpretations. Although the edit kernels are not positive definite, it is easy to make the kernel matrix positive definite by adjusting the parameters. Second, we convert the region of an input mRNA sequence downstream to a putative TIS into an amino acid sequence before applying SVMs to avoid the high redundancy in the genetic code. The algorithm has been implemented and tested on previously published data. Our experimental results on real mRNA data show that both ideas improve the prediction accuracy greatly and that our method performs significantly better than those based on neural networks and SVMs with polynomial kernels or Salzberg kernels.

Citing Articles

A novel kernel based approach to arbitrary length symbolic data with application to type 2 diabetes risk.

Nwegbu N, Tirunagari S, Windridge D Sci Rep. 2022; 12(1):4985.

PMID: 35322076 PMC: 8943170. DOI: 10.1038/s41598-022-08757-1.


Predicting mean ribosome load for 5'UTR of any length using deep learning.

Karollus A, Avsec Z, Gagneur J PLoS Comput Biol. 2021; 17(5):e1008982.

PMID: 33970899 PMC: 8136849. DOI: 10.1371/journal.pcbi.1008982.


Global sequence features based translation initiation site prediction in human genomic sequences.

Goel N, Singh S, Aseri T Heliyon. 2020; 6(9):e04825.

PMID: 32964155 PMC: 7490824. DOI: 10.1016/j.heliyon.2020.e04825.


Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting.

McDermott J, Cort J, Nakayasu E, Pruneda J, Overall C, Adkins J PeerJ. 2019; 7:e7055.

PMID: 31211016 PMC: 6557245. DOI: 10.7717/peerj.7055.


TITER: predicting translation initiation sites by deep learning.

Zhang S, Hu H, Jiang T, Zhang L, Zeng J Bioinformatics. 2017; 33(14):i234-i242.

PMID: 28881981 PMC: 5870772. DOI: 10.1093/bioinformatics/btx247.