» Articles » PMID: 30753697

DeepRibo: a Neural Network for Precise Gene Annotation of Prokaryotes by Combining Ribosome Profiling Signal and Binding Site Patterns

Overview
Specialty Biochemistry
Date 2019 Feb 13
PMID 30753697
Citations 28
Authors
Affiliations
Soon will be listed here.
Abstract

Annotation of gene expression in prokaryotes often finds itself corrected due to small variations of the annotated gene regions observed between different (sub)-species. It has become apparent that traditional sequence alignment algorithms, used for the curation of genomes, are not able to map the full complexity of the genomic landscape. We present DeepRibo, a novel neural network utilizing features extracted from ribosome profiling information and binding site sequence patterns that shows to be a precise tool for the delineation and annotation of expressed genes in prokaryotes. The neural network combines recurrent memory cells and convolutional layers, adapting the information gained from both the high-throughput ribosome profiling data and ribosome binding translation initiation sequence region into one model. DeepRibo is designed as a single model trained on a variety of ribosome profiling experiments, used for the identification of open reading frames in prokaryotes without a priori knowledge of the translational landscape. Through extensive validation of the model trained on various sets of data, multiple species sequence similarity, mass spectrometry and Edman degradation verified proteins, the effectiveness of DeepRibo is highlighted.

Citing Articles

The hidden bacterial microproteome.

Fesenko I, Sahakyan H, Dhyani R, Shabalina S, Storz G, Koonin E Mol Cell. 2025; 85(5):1024-1041.e6.

PMID: 39978337 PMC: 11890958. DOI: 10.1016/j.molcel.2025.01.025.


Principles, challenges, and advances in ribosome profiling: from bulk to low-input and single-cell analysis.

Wang Q, Mao Y Adv Biotechnol (Singap). 2025; 1(4):6.

PMID: 39883220 PMC: 11727582. DOI: 10.1007/s44307-023-00006-4.


Uncovering the small proteome of Methanosarcina mazei using Ribo-seq and peptidomics under different nitrogen conditions.

Tufail M, Jordan B, Hadjeras L, Gelhausen R, Cassidy L, Habenicht T Nat Commun. 2024; 15(1):8659.

PMID: 39370430 PMC: 11456600. DOI: 10.1038/s41467-024-53008-8.


A Comprehensive Review of Bioinformatics Tools for Genomic Biomarker Discovery Driving Precision Oncology.

Clark A, Lillard Jr J Genes (Basel). 2024; 15(8).

PMID: 39202397 PMC: 11353282. DOI: 10.3390/genes15081036.


The Cryptic Bacterial Microproteome.

Fesenko I, Sahakyan H, Shabalina S, Koonin E bioRxiv. 2024; .

PMID: 38903115 PMC: 11188072. DOI: 10.1101/2024.02.17.580829.


References
1.
Staes A, Impens F, Van Damme P, Ruttens B, Goethals M, Demol H . Selecting protein N-terminal peptides by combined fractional diagonal chromatography. Nat Protoc. 2011; 6(8):1130-41. DOI: 10.1038/nprot.2011.355. View

2.
Bazzini A, Johnstone T, Christiano R, Mackowiak S, Obermayer B, Fleming E . Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation. EMBO J. 2014; 33(9):981-93. PMC: 4193932. DOI: 10.1002/embj.201488411. View

3.
Lutz R, Stahel W, Lutz W . Statistical procedures to test for linearity and estimate threshold doses for tumor induction with nonlinear dose-response relationships in bioassays for carcinogenicity. Regul Toxicol Pharmacol. 2002; 36(3):331-7. DOI: 10.1006/rtph.2002.1583. View

4.
Davis A, Gohara D, Yap M . Sequence selectivity of macrolide-induced translational attenuation. Proc Natl Acad Sci U S A. 2014; 111(43):15379-84. PMC: 4217412. DOI: 10.1073/pnas.1410356111. View

5.
Alipanahi B, Delong A, Weirauch M, Frey B . Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat Biotechnol. 2015; 33(8):831-8. DOI: 10.1038/nbt.3300. View