
BeRBP: Binding Estimation for Human RNA-binding Proteins

Overview
Specialty: Biochemistry
Date: 2018 Dec 28
PMID: 30590704
Citations: 23
Abstract

Identifying the binding targets of RNA-binding proteins (RBPs) can greatly facilitate our understanding of their functional mechanisms. Most computational methods employ machine learning to train classifiers on either RBP-specific targets or pooled RBP-RNA interactions. The former strategy is more powerful, but it applies only to the few RBPs with a large number of known targets; conversely, the latter strategy sacrifices prediction accuracy for wider applicability, since specific interaction features are inevitably obscured by pooling heterogeneous datasets. Here, we present beRBP, a dual approach that predicts human RBP-RNA interaction given the position weight matrix (PWM) of an RBP and one RNA sequence. Based on Random Forests, beRBP not only builds a specific model for each RBP with a sufficient number of known targets, but also develops a general model for RBPs with few or no known targets. The specific and general models both compared well with existing methods on three benchmark datasets. Notably, the general model outperformed existing methods on most novel RBPs. Overall, as a composite solution spanning the RBP-specific and RBP-general strategies, beRBP is a promising tool for human RBP binding estimation with good prediction accuracy and a broad application scope.
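
To make the input side of the abstract concrete, the following is a minimal sketch of how a PWM can be scored against an RNA sequence. It is not the beRBP implementation and does not reproduce its features or its Random Forest models. The matrix `PWM`, the uniform `BACKGROUND` frequency, and the function names `window_score` and `best_pwm_match` are all hypothetical, chosen only for illustration. The sketch slides the matrix along the sequence and reports the best log-odds match, which is one common way a motif score could be derived before being passed to a classifier.

```python
import math

# Hypothetical 4-position PWM (an assumption, not taken from beRBP):
# each column gives the probability of each base at that motif position.
PWM = [
    {"A": 0.05, "C": 0.05, "G": 0.05, "U": 0.85},
    {"A": 0.05, "C": 0.05, "G": 0.85, "U": 0.05},
    {"A": 0.05, "C": 0.85, "G": 0.05, "U": 0.05},
    {"A": 0.85, "C": 0.05, "G": 0.05, "U": 0.05},
]

# Assumed uniform background frequency for the four bases.
BACKGROUND = 0.25


def window_score(pwm, window):
    """Sum of per-position log2 odds (PWM probability over background)."""
    return sum(math.log2(col[base] / BACKGROUND) for col, base in zip(pwm, window))


def best_pwm_match(pwm, rna):
    """Return (score, start) of the highest-scoring window in the sequence.

    DNA-style input is converted to RNA (T -> U). Bases outside A/C/G/U
    are not handled and raise a KeyError.
    """
    rna = rna.upper().replace("T", "U")
    width = len(pwm)
    if len(rna) < width:
        raise ValueError("sequence is shorter than the motif")
    scored = [
        (window_score(pwm, rna[i:i + width]), i)
        for i in range(len(rna) - width + 1)
    ]
    return max(scored)


if __name__ == "__main__":
    score, start = best_pwm_match(PWM, "AAUGCAGG")
    print(f"best window starts at {start} with log-odds score {score:.3f}")
```

In a pipeline of the kind the abstract describes, a score like this would be only one candidate feature; which features beRBP actually uses is not stated in this record.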

Citing Articles

RBPsuite 2.0: an updated RNA-protein binding site prediction suite with high coverage on species and proteins based on deep learning.

Pan X, Fang Y, Liu X, Guo X, Shen H. BMC Biol. 2025; 23(1):74.

PMID: 40069726 PMC: 11899677. DOI: 10.1186/s12915-025-02182-2.


DeepMiRBP: a hybrid model for predicting microRNA-protein interactions based on transfer learning and cosine similarity.

Azizian S, Cui J. BMC Bioinformatics. 2024; 25(1):381.

PMID: 39695955 PMC: 11656930. DOI: 10.1186/s12859-024-05985-2.


RNA-protein interaction prediction without high-throughput data: An overview and benchmark of tools.

Krautwurst S, Lamkiewicz K. Comput Struct Biotechnol J. 2024; 23:4036-4046.

PMID: 39610906 PMC: 11603007. DOI: 10.1016/j.csbj.2024.11.015.


C2CDB: an advanced platform integrating comprehensive information and analysis tools of cancer-related circRNAs.

Zuo Y, Liu W, Jin Y, Pan Y, Fan T, Fu X. Bioinform Adv. 2024; 4(1):vbae112.

PMID: 39246384 PMC: 11379471. DOI: 10.1093/bioadv/vbae112.


EVPsort: An Atlas of Small ncRNA Profiling and Sorting in Extracellular Vesicles and Particles.

Chen H, Wang J, Coffey R, Patton J, Weaver A, Shyr Y. J Mol Biol. 2024; 436(17):168571.

PMID: 38604528 PMC: 11574917. DOI: 10.1016/j.jmb.2024.168571.

