» Articles » PMID: 21172046

MiRenSVM: Towards Better Prediction of MicroRNA Precursors Using an Ensemble SVM Classifier with Multi-loop Features

Overview
Publisher Biomed Central
Specialty Biology
Date 2010 Dec 22
PMID 21172046
Citations 36
Authors
Affiliations
Soon will be listed here.
Abstract

Background: MicroRNAs (simply miRNAs) are derived from larger hairpin RNA precursors and play essential regular roles in both animals and plants. A number of computational methods for miRNA genes finding have been proposed in the past decade, yet the problem is far from being tackled, especially when considering the imbalance issue of known miRNAs and unidentified miRNAs, and the pre-miRNAs with multi-loops or higher minimum free energy (MFE). This paper presents a new computational approach, miRenSVM, for finding miRNA genes. Aiming at better prediction performance, an ensemble support vector machine (SVM) classifier is established to deal with the imbalance issue, and multi-loop features are included for identifying those pre-miRNAs with multi-loops.

Results: We collected a representative dataset, which contains 697 real miRNA precursors identified by experimental procedure and other computational methods, and 5428 pseudo ones from several datasets. Experiments showed that our miRenSVM achieved a 96.5% specificity and a 93.05% sensitivity on the dataset. Compared with the state-of-the-art approaches, miRenSVM obtained better prediction results. We also applied our method to predict 14 Homo sapiens pre-miRNAs and 13 Anopheles gambiae pre-miRNAs that first appeared in miRBase13.0, MiRenSVM got a 100% prediction rate. Furthermore, performance evaluation was conducted over 27 additional species in miRBase13.0, and 92.84% (4863/5238) animal pre-miRNAs were correctly identified by miRenSVM.

Conclusion: MiRenSVM is an ensemble support vector machine (SVM) classification system for better detecting miRNA genes, especially those with multi-loop secondary structure.

Citing Articles

Description Generation Using Variational Auto-Encoders for Precursor microRNA.

Petkovic M, Menkovski V Entropy (Basel). 2024; 26(11).

PMID: 39593866 PMC: 11592592. DOI: 10.3390/e26110921.


Inferring microRNA regulation: A proteome perspective.

Ofer D, Linial M Front Mol Biosci. 2022; 9:916639.

PMID: 36158574 PMC: 9493312. DOI: 10.3389/fmolb.2022.916639.


Hybrid Deep Neural Network for Handling Data Imbalance in Precursor MicroRNA.

R E, Jain D, Kotecha K, Pandya S, Reddy S, E R Front Public Health. 2022; 9:821410.

PMID: 35004605 PMC: 8733243. DOI: 10.3389/fpubh.2021.821410.


A systematic review of the application of machine learning in the detection and classification of transposable elements.

Orozco-Arias S, Isaza G, Guyot R, Tabares-Soto R PeerJ. 2020; 7:e8311.

PMID: 31976169 PMC: 6967008. DOI: 10.7717/peerj.8311.


Development of species specific putative miRNA and its target prediction tool in wheat (Triticum aestivum L.).

Jaiswal S, Iquebal M, Arora V, Sheoran S, Sharma P, Angadi U Sci Rep. 2019; 9(1):3790.

PMID: 30846812 PMC: 6405928. DOI: 10.1038/s41598-019-40333-y.


References
1.
Lim L, Lau N, Weinstein E, Abdelhakim A, Yekta S, Rhoades M . The microRNAs of Caenorhabditis elegans. Genes Dev. 2003; 17(8):991-1008. PMC: 196042. DOI: 10.1101/gad.1074403. View

2.
Jiang P, Wu H, Wang W, Ma W, Sun X, Lu Z . MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features. Nucleic Acids Res. 2007; 35(Web Server issue):W339-44. PMC: 1933124. DOI: 10.1093/nar/gkm368. View

3.
Terai G, Komori T, Asai K, Kin T . miRRim: a novel system to find conserved miRNAs with high sensitivity and specificity. RNA. 2007; 13(12):2081-90. PMC: 2080609. DOI: 10.1261/rna.655107. View

4.
Grad Y, Aach J, Hayes G, Reinhart B, Church G, Ruvkun G . Computational and experimental identification of C. elegans microRNAs. Mol Cell. 2003; 11(5):1253-63. DOI: 10.1016/s1097-2765(03)00153-9. View

5.
Sheng Y, Engstrom P, Lenhard B . Mammalian microRNA prediction through a support vector machine model of sequence and structure. PLoS One. 2007; 2(9):e946. PMC: 1978525. DOI: 10.1371/journal.pone.0000946. View