» Articles » PMID: 33979731

High Precision in MicroRNA Prediction: A Novel Genome-wide Approach with Convolutional Deep Residual Networks

Overview
Journal Comput Biol Med
Publisher Elsevier
Date 2021 May 12
PMID 33979731
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

MicroRNAs (miRNAs) are small non-coding RNAs that have a key role in the regulation of gene expression. The importance of miRNAs is widely acknowledged by the community nowadays and computational methods are needed for the precise prediction of novel candidates to miRNA. This task can be done by searching homologous with sequence alignment tools, but results are restricted to sequences that are very similar to the known miRNA precursors (pre-miRNAs). Besides, a very important property of pre-miRNAs, their secondary structure, is not taken into account by these methods. To fill this gap, many machine learning approaches were proposed in the last years. However, the methods are generally tested in very controlled conditions. If these methods were used under real conditions, the false positives increase and the precisions fall quite below those published. This work provides a novel approach for dealing with the computational prediction of pre-miRNAs: a convolutional deep residual neural network (mirDNN). This model was tested with several genomes of animals and plants, the full-genomes, achieving a precision up to 5 times larger than other approaches at the same recall rates. Furthermore, a novel validation methodology was used to ensure that the performance reported in this study can be effectively achieved when using mirDNN in novel species. To provide fast an easy access to mirDNN, a web demo is available at http://sinc.unl.edu.ar/web-demo/mirdnn/. The demo can process FASTA files with multiple sequences to calculate the prediction scores and generates the nucleotide importance plots. FULL SOURCE CODE: http://sourceforge.net/projects/sourcesinc/files/mirdnn and https://github.com/cyones/mirDNN. CONTACT: gstegmayer@sinc.unl.edu.ar.

Citing Articles

pmiRScan: a LightGBM based method for prediction of animal pre-miRNAs.

Venkatesan A, Basak J, Bahadur R Funct Integr Genomics. 2025; 25(1):9.

PMID: 39786653 DOI: 10.1007/s10142-025-01527-y.


Description Generation Using Variational Auto-Encoders for Precursor microRNA.

Petkovic M, Menkovski V Entropy (Basel). 2024; 26(11).

PMID: 39593866 PMC: 11592592. DOI: 10.3390/e26110921.


ACP-GBDT: An improved anticancer peptide identification method with gradient boosting decision tree.

Li Y, Ma D, Chen D, Chen Y Front Genet. 2023; 14:1165765.

PMID: 37065496 PMC: 10090421. DOI: 10.3389/fgene.2023.1165765.


Benchmarking machine learning robustness in Covid-19 genome sequence classification.

Ali S, Sahoo B, Zelikovsky A, Chen P, Patterson M Sci Rep. 2023; 13(1):4154.

PMID: 36914815 PMC: 10010240. DOI: 10.1038/s41598-023-31368-3.


A comparison of contributions of individual muscle and combination muscles to interaction force prediction using KPCA-DRSN model.

Lu W, Gao L, Cao H, Li Z, Wang D Front Bioeng Biotechnol. 2022; 10:970859.

PMID: 36159693 PMC: 9491850. DOI: 10.3389/fbioe.2022.970859.