» Articles » PMID: 31358877

Multi-view Co-training for MicroRNA Prediction

Overview
Journal Sci Rep
Specialty Science
Date 2019 Jul 31
PMID 31358877
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

MicroRNA (miRNA) are short, non-coding RNAs involved in cell regulation at post-transcriptional and translational levels. Numerous computational predictors of miRNA been developed that generally classify miRNA based on either sequence- or expression-based features. While these methods are highly effective, they require large labelled training data sets, which are often not available for many species. Simultaneously, emerging high-throughput wet-lab experimental procedures are producing large unlabelled data sets of genomic sequence and RNA expression profiles. Existing methods use supervised machine learning and are therefore unable to leverage these unlabelled data. In this paper, we design and develop a multi-view co-training approach for the classification of miRNA to maximize the utility of unlabelled training data by taking advantage of multiple views of the problem. Starting with only 10 labelled training data, co-training is shown to significantly (p < 0.01) increase classification accuracy of both sequence- and expression-based classifiers, without requiring any new labelled training data. After 11 iterations of co-training, the expression-based view of miRNA classification experiences an average increase in AUPRC of 15.81% over six species, compared to 11.90% for self-training and 4.84% for passive learning. Similar results are observed for sequence-based classifiers with increases of 46.47%, 39.53% and 29.43%, for co-training, self-training, and passive learning, respectively. The final co-trained sequence and expression-based classifiers are integrated into a final confidence-based classifier which shows improved performance compared to both the expression (1.5%, p = 0.021) and sequence (3.7%, p = 0.006) views. This study represents the first application of multi-view co-training to miRNA prediction and shows great promise, particularly for understudied species with few available training data.

Citing Articles

Enhancing severe hypoglycemia prediction in type 2 diabetes mellitus through multi-view co-training machine learning model for imbalanced dataset.

Agraz M, Deng Y, Karniadakis G, Mantzoros C Sci Rep. 2024; 14(1):22741.

PMID: 39349500 PMC: 11444036. DOI: 10.1038/s41598-024-69844-z.


Species-specific microRNA discovery and target prediction in the soybean cyst nematode.

Ajila V, Colley L, Ste-Croix D, Nissan N, Cober E, Mimee B Sci Rep. 2023; 13(1):17657.

PMID: 37848601 PMC: 10582106. DOI: 10.1038/s41598-023-44469-w.


Prostatic fluid exosome-mediated microRNA-155 promotes the pathogenesis of type IIIA chronic prostatitis.

Zhao B, Zheng J, Qiao Y, Wang Y, Luo Y, Zhang D Transl Androl Urol. 2021; 10(5):1976-1987.

PMID: 34159078 PMC: 8185664. DOI: 10.21037/tau-21-139.


CFSP: a collaborative frequent sequence pattern discovery algorithm for nucleic acid sequence classification.

Peng H PeerJ. 2020; 8:e8965.

PMID: 32341900 PMC: 7179567. DOI: 10.7717/peerj.8965.


A semi-supervised machine learning framework for microRNA classification.

Sheikh Hassani M, Green J Hum Genomics. 2019; 13(Suppl 1):43.

PMID: 31639051 PMC: 6805288. DOI: 10.1186/s40246-019-0221-7.

References
1.
Luo Q, Zhang Z, Dai Z, Basnet S, Li S, Xu B . Tumor-suppressive microRNA-195-5p regulates cell growth and inhibits cell cycle by targeting cyclin dependent kinase 8 in colon cancer. Am J Transl Res. 2016; 8(5):2088-96. PMC: 4891422. View

2.
Yones C, Stegmayer G, Milone D . Genome-wide pre-miRNA discovery from few labeled examples. Bioinformatics. 2017; 34(4):541-549. DOI: 10.1093/bioinformatics/btx612. View

3.
Hollins S, Cairns M . MicroRNA: Small RNA mediators of the brains genomic response to environmental stress. Prog Neurobiol. 2016; 143:61-81. DOI: 10.1016/j.pneurobio.2016.06.005. View

4.
Friedlander M, Mackowiak S, Li N, Chen W, Rajewsky N . miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades. Nucleic Acids Res. 2011; 40(1):37-52. PMC: 3245920. DOI: 10.1093/nar/gkr688. View

5.
Sugita S, Yoshino H, Yonemori M, Miyamoto K, Matsushita R, Sakaguchi T . Tumor‑suppressive microRNA‑223 targets WDR62 directly in bladder cancer. Int J Oncol. 2019; 54(6):2222-2236. DOI: 10.3892/ijo.2019.4762. View