» Articles » PMID: 31969902

A Novel Hybrid CNN-SVR for CRISPR/Cas9 Guide RNA Activity Prediction

Overview
Journal Front Genet
Date 2020 Jan 24
PMID 31969902
Citations 14
Authors
Affiliations
Soon will be listed here.
Abstract

Accurate prediction of guide RNA (gRNA) on-target efficacy is critical for effective application of CRISPR/Cas9 system. Although some machine learning-based and convolutional neural network (CNN)-based methods have been proposed, prediction accuracy remains to be improved. Here, firstly we improved architectures of current CNNs for predicting gRNA on-target efficacy. Secondly, we proposed a novel hybrid system which combines our improved CNN with support vector regression (SVR). This CNN-SVR system is composed of two major components: a merged CNN as the front-end for extracting gRNA feature and an SVR as the back-end for regression and predicting gRNA cleavage efficiency. We demonstrate that CNN-SVR can effectively exploit features interactions from feed-forward directions to learn deeper features of gRNAs and their corresponding epigenetic features. Experiments on commonly used datasets show that our CNN-SVR system outperforms available state-of-the-art methods in terms of prediction accuracy, generalization, and robustness. Source codes are available at https://github.com/Peppags/CNN-SVR.

Citing Articles

Transitioning from wet lab to artificial intelligence: a systematic review of AI predictors in CRISPR.

Abbasi A, Asim M, Dengel A J Transl Med. 2025; 23(1):153.

PMID: 39905452 PMC: 11796103. DOI: 10.1186/s12967-024-06013-w.


DeepMEns: an ensemble model for predicting sgRNA on-target activity based on multiple features.

Ding S, Zheng J, Jia C Brief Funct Genomics. 2024; 24.

PMID: 39528429 PMC: 11735754. DOI: 10.1093/bfgp/elae043.


The Evolution of Nucleic Acid-Based Diagnosis Methods from the (pre-)CRISPR to CRISPR era and the Associated Machine/Deep Learning Approaches in Relevant RNA Design.

Chakraborty S, Ray Dutta J, Ganesan R, Minary P Methods Mol Biol. 2024; 2847:241-300.

PMID: 39312149 DOI: 10.1007/978-1-0716-4079-1_17.


Codon usage and expression-based features significantly improve prediction of CRISPR efficiency.

Bergman S, Tuller T NPJ Syst Biol Appl. 2024; 10(1):100.

PMID: 39227603 PMC: 11372048. DOI: 10.1038/s41540-024-00431-8.


Strong association between genomic 3D structure and CRISPR cleavage efficiency.

Bergman S, Tuller T PLoS Comput Biol. 2024; 20(6):e1012214.

PMID: 38848440 PMC: 11189236. DOI: 10.1371/journal.pcbi.1012214.


References
1.
Shou J, Li J, Liu Y, Wu Q . Precise and Predictable CRISPR Chromosomal Rearrangements Reveal Principles of Cas9-Mediated Nucleotide Insertion. Mol Cell. 2018; 71(4):498-509.e4. DOI: 10.1016/j.molcel.2018.06.021. View

2.
Xu H, Xiao T, Chen C, Li W, Meyer C, Wu Q . Sequence determinants of improved CRISPR sgRNA design. Genome Res. 2015; 25(8):1147-57. PMC: 4509999. DOI: 10.1101/gr.191452.115. View

3.
Wong N, Liu W, Wang X . WU-CRISPR: characteristics of functional guide RNAs for the CRISPR/Cas9 system. Genome Biol. 2015; 16:218. PMC: 4629399. DOI: 10.1186/s13059-015-0784-0. View

4.
Listgarten J, Weinstein M, Kleinstiver B, Sousa A, Joung J, Crawford J . Prediction of off-target activities for the end-to-end design of CRISPR guide RNAs. Nat Biomed Eng. 2018; 2(1):38-47. PMC: 6037314. DOI: 10.1038/s41551-017-0178-6. View

5.
Mukaka M . Statistics corner: A guide to appropriate use of correlation coefficient in medical research. Malawi Med J. 2013; 24(3):69-71. PMC: 3576830. View