» Articles » PMID: 37080758

Using Traditional Machine Learning and Deep Learning Methods for On- and Off-target Prediction in CRISPR/Cas9: a Review

Overview
Journal Brief Bioinform
Specialty Biology
Date 2023 Apr 20
PMID 37080758
Authors
Affiliations
Soon will be listed here.
Abstract

CRISPR/Cas9 (Clustered Regularly Interspaced Short Palindromic Repeats and CRISPR-associated protein 9) is a popular and effective two-component technology used for targeted genetic manipulation. It is currently the most versatile and accurate method of gene and genome editing, which benefits from a large variety of practical applications. For example, in biomedicine, it has been used in research related to cancer, virus infections, pathogen detection, and genetic diseases. Current CRISPR/Cas9 research is based on data-driven models for on- and off-target prediction as a cleavage may occur at non-target sequence locations. Nowadays, conventional machine learning and deep learning methods are applied on a regular basis to accurately predict on-target knockout efficacy and off-target profile of given single-guide RNAs (sgRNAs). In this paper, we present an overview and a comparative analysis of traditional machine learning and deep learning models used in CRISPR/Cas9. We highlight the key research challenges and directions associated with target activity prediction. We discuss recent advances in the sgRNA-DNA sequence encoding used in state-of-the-art on- and off-target prediction models. Furthermore, we present the most popular deep learning neural network architectures used in CRISPR/Cas9 prediction models. Finally, we summarize the existing challenges and discuss possible future investigations in the field of on- and off-target prediction. Our paper provides valuable support for academic and industrial researchers interested in the application of machine learning methods in the field of CRISPR/Cas9 genome editing.

Citing Articles

Gene therapy for genetic diseases: challenges and future directions.

Qie B, Tuo J, Chen F, Ding H, Lyu L MedComm (2020). 2025; 6(2):e70091.

PMID: 39949979 PMC: 11822459. DOI: 10.1002/mco2.70091.


Transitioning from wet lab to artificial intelligence: a systematic review of AI predictors in CRISPR.

Abbasi A, Asim M, Dengel A J Transl Med. 2025; 23(1):153.

PMID: 39905452 PMC: 11796103. DOI: 10.1186/s12967-024-06013-w.


Predicting CRISPR-Cas9 off-target effects in human primary cells using bidirectional LSTM with BERT embedding.

Sari O, Liu Z, Pan Y, Shao X Bioinform Adv. 2025; 5(1):vbae184.

PMID: 39758829 PMC: 11696696. DOI: 10.1093/bioadv/vbae184.


DeepMEns: an ensemble model for predicting sgRNA on-target activity based on multiple features.

Ding S, Zheng J, Jia C Brief Funct Genomics. 2024; 24.

PMID: 39528429 PMC: 11735754. DOI: 10.1093/bfgp/elae043.


Balanced Training Sets Improve Deep Learning-Based Prediction of CRISPR sgRNA Activity.

Trivedi V, Mohseni A, Lonardi S, Wheeldon I ACS Synth Biol. 2024; 13(11):3774-3781.

PMID: 39495623 PMC: 11574921. DOI: 10.1021/acssynbio.4c00542.


References
1.
Liu Q, He D, Xie L . Prediction of off-target specificity and cell-specific fitness of CRISPR-Cas System using attention boosted deep learning and network-based gene feature. PLoS Comput Biol. 2019; 15(10):e1007480. PMC: 6837542. DOI: 10.1371/journal.pcbi.1007480. View

2.
Mali P, Aach J, Stranges P, Esvelt K, Moosburner M, Kosuri S . CAS9 transcriptional activators for target specificity screening and paired nickases for cooperative genome engineering. Nat Biotechnol. 2013; 31(9):833-8. PMC: 3818127. DOI: 10.1038/nbt.2675. View

3.
Heigwer F, Kerr G, Boutros M . E-CRISP: fast CRISPR target site identification. Nat Methods. 2014; 11(2):122-3. DOI: 10.1038/nmeth.2812. View

4.
Raitskin O, Patron N . Multi-gene engineering in plants with RNA-guided Cas9 nuclease. Curr Opin Biotechnol. 2015; 37:69-75. DOI: 10.1016/j.copbio.2015.11.008. View

5.
Stortz F, Minary P . crisprSQL: a novel database platform for CRISPR/Cas off-target cleavage assays. Nucleic Acids Res. 2020; 49(D1):D855-D861. PMC: 7778913. DOI: 10.1093/nar/gkaa885. View