» Articles » PMID: 37189388

Hybrid Multitask Learning Reveals Sequence Features Driving Specificity in the CRISPR/Cas9 System

Overview
Journal Biomolecules
Publisher MDPI
Date 2023 May 16
PMID 37189388
Authors
Affiliations
Soon will be listed here.
Abstract

CRISPR/Cas9 technology is capable of precisely editing genomes and is at the heart of various scientific and medical advances in recent times. The advances in biomedical research are hindered because of the inadvertent burden on the genome when genome editors are employed-the off-target effects. Although experimental screens to detect off-targets have allowed understanding the activity of Cas9, that knowledge remains incomplete as the rules do not extrapolate well to new target sequences. Off-target prediction tools developed recently have increasingly relied on machine learning and deep learning techniques to reliably understand the complete threat of likely off-targets because the rules that drive Cas9 activity are not fully understood. In this study, we present a count-based as well as deep-learning-based approach to derive sequence features that are important in deciding on Cas9 activity at a sequence. There are two major challenges in off-target determination-the identification of a likely site of Cas9 activity and the prediction of the extent of Cas9 activity at that site. The hybrid multitask CNN-biLSTM model developed, named CRISP-RCNN, simultaneously predicts off-targets and the extent of activity on off-targets. Employing methods of integrated gradients and weighting kernels for feature importance approximation, analysis of nucleotide and position preference, and mismatch tolerance have been performed.

Citing Articles

Transitioning from wet lab to artificial intelligence: a systematic review of AI predictors in CRISPR.

Abbasi A, Asim M, Dengel A J Transl Med. 2025; 23(1):153.

PMID: 39905452 PMC: 11796103. DOI: 10.1186/s12967-024-06013-w.


Crispr-SGRU: Prediction of CRISPR/Cas9 Off-Target Activities with Mismatches and Indels Using Stacked BiGRU.

Zhang G, Luo Y, Xie H, Dai Z Int J Mol Sci. 2024; 25(20).

PMID: 39456727 PMC: 11507390. DOI: 10.3390/ijms252010945.


Improved Machine Learning-Based Model for the Classification of Off-Targets in the CRISPR/Cpf1 System.

Kesarwani P, Vora D, Sundar D ACS Omega. 2023; 8(48):45578-45588.

PMID: 38075777 PMC: 10701861. DOI: 10.1021/acsomega.3c05691.


Deep learning in CRISPR-Cas systems: a review of recent studies.

Lee M Front Bioeng Biotechnol. 2023; 11:1226182.

PMID: 37469443 PMC: 10352112. DOI: 10.3389/fbioe.2023.1226182.

References
1.
Kim H, Kim Y, Lee S, Min S, Bae J, Choi J . SpCas9 activity prediction by DeepSpCas9, a deep learning-based model with high generalization performance. Sci Adv. 2019; 5(11):eaax9249. PMC: 6834390. DOI: 10.1126/sciadv.aax9249. View

2.
Zou J, Huss M, Abid A, Mohammadi P, Torkamani A, Telenti A . A primer on deep learning in genomics. Nat Genet. 2018; 51(1):12-18. PMC: 11180539. DOI: 10.1038/s41588-018-0295-5. View

3.
Mitchell B, Hsu R, Medrano M, Zewde N, Narkhede Y, Palermo G . Spontaneous Embedding of DNA Mismatches Within the RNA:DNA Hybrid of CRISPR-Cas9. Front Mol Biosci. 2020; 7:39. PMC: 7093078. DOI: 10.3389/fmolb.2020.00039. View

4.
Tang L, Yang F, He X, Xie H, Liu X, Fu J . Efficient cleavage resolves PAM preferences of CRISPR-Cas in human cells. Cell Regen. 2019; 8(2):44-50. PMC: 6833347. DOI: 10.1016/j.cr.2019.08.002. View

5.
Semenova E, Jore M, Datsenko K, Semenova A, Westra E, Wanner B . Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence. Proc Natl Acad Sci U S A. 2011; 108(25):10098-103. PMC: 3121866. DOI: 10.1073/pnas.1104144108. View