» Articles » PMID: 39407093

Be-dataHIVE: a Base Editing Database

Overview
Publisher Biomed Central
Specialty Biology
Date 2024 Oct 15
PMID 39407093
Authors
Affiliations
Soon will be listed here.
Abstract

Base editing is an enhanced gene editing approach that enables the precise transformation of single nucleotides and has the potential to cure rare diseases. The design process of base editors is labour-intensive and outcomes are not easily predictable. For any clinical use, base editing has to be accurate and efficient. Thus, any bystander mutations have to be minimized. In recent years, computational models to predict base editing outcomes have been developed. However, the overall robustness and performance of those models is limited. One way to improve the performance is to train models on a diverse, feature-rich, and large dataset, which does not exist for the base editing field. Hence, we develop BE-dataHIVE, a mySQL database that covers over 460,000 gRNA target combinations. The current version of BE-dataHIVE consists of data from five studies and is enriched with melting temperatures and energy terms. Furthermore, multiple different data structures for machine learning were computed and are directly available. The database can be accessed via our website https://be-datahive.com/ or API and is therefore suitable for practitioners and machine learning researchers.

References
1.
Stortz F, Minary P . crisprSQL: a novel database platform for CRISPR/Cas off-target cleavage assays. Nucleic Acids Res. 2020; 49(D1):D855-D861. PMC: 7778913. DOI: 10.1093/nar/gkaa885. View

2.
Koblan L, Arbab M, Shen M, Hussmann J, Anzalone A, Doman J . Efficient C•G-to-G•C base editors developed using CRISPRi screens, target-library analysis, and machine learning. Nat Biotechnol. 2021; 39(11):1414-1425. PMC: 8985520. DOI: 10.1038/s41587-021-00938-z. View

3.
Gruber A, Lorenz R, Bernhart S, Neubock R, Hofacker I . The Vienna RNA websuite. Nucleic Acids Res. 2008; 36(Web Server issue):W70-4. PMC: 2447809. DOI: 10.1093/nar/gkn188. View

4.
Stortz F, Mak J, Minary P . piCRISPR: Physically informed deep learning models for CRISPR/Cas9 off-target cleavage prediction. Artif Intell Life Sci. 2023; 3:None. PMC: 10316064. DOI: 10.1016/j.ailsci.2023.100075. View

5.
Dandage R, Despres P, Yachie N, Landry C . : A Computational Workflow for Designing Libraries of Guide RNAs for CRISPR-Mediated Base Editing. Genetics. 2019; 212(2):377-385. PMC: 6553823. DOI: 10.1534/genetics.119.302089. View