
Generative Modeling for RNA Splicing Predictions and Design

Overview
Journal bioRxiv
Date 2025 Feb 3
PMID 39896553
Abstract

Alternative splicing (AS) of pre-mRNA plays a crucial role in tissue-specific gene regulation, and splicing defects have disease implications. Predicting and manipulating AS can therefore uncover new regulatory mechanisms and aid in the design of therapeutics. We introduce TrASPr+BOS, a generative AI model with Bayesian Optimization for predicting and designing RNA for tissue-specific splicing outcomes. TrASPr is a multi-transformer model that can handle different types of AS events and generalize to unseen cellular conditions. It then serves as an oracle, generating labeled data to train a Bayesian Optimization for Splicing (BOS) algorithm that designs RNA for condition-specific splicing outcomes. We show that TrASPr+BOS outperforms existing methods, improving tissue-specific AUPRC by up to 2.4-fold and capturing tissue-specific regulatory elements. We validate hundreds of predicted novel tissue-specific splicing variations and confirm new regulatory elements using dCas13. We envision TrASPr+BOS as a lightweight yet accurate method that researchers can probe or adopt for specific tasks.
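
The abstract describes a predictor acting as an oracle for a Bayesian optimization loop that designs sequences. As a rough illustration of that general pattern only, the sketch below runs a generic Gaussian-process loop with an upper-confidence-bound acquisition over single-base mutations. It is not the authors' BOS algorithm and does not use TrASPr: `toy_oracle` is a hypothetical stand-in scoring function, and the kernel, the `beta` weight, and the mutation scheme are all assumptions chosen for brevity.

```python
import numpy as np

ALPHABET = "ACGU"

def one_hot(seq):
    """Flatten an RNA sequence into a one-hot vector (4 entries per base)."""
    x = np.zeros((len(seq), 4))
    for i, base in enumerate(seq):
        x[i, ALPHABET.index(base)] = 1.0
    return x.ravel()

def toy_oracle(seq):
    """HYPOTHETICAL stand-in for a trained predictor (not TrASPr).
    Returns the fraction of G/U bases, a score in [0, 1]."""
    return sum(b in "GU" for b in seq) / len(seq)

def rbf_kernel(a, b, length_scale=2.0):
    """Squared-exponential kernel between the rows of a and the rows of b."""
    d2 = ((a[:, None, :] - b[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / length_scale**2)

def gp_posterior(x_train, y_train, x_query, noise=1e-4):
    """Gaussian-process posterior mean and standard deviation at x_query."""
    k = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    k_s = rbf_kernel(x_train, x_query)
    solved = np.linalg.solve(k, k_s)  # K^{-1} K_s
    mean = y_train.mean() + solved.T @ (y_train - y_train.mean())
    var = 1.0 - (k_s * solved).sum(0)  # prior variance of the kernel is 1
    return mean, np.sqrt(np.clip(var, 0.0, None))

def mutate(seq, rng):
    """Return a copy of seq with one randomly chosen base substituted."""
    i = int(rng.integers(len(seq)))
    options = [b for b in ALPHABET if b != seq[i]]
    return seq[:i] + options[int(rng.integers(3))] + seq[i + 1:]

def design(start, n_rounds=20, n_candidates=30, beta=2.0, seed=0):
    """Oracle-guided design loop: fit a surrogate to the labeled sequences,
    pick the candidate with the highest UCB, label it with the oracle."""
    rng = np.random.default_rng(seed)
    seqs, scores = [start], [toy_oracle(start)]
    for _ in range(n_rounds):
        best = seqs[int(np.argmax(scores))]
        proposals = (mutate(best, rng) for _ in range(n_candidates))
        candidates = [s for s in dict.fromkeys(proposals) if s not in seqs]
        if not candidates:
            continue
        mean, std = gp_posterior(
            np.array([one_hot(s) for s in seqs]),
            np.array(scores),
            np.array([one_hot(s) for s in candidates]),
        )
        pick = candidates[int(np.argmax(mean + beta * std))]
        seqs.append(pick)
        scores.append(toy_oracle(pick))  # one oracle call per round
    i = int(np.argmax(scores))
    return seqs[i], scores[i], scores

if __name__ == "__main__":
    best_seq, best_score, history = design("AAAAAAAAAAAA")
    print(best_seq, best_score, len(history))
```

In this sketch the surrogate is refit on every round from all oracle-labeled sequences, so the oracle is called only once per round. A larger `beta` favors unexplored candidates over ones the surrogate already rates highly.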
