» Articles » PMID: 39226898

SpliceVarDB: A Comprehensive Database of Experimentally Validated Human Splicing Variants

Overview
Journal Am J Hum Genet
Publisher Cell Press
Specialty Genetics
Date 2024 Sep 3
PMID 39226898
Authors
Affiliations
Soon will be listed here.
Abstract

Variants that alter gene splicing are estimated to comprise up to a third of all disease-causing variants, yet they are hard to predict from DNA sequencing data alone. To overcome this, many groups are incorporating RNA-based analyses, which are resource intensive, particularly for diagnostic laboratories. There are thousands of functionally validated variants that induce mis-splicing; however, this information is not consolidated, and they are under-represented in ClinVar, which presents a barrier to variant interpretation and can result in duplication of validation efforts. To address this issue, we developed SpliceVarDB, an online database consolidating over 50,000 variants assayed for their effects on splicing in over 8,000 human genes. We evaluated over 500 published data sources and established a spliceogenicity scale to standardize, harmonize, and consolidate variant validation data generated by a range of experimental protocols. According to the strength of their supporting evidence, variants were classified as "splice-altering" (∼25%), "not splice-altering" (∼25%), and "low-frequency splice-altering" (∼50%), which correspond to weak or indeterminate evidence of spliceogenicity. Importantly, 55% of the splice-altering variants in SpliceVarDB are outside the canonical splice sites (5.6% are deep intronic). These variants can support the variant curation diagnostic pathway and can be used to provide the high-quality data necessary to develop more accurate in silico splicing predictors. The variants are accessible through an online platform, SpliceVarDB, with additional features for visualization, variant information, in silico predictions, and validation metrics. SpliceVarDB is a very large collection of splice-altering variants and is available at https://splicevardb.org.

Citing Articles

Comprehensive Mapping of Human dsRNAome Reveals Conservation, Neuronal Enrichment, and Intermolecular Interactions.

Andrews R, Bass B bioRxiv. 2025; .

PMID: 39975386 PMC: 11838218. DOI: 10.1101/2025.01.24.634786.


Exploring the role of splicing in TP53 variant pathogenicity through predictions and minigene assays.

Fortuno C, Llinares-Burguet I, Canson D, de la Hoya M, Bueno-Martinez E, Sanoguera-Miralles L Hum Genomics. 2025; 19(1):2.

PMID: 39780207 PMC: 11715486. DOI: 10.1186/s40246-024-00714-5.


Best practices for germline variant and DNA methylation analysis of second- and third-generation sequencing data.

Bonfiglio F, Legati A, Lasorsa V, Palombo F, De Riso G, Isidori F Hum Genomics. 2024; 18(1):120.

PMID: 39501379 PMC: 11536923. DOI: 10.1186/s40246-024-00684-8.

References
1.
Sondka Z, Bamford S, Cole C, Ward S, Dunham I, Forbes S . The COSMIC Cancer Gene Census: describing genetic dysfunction across all human cancers. Nat Rev Cancer. 2018; 18(11):696-705. PMC: 6450507. DOI: 10.1038/s41568-018-0060-1. View

2.
Cooper T . Use of minigene systems to dissect alternative splicing elements. Methods. 2005; 37(4):331-40. DOI: 10.1016/j.ymeth.2005.07.015. View

3.
Thormann A, Halachev M, McLaren W, Moore D, Svinti V, Campbell A . Flexible and scalable diagnostic filtering of genomic variants using G2P with Ensembl VEP. Nat Commun. 2019; 10(1):2373. PMC: 6542828. DOI: 10.1038/s41467-019-10016-3. View

4.
Soemedi R, Cygan K, Rhine C, Wang J, Bulacan C, Yang J . Pathogenic variants that alter protein code often disrupt splicing. Nat Genet. 2017; 49(6):848-855. PMC: 6679692. DOI: 10.1038/ng.3837. View

5.
Danecek P, Bonfield J, Liddle J, Marshall J, Ohan V, Pollard M . Twelve years of SAMtools and BCFtools. Gigascience. 2021; 10(2). PMC: 7931819. DOI: 10.1093/gigascience/giab008. View