Single Base-pair Substitutions in Exon-intron Junctions of Human Genes: Nature, Distribution, and Consequences for MRNA Splicing
Authors
Affiliations
Although single base-pair substitutions in splice junctions constitute at least 10% of all mutations causing human inherited disease, the factors that determine their phenotypic consequences at the RNA level remain to be fully elucidated. Employing a neural network for splice-site recognition, we performed a meta-analysis of 478 disease-associated splicing mutations, in 38 different genes, for which detailed laboratory-based mRNA phenotype assessment had been performed. Inspection of the +/-50-bp DNA sequence context of the mutations revealed that exon skipping was the preferred phenotype when the immediate vicinity of the affected exon-intron junctions was devoid of alternative splice-sites. By contrast, in the presence of at least one such motif, cryptic splice-site utilization, became more prevalent. This association was, however, confined to donor splice-sites. Outside the obligate dinucleotide, the spatial distribution of pathological mutations was found to differ significantly from that of SNPs. Whereas disease-associated lesions clustered at positions -1 and +3 to +6 for donor sites and -3 for acceptor sites, SNPs were found to be almost evenly distributed over all sequence positions considered. When all putative missense mutations in the vicinity of splice-sites were extracted from the Human Gene Mutation Database for the 38 studied genes, a significantly higher proportion of changes at donor sites (37/152; 24.3%) than at acceptor splice-sites (1/142; 0.7%) was found to reduce the neural network signal emitted by the respective splice-site. Based upon these findings, we estimate that some 1.6% of disease-causing missense substitutions in human genes are likely to affect the mRNA splicing phenotype. Taken together, our results are consistent with correct donor splice-site recognition being a key step in exon recognition.
RNA structure: implications in viral infections and neurodegenerative diseases.
Lu S, Tang Y, Yin S, Sun L Adv Biotechnol (Singap). 2025; 2(1):3.
PMID: 39883271 PMC: 11740852. DOI: 10.1007/s44307-024-00010-2.
Ridgeway A, Shortall C, Finnegan L, Long R, Matthews E, Dockery A Genes (Basel). 2025; 16(1).
PMID: 39858572 PMC: 11764614. DOI: 10.3390/genes16010025.
Tse V, Guiterrez M, Townley J, Romano J, Pearl J, Chacaltana G bioRxiv. 2024; .
PMID: 39463988 PMC: 11507933. DOI: 10.1101/2024.10.15.618608.
Validating the splicing effect of rare variants in the SLC26A4 gene using minigene assay.
Zhao Y, Long Y, Shi T, Ma X, Lian C, Wang H BMC Med Genomics. 2024; 17(1):233.
PMID: 39334476 PMC: 11430457. DOI: 10.1186/s12920-024-02007-1.
Protein isoform-centric therapeutics: expanding targets and increasing specificity.
Kjer-Hansen P, Phan T, Weatheritt R Nat Rev Drug Discov. 2024; 23(10):759-779.
PMID: 39232238 DOI: 10.1038/s41573-024-01025-z.