PMID: 36949070

Integrated Analysis of Genomic and Transcriptomic Data for the Discovery of Splice-associated Variants in Cancer

Abstract

Somatic mutations within non-coding regions, and even within exons, may have unidentified regulatory consequences that are often overlooked in analysis workflows. Here we present RegTools ( www.regtools.org ), a computationally efficient, free, and open-source software package designed to integrate somatic variants from genomic data with splice junctions from bulk or single-cell transcriptomic data to identify variants that may cause aberrant splicing. We apply RegTools to more than 9,000 tumor samples with both tumor DNA and RNA sequence data. RegTools discovers 235,778 events where a splice-associated variant significantly increases the splicing of a particular junction, across 158,200 unique variants and 131,212 unique junctions. To characterize these somatic variants and their associated splice isoforms, we annotate them with the Variant Effect Predictor, SpliceAI, and Genotype-Tissue Expression junction counts, and we compare our results to other tools that integrate genomic and transcriptomic data. While many events are corroborated by these tools, the flexibility of RegTools also allows us to identify splice-associated variants in known cancer drivers, such as TP53, CDKN2A, and B2M, and in other genes.
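The general idea of pairing somatic variants with splice junctions can be illustrated with a minimal Python sketch. It is not RegTools's interface or its statistical test: the `Variant` and `Junction` classes, the `window` parameter, the distance rule, and all coordinates are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Variant:
    chrom: str
    pos: int  # genomic position (hypothetical coordinates)


@dataclass(frozen=True)
class Junction:
    chrom: str
    start: int  # one edge of the junction
    end: int    # the other edge of the junction
    reads: int  # number of supporting RNA-seq reads


def junctions_near_variant(variant: Variant,
                           junctions: List[Junction],
                           window: int = 2) -> List[Junction]:
    """Return junctions on the variant's chromosome that have an edge
    within `window` bases of the variant (an assumed, simplified rule)."""
    hits = []
    for j in junctions:
        if j.chrom != variant.chrom:
            continue
        distance = min(abs(variant.pos - j.start), abs(variant.pos - j.end))
        if distance <= window:
            hits.append(j)
    return hits


# Made-up example data, not taken from the study.
variant = Variant("chr17", 7675994)
junctions = [
    Junction("chr17", 7675993, 7676381, 12),  # edge 1 base from the variant
    Junction("chr17", 7670000, 7673000, 40),  # same chromosome, far away
    Junction("chr9", 7675993, 7676381, 5),    # different chromosome
]
nearby = junctions_near_variant(variant, junctions, window=2)
```

In this toy setup only the first junction is paired with the variant. A real analysis would additionally compare junction read counts between samples with and without the variant before calling the association significant.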

Citing Articles

The contribution of genetic determinants of blood gene expression and splicing to molecular phenotypes and health outcomes.

Tokolyi A, Persyn E, Nath A, Burnham K, Marten J, Vanderstichele T. Nat Genet. 2025; 57(3):616-625.

PMID: 40038547 PMC: 11906350. DOI: 10.1038/s41588-025-02096-3.


Cross-cohort analysis of expression and splicing quantitative trait loci in TOPMed.

Orchard P, Blackwell T, Kachuri L, Castaldi P, Cho M, Christenson S. medRxiv. 2025.

PMID: 40034763 PMC: 11875316. DOI: 10.1101/2025.02.19.25322561.


Long-read RNA sequencing atlas of human microglia isoforms elucidates disease-associated genetic regulation of splicing.

Humphrey J, Brophy E, Kosoy R, Zeng B, Coccia E, Mattei D. Nat Genet. 2025; 57(3):604-615.

PMID: 40033057 DOI: 10.1038/s41588-025-02099-0.


Generation of transient totipotent blastomere-like stem cells by short-term high-dose Pladienolide B treatment.

Zhang W, An S, Hou S, He X, Xiang J, Yan H. Sci China Life Sci. 2025.

PMID: 40024996 DOI: 10.1007/s11427-024-2774-2.


Shiba: a versatile computational method for systematic identification of differential RNA splicing across platforms.

Kubota N, Chen L, Zheng S. Nucleic Acids Res. 2025; 53(4).

PMID: 39997221 PMC: 11851117. DOI: 10.1093/nar/gkaf098.

