Scalable and Unsupervised Discovery from Raw Sequencing Reads Using SPLASH2
Overview
Affiliations
We introduce SPLASH2, a fast, scalable implementation of SPLASH based on an efficient k-mer counting approach for regulated sequence variation detection in massive datasets from a wide range of sequencing technologies and biological contexts. We demonstrate biological discovery by SPLASH2 in single-cell RNA sequencing (RNA-seq) data and in bulk RNA-seq data from the Cancer Cell Line Encyclopedia, including unannotated alternative splicing in cancer transcriptomes and sensitive detection of circular RNA.
sc-SPLASH provides ultra-efficient reference-free discovery in barcoded single-cell sequencing.
Dehghannasiri R, Kokot M, Starr A, Maziarz J, Gordon T, Tan S bioRxiv. 2025; .
PMID: 39763839 PMC: 11703226. DOI: 10.1101/2024.12.24.630263.
A reference-free algorithm discovers regulation in the plant transcriptome.
Meyer E, Saldivar E, Kokot M, Xue B, Deorowicz S, Rhee S bioRxiv. 2024; .
PMID: 38826472 PMC: 11142198. DOI: 10.1101/2024.05.23.595613.
SPLASH: A statistical, reference-free genomic algorithm unifies biological discovery.
Chaung K, Baharav T, Henderson G, Zheludev I, Wang P, Salzman J Cell. 2023; 186(25):5440-5456.e26.
PMID: 38065078 PMC: 10861363. DOI: 10.1016/j.cell.2023.10.028.
OASIS: An interpretable, finite-sample valid alternative to Pearson's for scientific discovery.
Baharav T, Tse D, Salzman J bioRxiv. 2023; .
PMID: 37961606 PMC: 10634974. DOI: 10.1101/2023.03.16.533008.
SPLASH: a statistical, reference-free genomic algorithm unifies biological discovery.
Chaung K, Baharav T, Henderson G, Zheludev I, Wang P, Salzman J bioRxiv. 2022; .
PMID: 35794890 PMC: 9258296. DOI: 10.1101/2022.06.24.497555.