» Articles » PMID: 27429743

SNPsplit: Allele-specific Splitting of Alignments Between Genomes with Known SNP Genotypes

Overview
Journal F1000Res
Date 2016 Aug 19
PMID 27429743
Citations 106
Authors
Affiliations
Soon will be listed here.
Abstract

Sequencing reads overlapping polymorphic sites in diploid mammalian genomes may be assigned to one allele or the other. This holds the potential to detect gene expression, chromatin modifications, DNA methylation or nuclear interactions in an allele-specific fashion. SNPsplit is an allele-specific alignment sorter designed to read files in SAM/BAM format and determine the allelic origin of reads or read-pairs that cover known single nucleotide polymorphic (SNP) positions. For this to work libraries must have been aligned to a genome in which all known SNP positions were masked with the ambiguity base 'N' and aligned using a suitable mapping program such as Bowtie2, TopHat, STAR, HISAT2, HiCUP or Bismark. SNPsplit also provides an automated solution to generate N-masked reference genomes for hybrid mouse strains based on the variant call information provided by the Mouse Genomes Project. The unique ability of SNPsplit to work with various different kinds of sequencing data including RNA-Seq, ChIP-Seq, Bisulfite-Seq or Hi-C opens new avenues for the integrative exploration of allele-specific data.

Citing Articles

Reconstruction of diploid higher-order human 3D genome interactions from noisy Pore-C data using Dip3D.

Chen Y, Lin Z, Wang S, Wu B, Niu L, Zhong J Nat Struct Mol Biol. 2025; .

PMID: 40038455 DOI: 10.1038/s41594-025-01512-w.


Transcription can be sufficient, but is not necessary, to advance replication timing.

Vouzas A, Sasaki T, Rivera-Mulia J, Turner J, Brown A, Alexander K bioRxiv. 2025; .

PMID: 39975371 PMC: 11838563. DOI: 10.1101/2025.02.04.636516.


Multi-omics analysis in primary T cells elucidates mechanisms behind disease-associated genetic loci.

Shi C, Zhao D, Butler J, Frantzeskos A, Rossi S, Ding J Genome Biol. 2025; 26(1):26.

PMID: 39930543 PMC: 11808986. DOI: 10.1186/s13059-025-03492-y.


Temporal and regional X-linked gene reactivation in the mouse germline reveals site-specific retention of epigenetic silencing.

Roidor C, Syx L, Beyne E, Raynaud P, Zielinski D, Teissandier A Nat Struct Mol Biol. 2025; .

PMID: 39838109 DOI: 10.1038/s41594-024-01469-2.


Acquired sperm hypomethylation by gestational arsenic exposure is re-established in both the paternal and maternal genomes of post-epigenetic reprogramming embryos.

Nohara K, Suzuki T, Okamura K, Kawai T, Nakabayashi K Epigenetics Chromatin. 2025; 18(1):4.

PMID: 39815295 PMC: 11737231. DOI: 10.1186/s13072-025-00569-7.


References
1.
Rao S, Huntley M, Durand N, Stamenova E, Bochkov I, Robinson J . A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014; 159(7):1665-80. PMC: 5635824. DOI: 10.1016/j.cell.2014.11.021. View

2.
Li H, Durbin R . Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics. 2010; 26(5):589-95. PMC: 2828108. DOI: 10.1093/bioinformatics/btp698. View

3.
Degner J, Marioni J, Pai A, Pickrell J, Nkadori E, Gilad Y . Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data. Bioinformatics. 2009; 25(24):3207-12. PMC: 2788925. DOI: 10.1093/bioinformatics/btp579. View

4.
Langmead B, Salzberg S . Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012; 9(4):357-9. PMC: 3322381. DOI: 10.1038/nmeth.1923. View

5.
Castel S, Levy-Moonshine A, Mohammadi P, Banks E, Lappalainen T . Tools and best practices for data processing in allelic expression analysis. Genome Biol. 2015; 16:195. PMC: 4574606. DOI: 10.1186/s13059-015-0762-6. View