» Articles » PMID: 26170239

BatAlign: an Incremental Method for Accurate Alignment of Sequencing Reads

Overview
Specialty Biochemistry
Date 2015 Jul 15
PMID 26170239
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Structural variations (SVs) play a crucial role in genetic diversity. However, the alignments of reads near/across SVs are made inaccurate by the presence of polymorphisms. BatAlign is an algorithm that integrated two strategies called 'Reverse-Alignment' and 'Deep-Scan' to improve the accuracy of read-alignment. In our experiments, BatAlign was able to obtain the highest F-measures in read-alignments on mismatch-aberrant, indel-aberrant, concordantly/discordantly paired and SV-spanning data sets. On real data, the alignments of BatAlign were able to recover 4.3% more PCR-validated SVs with 73.3% less callings. These suggest BatAlign to be effective in detecting SVs and other polymorphic-variants accurately using high-throughput data. BatAlign is publicly available at https://goo.gl/a6phxB.

Citing Articles

Benchmarking DNA methylation analysis of 14 alignment algorithms for whole genome bisulfite sequencing in mammals.

Gong W, Pan X, Xu D, Ji G, Wang Y, Tian Y Comput Struct Biotechnol J. 2022; 20:4704-4716.

PMID: 36147684 PMC: 9465269. DOI: 10.1016/j.csbj.2022.08.051.


An integrated package for bisulfite DNA methylation data analysis with Indel-sensitive mapping.

Zhou Q, Lim J, Sung W, Li G BMC Bioinformatics. 2019; 20(1):47.

PMID: 30669962 PMC: 6343306. DOI: 10.1186/s12859-018-2593-4.


Performance evaluation method for read mapping tool in clinical panel sequencing.

Lee H, Lee K, Lee T, Park D, Chung J, Lee C Genes Genomics. 2018; 40(2):189-197.

PMID: 29568413 PMC: 5846869. DOI: 10.1007/s13258-017-0621-9.


AlignerBoost: A Generalized Software Toolkit for Boosting Next-Gen Sequencing Mapping Accuracy Using a Bayesian-Based Mapping Quality Framework.

Zheng Q, Grice E PLoS Comput Biol. 2016; 12(10):e1005096.

PMID: 27706155 PMC: 5051939. DOI: 10.1371/journal.pcbi.1005096.


Suitability of Different Mapping Algorithms for Genome-Wide Polymorphism Scans with Pool-Seq Data.

Kofler R, Langmuller A, Nouhaud P, Otte K, Schlotterer C G3 (Bethesda). 2016; 6(11):3507-3515.

PMID: 27613752 PMC: 5100849. DOI: 10.1534/g3.116.034488.

References
1.
Gentleman R, Carey V, Bates D, Bolstad B, Dettling M, Dudoit S . Bioconductor: open software development for computational biology and bioinformatics. Genome Biol. 2004; 5(10):R80. PMC: 545600. DOI: 10.1186/gb-2004-5-10-r80. View

2.
Ma B, Tromp J, Li M . PatternHunter: faster and more sensitive homology search. Bioinformatics. 2002; 18(3):440-5. DOI: 10.1093/bioinformatics/18.3.440. View

3.
Li R, Li Y, Kristiansen K, Wang J . SOAP: short oligonucleotide alignment program. Bioinformatics. 2008; 24(5):713-4. DOI: 10.1093/bioinformatics/btn025. View

4.
Farrar M . Striped Smith-Waterman speeds database searches six times over other SIMD implementations. Bioinformatics. 2006; 23(2):156-61. DOI: 10.1093/bioinformatics/btl582. View

5.
Mills R, Luttig C, Larkins C, Beauchamp A, Tsui C, Pittard W . An initial map of insertion and deletion (INDEL) variation in the human genome. Genome Res. 2006; 16(9):1182-90. PMC: 1557762. DOI: 10.1101/gr.4565806. View