» Articles » PMID: 31985791

Bivartect: Accurate and Memory-saving Breakpoint Detection by Direct Read Comparison

Overview
Journal Bioinformatics
Specialty Biology
Date 2020 Jan 28
PMID 31985791
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Genetic variant calling with high-throughput sequencing data has been recognized as a useful tool for better understanding of disease mechanism and detection of potential off-target sites in genome editing. Since most of the variant calling algorithms rely on initial mapping onto a reference genome and tend to predict many variant candidates, variant calling remains challenging in terms of predicting variants with low false positives.

Results: Here we present Bivartect, a simple yet versatile variant caller based on direct comparison of short sequence reads between normal and mutated samples. Bivartect can detect not only single nucleotide variants but also insertions/deletions, inversions and their complexes. Bivartect achieves high predictive performance with an elaborate memory-saving mechanism, which allows Bivartect to run on a computer with a single node for analyzing small omics data. Tests with simulated benchmark and real genome-editing data indicate that Bivartect was comparable to state-of-the-art variant callers in positive predictive value for detection of single nucleotide variants, even though it yielded a substantially small number of candidates. These results suggest that Bivartect, a reference-free approach, will contribute to the identification of germline mutations as well as off-target sites introduced during genome editing with high accuracy.

Availability And Implementation: Bivartect is implemented in C++ and available along with in silico simulated data at https://github.com/ykat0/bivartect.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

Detecting gene breakpoints in noisy genome sequences using position-annotated colored de-Bruijn graphs.

Fiedler L, Bernt M, Middendorf M, Stadler P BMC Bioinformatics. 2023; 24(1):235.

PMID: 37277700 PMC: 10243065. DOI: 10.1186/s12859-023-05371-4.


Temporal convolutional network for a Fast DNA mutation detection in breast cancer data.

Wisesty U, Mengko T, Purwarianti A, Pancoro A PLoS One. 2023; 18(5):e0285981.

PMID: 37228159 PMC: 10212167. DOI: 10.1371/journal.pone.0285981.

References
1.
Li H, Durbin R . Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009; 25(14):1754-60. PMC: 2705234. DOI: 10.1093/bioinformatics/btp324. View

2.
Chen K, Wallis J, McLellan M, Larson D, Kalicki J, Pohl C . BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat Methods. 2009; 6(9):677-81. PMC: 3661775. DOI: 10.1038/nmeth.1363. View

3.
Wang J, Mullighan C, Easton J, Roberts S, Heatley S, Ma J . CREST maps somatic structural variation in cancer genomes with base-pair resolution. Nat Methods. 2011; 8(8):652-4. PMC: 3527068. DOI: 10.1038/nmeth.1628. View

4.
Sudmant P, Rausch T, Gardner E, Handsaker R, Abyzov A, Huddleston J . An integrated map of structural variation in 2,504 human genomes. Nature. 2015; 526(7571):75-81. PMC: 4617611. DOI: 10.1038/nature15394. View

5.
Cibulskis K, Lawrence M, Carter S, Sivachenko A, Jaffe D, Sougnez C . Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat Biotechnol. 2013; 31(3):213-9. PMC: 3833702. DOI: 10.1038/nbt.2514. View