» Articles » PMID: 30989231

SurVIndel: Improving CNV Calling from High-throughput Sequencing Data Through Statistical Testing

Overview
Journal Bioinformatics
Specialty Biology
Date 2019 Apr 17
PMID 30989231
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Structural variations (SVs) are large scale mutations in a genome; although less frequent than point mutations, due to their large size they are responsible for more heritable differences between individuals. Two prominent classes of SVs are deletions and tandem duplications. They play important roles in many devastating genetic diseases, such as Smith-Magenis syndrome, Potocki-Lupski syndrome and Williams-Beuren syndrome. Since paired-end whole genome sequencing data have become widespread and affordable, reliably calling deletions and tandem duplications has been a major target in bioinformatics; unfortunately, the problem is far from being solved, since existing solutions often offer poor results when applied to real data.

Results: We developed a novel caller, SurVIndel, which focuses on detecting deletions and tandem duplications from paired next-generation sequencing data. SurVIndel uses discordant paired reads, clipped reads as well as statistical methods. We show that SurVIndel outperforms existing methods on both simulated and real biological datasets.

Availability And Implementation: SurVIndel is available at https://github.com/Mesh89/SurVIndel.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

SurVIndel2: improving copy number variant calling from next-generation sequencing using hidden split reads.

Rajaby R, Sung W Nat Commun. 2024; 15(1):10473.

PMID: 39622819 PMC: 11612505. DOI: 10.1038/s41467-024-53087-7.


A Catalogue of Structural Variation across Ancestrally Diverse Asian Genomes.

Juan Tan J, Li Z, Porta M, Rajaby R, Lim W, Tan Y Nat Commun. 2024; 15(1):9507.

PMID: 39496583 PMC: 11535549. DOI: 10.1038/s41467-024-53620-8.


FindCSV: a long-read based method for detecting complex structural variations.

Zheng Y, Shang X BMC Bioinformatics. 2024; 25(1):315.

PMID: 39342151 PMC: 11439270. DOI: 10.1186/s12859-024-05937-w.


Constructing telomere-to-telomere diploid genome by polishing haploid nanopore-based assembly.

Darian J, Kundu R, Rajaby R, Sung W Nat Methods. 2024; 21(4):574-583.

PMID: 38459383 DOI: 10.1038/s41592-023-02141-1.


SVvalidation: A long-read-based validation method for genomic structural variation.

Zheng Y, Shang X PLoS One. 2024; 19(1):e0291741.

PMID: 38181020 PMC: 10769053. DOI: 10.1371/journal.pone.0291741.