RSW-seq: Algorithm for Detection of Copy Number Alterations in Deep Sequencing Data

Overview
Publisher Biomed Central
Specialty Biology
Date 2010 Aug 20
PMID 20718989
Citations 27
Abstract

Background: Recent advances in sequencing technologies have enabled generation of large-scale genome sequencing data. These data can be used to characterize a variety of genomic features, including the DNA copy number profile of a cancer genome. A robust and reliable method for screening chromosomal alterations would allow a detailed characterization of the cancer genome with unprecedented accuracy.

Results: We develop a method for identifying copy number alterations in a tumor genome relative to its matched control, based on applying the Smith-Waterman algorithm to single-end sequencing data. In a performance test with simulated data, our algorithm shows >90% sensitivity and >90% precision in detecting a single copy number change that contains approximately 500 reads for the normal sample. With 100-bp reads, this corresponds to a ~50 kb region at 1X coverage of the human genome. We further refine the algorithm to develop rSW-seq (recursive Smith-Waterman-seq), which identifies alterations in complex configurations of the kind commonly observed in human cancer genomes. To validate our approach, we compare our algorithm with an existing algorithm using simulated and publicly available datasets. We also compare the sequencing-based profiles to microarray-based results.
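
The core idea can be illustrated with a minimal sketch. It is not the published implementation: it assumes that tumor and normal reads are merged and sorted by genomic position, that each tumor read scores +1 and each normal read scores -1 (hypothetical weights chosen for illustration), and that a gain appears as the contiguous run of reads with the highest cumulative score. The names `read_weights`, `max_scoring_segment`, and `region_size_bp` are likewise hypothetical. The recursive step of rSW-seq is not shown.

```python
from typing import List, Tuple


def read_weights(labels: str, tumor: float = 1.0, normal: float = -1.0) -> List[float]:
    """Map position-sorted read labels ('T' = tumor, 'N' = normal) to scores.

    The +1/-1 weights are an illustrative assumption, not the paper's values.
    """
    return [tumor if label == "T" else normal for label in labels]


def max_scoring_segment(weights: List[float]) -> Tuple[int, int, float]:
    """One-dimensional Smith-Waterman recurrence: S_i = max(0, S_{i-1} + w_i).

    Returns (start, end, score) of the best-scoring run, with `end` exclusive.
    """
    best_start, best_end, best_score = 0, 0, 0.0
    running, run_start = 0.0, 0
    for i, w in enumerate(weights):
        if running == 0.0:
            run_start = i  # a new candidate segment begins here
        running = max(0.0, running + w)
        if running > best_score:
            best_start, best_end, best_score = run_start, i + 1, running
    return best_start, best_end, best_score


def region_size_bp(n_reads: int, read_length: int, coverage: float) -> float:
    """Approximate genomic span covered by n_reads at the given fold coverage."""
    return n_reads * read_length / coverage


# Reads 4 to 10 are enriched for tumor reads, suggesting a gain in that span.
segment = max_scoring_segment(read_weights("NTNNTTTTNTTNNTN"))
span = region_size_bp(n_reads=500, read_length=100, coverage=1.0)
```

Here `segment` marks the tumor-enriched run of reads, and `span` reproduces the abstract's arithmetic: 500 reads of 100 bp at 1X coverage span about 50 kb. Swapping the sign of the weights would search for losses instead of gains.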

Conclusion: We propose rSW-seq as an efficient method for detecting copy number changes in the tumor genome.

Citing Articles

A bioinformatician, computer scientist, and geneticist lead bioinformatic tool development - which one is better?

Gardner P Bioinform Adv. 2025; 5(1):vbaf011.

PMID: 39981110 PMC: 11842046. DOI: 10.1093/bioadv/vbaf011.


A comprehensive benchmarking of WGS-based deletion structural variant callers.

Sarwal V, Niehus S, Ayyala R, Kim M, Sarkar A, Chang S Brief Bioinform. 2022; 23(4).

PMID: 35753701 PMC: 9294411. DOI: 10.1093/bib/bbac221.


Current status of structural variation studies in plants.

Yuan Y, Bayer P, Batley J, Edwards D Plant Biotechnol J. 2021; 19(11):2153-2163.

PMID: 34101329 PMC: 8541774. DOI: 10.1111/pbi.13646.


STARCH: copy number and clone inference from spatial transcriptomics data.

Elyanow R, Zeira R, Land M, Raphael B Phys Biol. 2020; 18(3):035001.

PMID: 33022659 PMC: 9876615. DOI: 10.1088/1478-3975/abbe99.


A systematic evaluation of copy number alterations detection methods on real SNP array and deep sequencing data.

Luo F BMC Bioinformatics. 2019; 20(Suppl 25):692.

PMID: 31874603 PMC: 6929333. DOI: 10.1186/s12859-019-3266-7.

