
Copynumber: Efficient Algorithms for Single- and Multi-track Copy Number Segmentation

Overview
Journal BMC Genomics
Publisher BioMed Central
Specialty Genetics
Date 2013 Feb 28
PMID 23442169
Citations 162
Abstract

Background: Cancer progression is associated with genomic instability and an accumulation of gains and losses of DNA. The growing variety of tools for measuring genomic copy numbers, including various types of array-CGH, SNP arrays and high-throughput sequencing, calls for a coherent framework offering unified and consistent handling of single- and multi-track segmentation problems. In addition, there is a demand for highly computationally efficient segmentation algorithms, due to the emergence of very high density scans of copy number.

Results: A comprehensive Bioconductor package for copy number analysis is presented. The package offers a unified framework for single sample, multi-sample and multi-track segmentation and is based on statistically sound penalized least squares principles. Conditional on the number of breakpoints, the estimates are optimal in the least squares sense. A novel and computationally highly efficient algorithm is proposed that utilizes vector-based operations in R. Three case studies are presented.

Conclusions: The R package copynumber is a software suite for segmentation of single- and multi-track copy number data using algorithms based on coherent least squares principles.
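The penalized least squares principle mentioned in the Results can be illustrated with a minimal sketch. It assumes the common formulation in which a piecewise constant fit minimizes the sum of squared residuals around segment means plus a penalty gamma for each segment. The function name `pls_segment`, the parameter `gamma`, and the plain dynamic programming recursion are illustrative assumptions; this is not the package's own vectorized R algorithm, and it is written in Python rather than R.

```python
def pls_segment(y, gamma):
    """Penalized least squares segmentation by dynamic programming (sketch).

    Minimizes: (sum of squared residuals around segment means)
               + gamma * (number of segments).
    Returns (starts, fitted): segment start indices and the fitted values.
    Runs in O(n^2) time; intended only to illustrate the criterion.
    """
    y = [float(v) for v in y]
    n = len(y)

    # Prefix sums let us compute the squared error of any segment in O(1).
    s1, s2 = [0.0], [0.0]
    for v in y:
        s1.append(s1[-1] + v)
        s2.append(s2[-1] + v * v)

    best = [0.0] + [float("inf")] * n  # best[k]: minimal cost for y[:k]
    last_start = [0] * (n + 1)         # start index of the last segment of y[:k]

    for k in range(1, n + 1):
        for j in range(k):
            seg_sum = s1[k] - s1[j]
            sse = (s2[k] - s2[j]) - seg_sum * seg_sum / (k - j)
            cost = best[j] + sse + gamma
            if cost < best[k]:
                best[k] = cost
                last_start[k] = j

    # Backtrack from the end to recover the segment starts.
    starts = []
    k = n
    while k > 0:
        starts.append(last_start[k])
        k = last_start[k]
    starts.reverse()

    # Fitted value for each probe is the mean of its segment.
    fitted = []
    bounds = starts + [n]
    for a, b in zip(bounds[:-1], bounds[1:]):
        mean = (s1[b] - s1[a]) / (b - a)
        fitted.extend([mean] * (b - a))
    return starts, fitted


# Toy noise-free track: a gain of five probes in the middle.
track = [0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 0, 0, 0, 0]
starts, fitted = pls_segment(track, gamma=0.5)
print(starts)  # segment start indices
```

A larger `gamma` yields fewer segments. For whatever number of segments the minimizer ends up with, its fit has the smallest squared error among all partitions with that many segments, which is the sense in which such estimates are optimal conditional on the number of breakpoints.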

Citing Articles

Protocol for genome-wide analysis of somatic variants at single-cell resolution using primary template-directed DNA amplification.

Derks L, van Leeuwen A, Steemers A, Trabut L, van Roosmalen M, Poort V. STAR Protoc. 2024; 6(1):103499.

PMID: 39709610 PMC: 11726792. DOI: 10.1016/j.xpro.2024.103499.


Extensive epigenomic dysregulation is a hallmark of homologous recombination deficiency in triple-negative breast cancer.

Chen Y, Salas L, Marotti J, Jenkins N, Cheng C, Miller T. Int J Cancer. 2024; 156(6):1191-1202.

PMID: 39635770 PMC: 11738659. DOI: 10.1002/ijc.35274.


Uterine mesenchymal tumours harboring the KAT6B/A::KANSL1 gene fusion represent a distinct type of uterine sarcoma based on DNA methylation profiles.

Kommoss F, Charbel A, Kolin D, Howitt B, Kobel M, Lee J. Virchows Arch. 2024; 485(5):793-803.

PMID: 39392508 PMC: 11564218. DOI: 10.1007/s00428-024-03935-0.


CDK4 is co-amplified with either TP53 promoter gene fusions or MDM2 through distinct mechanisms in osteosarcoma.

Saba K, Difilippo V, Styring E, Nilsson J, Magnusson L, van den Bos H. NPJ Genom Med. 2024; 9(1):42.

PMID: 39322633 PMC: 11424644. DOI: 10.1038/s41525-024-00430-y.


Clinical-grade whole genome sequencing-based haplarithmisis enables all forms of preimplantation genetic testing.

Janssen A, Koeck R, Essers R, Cao P, van Dijk W, Drusedau M. Nat Commun. 2024; 15(1):7164.

PMID: 39223156 PMC: 11369272. DOI: 10.1038/s41467-024-51508-1.

