Breaking the Waves: Improved Detection of Copy Number Variation from Microarray-based Comparative Genomic Hybridization

Overview

Journal Genome Biol

Specialties Biology
Genetics

Date 2007 Oct 27

PMID 17961237

Citations 77

Authors

John C Marioni

Natalie P Thorne

Armand Valsesia

Tomas Fitzgerald

Richard Redon

Heike Fiegler

T Daniel Andrews

Barbara E Stranger

Andrew G Lynch

Emmanouil T Dermitzakis

Nigel P Carter

Simon Tavare

Matthew E Hurles

Abstract

Background: Large-scale high-throughput studies using microarray technology have established that copy number variation (CNV) throughout the genome is more frequent than previously thought. Such variation is known to play an important role in the presence and development of phenotypes such as HIV-1 infection and Alzheimer's disease. However, methods for analyzing the complex data produced and identifying regions of CNV are still being refined.

Results: We describe the presence of a genome-wide technical artifact, spatial autocorrelation or 'wave', which occurs in a large dataset used to determine the location of CNV across the genome. By removing this artifact we are able to obtain both a more biologically meaningful clustering of the data and an increase in the number of CNVs identified by current calling methods without a major increase in the number of false positives detected. Moreover, removing this artifact is critical for the development of a novel model-based CNV calling algorithm - CNVmix - that uses cross-sample information to identify regions of the genome where CNVs occur. For regions of CNV that are identified by both CNVmix and current methods, we demonstrate that CNVmix is better able to categorize samples into groups that represent copy number gains or losses.
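The idea of removing a slowly varying 'wave' before calling CNVs can be illustrated with a minimal sketch. The abstract does not say how the correction is performed, so the running-median detrending below is a generic stand-in and not the authors' procedure. The function name `remove_wave`, the window size, and the synthetic data are all hypothetical and chosen only for illustration.

```python
import numpy as np

def remove_wave(log_ratios, window=51):
    """Subtract a running-median trend from log2 ratios ordered by genomic position.

    Illustrative stand-in only: the window size is an assumption, and this is
    not the correction used in the paper.
    """
    x = np.asarray(log_ratios, dtype=float)
    if window % 2 == 0:
        raise ValueError("window must be odd")
    half = window // 2
    # Reflect the ends so that every probe has a full window around it.
    padded = np.pad(x, half, mode="reflect")
    # Row i of `windows` holds the values centred on probe i.
    windows = np.lib.stride_tricks.sliding_window_view(padded, window)
    trend = np.median(windows, axis=1)
    return x - trend, trend

# Synthetic profile (assumed values): noise, a long-period wave, one short deletion.
rng = np.random.default_rng(0)
n = 2000
position = np.arange(n)
wave = 0.1 * np.sin(2 * np.pi * position / 400)
signal = rng.normal(0.0, 0.05, n) + wave
signal[1000:1010] -= 0.6  # ten-probe copy number loss

corrected, trend = remove_wave(signal)
```

A median is used rather than a mean so that a short run of shifted probes, such as the simulated deletion, has little influence on the estimated trend and is therefore not subtracted away along with the wave. The window must stay much longer than the CNVs of interest and much shorter than the wave period for this to hold.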

Conclusion: Removing artifactual 'waves' (which appear to be a general feature of array comparative genomic hybridization (aCGH) datasets) and using cross-sample information when identifying CNVs enables more biological information to be extracted from aCGH experiments designed to investigate copy number variation in normal individuals.
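To make "using cross-sample information" concrete, the sketch below fits a three-component Gaussian mixture (loss, neutral, gain) to the log2 ratios of many samples at a single locus and assigns each sample to its most probable component. This is a generic expectation-maximization example, not the CNVmix model, whose details are not given in the abstract. The function name `fit_three_state_mixture`, the shared-variance assumption, the starting values, and the simulated data are all hypothetical.

```python
import numpy as np

def fit_three_state_mixture(values, n_iter=100):
    """Fit a 3-component Gaussian mixture with a shared variance by EM.

    Components are nominally loss, neutral, gain. Generic illustration only;
    this is not the CNVmix algorithm.
    """
    x = np.asarray(values, dtype=float)
    means = np.array([x.min(), 0.0, x.max()])  # assumed starting points
    weights = np.full(3, 1.0 / 3.0)
    var = x.var() + 1e-6
    for _ in range(n_iter):
        # E-step: posterior probability of each component for each sample.
        # The Gaussian normalising constant is shared, so it cancels.
        log_p = np.log(weights) - 0.5 * (x[:, None] - means) ** 2 / var
        log_p -= log_p.max(axis=1, keepdims=True)
        resp = np.exp(log_p)
        resp /= resp.sum(axis=1, keepdims=True)
        # M-step: update weights, means, and the shared variance.
        totals = resp.sum(axis=0) + 1e-12
        weights = totals / totals.sum()
        means = (resp * x[:, None]).sum(axis=0) / totals
        var = (resp * (x[:, None] - means) ** 2).sum() / len(x) + 1e-6
    return means, weights, resp.argmax(axis=1)

# Simulated locus (assumed values): 20 losses, 60 neutral samples, 10 gains.
rng = np.random.default_rng(1)
ratios = np.concatenate([
    rng.normal(-0.5, 0.04, 20),
    rng.normal(0.0, 0.04, 60),
    rng.normal(0.4, 0.04, 10),
])
means, weights, labels = fit_three_state_mixture(ratios)
```

Pooling samples lets the cluster positions be estimated from the data rather than fixed by a per-sample threshold. Note that this simple version always fits three components, so at a locus with no gains or no losses it would split the neutral group; a practical method would need to choose the number of components.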

Citing Articles

MicroRNAs Expression Profile in MN1-Altered Astroblastoma.

Gianno F, Miele E, Sabato C, Ferretti E, Minasi S, Buttarelli F Biomedicines. 2025; 13(1).

PMID: 39857696 PMC: 11762140. DOI: 10.3390/biomedicines13010112.

Improving CNV Detection Performance in Microarray Data Using a Machine Learning-Based Approach.

Goh C, Kwon H, Kim Y, Jung S, Park J, Lee I Diagnostics (Basel). 2024; 14(1).

PMID: 38201393 PMC: 10871075. DOI: 10.3390/diagnostics14010084.

Genome-wide association studies for economically important traits in mink using copy number variation.

Davoudi P, Do D, Colombo S, Rathgeber B, Sargolzaei M, Plastow G Sci Rep. 2024; 14(1):24.

PMID: 38167844 PMC: 10762091. DOI: 10.1038/s41598-023-50497-3.

A pipeline for copy number profiling of single circulating tumour cells to assess intrapatient tumour heterogeneity.

Deger T, Mendelaar P, Kraan J, Prager-van der Smissen W, van der Vlugt-Daane M, Bindels E Mol Oncol. 2021; 16(16):2981-3000.

PMID: 34964258 PMC: 9394233. DOI: 10.1002/1878-0261.13174.

Bayesian copy number detection and association in large-scale studies.

Cristiano S, McKean D, Carey J, Bracci P, Brennan P, Chou M BMC Cancer. 2020; 20(1):856.

PMID: 32894098 PMC: 7487704. DOI: 10.1186/s12885-020-07304-3.
