Normalization of Illumina Infinium Whole-genome SNP Data Improves Copy Number Estimates and Allelic Intensity Ratios

Overview
Publisher Biomed Central
Specialty Biology
Date 2008 Oct 4
PMID 18831757
Citations 80
Abstract

Background: Illumina Infinium whole genome genotyping (WGG) arrays are increasingly being applied in cancer genomics to study gene copy number alterations and allele-specific aberrations such as loss-of-heterozygosity (LOH). Methods developed for normalization of WGG arrays have mostly focused on diploid, normal samples. In cancer samples, however, genomic aberrations may confound both normalization and data interpretation. We therefore examined the effects of the conventional normalization method for Illumina Infinium arrays when applied to cancer samples.

Results: We demonstrate an asymmetry in the detection of the two alleles for each SNP, which adversely affects both allelic proportions and copy number estimates. The asymmetry is caused by a residual bias between the two dyes used in the Infinium II assay that persists after normalization with Illumina's proprietary software (BeadStudio). We propose a quantile normalization strategy to correct this dye bias. We tested the normalization strategy using 535 individual hybridizations from 10 data sets from the analysis of cancer genomes and normal blood samples, generated on Illumina Infinium II 300k (versions 1 and 2), 370k, and 550k BeadChips. We show that the proposed normalization strategy successfully removes the asymmetry in estimates of both allelic proportions and copy numbers. Additionally, the normalization strategy reduces the technical variation in copy number estimates while retaining the response to copy number alterations.
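
The abstract does not spell out the details of the quantile normalization, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that each SNP has one intensity per dye channel (called `x` and `y` here, hypothetical names), and that the two channels should share the same overall intensity distribution across SNPs. The function names, the choice of the rank-wise mean as the reference distribution, and the simulated dye bias are all illustrative assumptions.

```python
import numpy as np


def quantile_normalize_channels(x, y):
    """Map two intensity channels onto one shared reference distribution.

    Assumption: the reference is the rank-wise mean of the two sorted
    channels, which is the textbook form of quantile normalization.
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.ndim != 1 or x.shape != y.shape:
        raise ValueError("x and y must be 1-D arrays of equal length")

    # Reference distribution: average of the two channels at each rank.
    reference = (np.sort(x) + np.sort(y)) / 2.0

    # Rank of every value within its own channel (0 = smallest).
    rank_x = np.argsort(np.argsort(x, kind="stable"), kind="stable")
    rank_y = np.argsort(np.argsort(y, kind="stable"), kind="stable")

    # Replace each value by the reference value at the same rank.
    return reference[rank_x], reference[rank_y]


def allelic_proportion(x, y):
    """Fraction of the total signal that comes from the second channel."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    return y / (x + y)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_snps = 30_000

    # Simulated genotypes with equal numbers of AA and BB calls, so the two
    # channels have the same true distribution (an assumption of the sketch).
    genotype = np.repeat([0, 1, 2], n_snps // 3)  # 0 = AA, 1 = AB, 2 = BB
    x_true = np.array([1.0, 0.5, 0.05])[genotype]
    y_true = np.array([0.05, 0.5, 1.0])[genotype]

    noise_x = rng.normal(1.0, 0.05, n_snps)
    noise_y = rng.normal(1.0, 0.05, n_snps)
    x = x_true * noise_x
    y = 0.8 * y_true * noise_y  # hypothetical 20% dye bias in channel y

    het = genotype == 1
    before = allelic_proportion(x, y)[het].mean()
    x_n, y_n = quantile_normalize_channels(x, y)
    after = allelic_proportion(x_n, y_n)[het].mean()
    print(f"mean heterozygote proportion before: {before:.3f}")
    print(f"mean heterozygote proportion after:  {after:.3f}")
```

In this simulation, heterozygous SNPs should sit near an allelic proportion of 0.5; a dye bias shifts them away from it, and forcing both channels onto a common distribution pulls them back. The sketch relies on the two channels having comparable true distributions across SNPs, which may not hold for every sample or array design.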

Conclusion: The proposed normalization strategy represents a valuable tool that improves the quality of data obtained from Illumina Infinium arrays, in particular when used for LOH and copy number variation studies.

Citing Articles

Parkinson's Disease Pathogenic Variants: Cross-Ancestry Analysis and Microarray Data Validation.

Hong S, Koretsky M, Lichtenberg J, Leonard H, Pitz V medRxiv. 2025.

PMID: 39763553 PMC: 11702716. DOI: 10.1101/2024.12.16.24319097.


QTL identified that influence tuber length-width ratio, degree of flatness, tuber size, and specific gravity in a russet-skinned, tetraploid mapping population.

Park J, Whitworth J, Novy R Front Plant Sci. 2024; 15:1343632.

PMID: 38584948 PMC: 10996053. DOI: 10.3389/fpls.2024.1343632.


Identification of QTL associated with plant vine characteristics and infection response to late blight, early blight, and Verticillium wilt in a tetraploid potato population derived from late blight-resistant Palisade Russet.

Park J, Sathuvalli V, Yilma S, Whitworth J, Novy R Front Plant Sci. 2023; 14:1222596.

PMID: 37900754 PMC: 10600477. DOI: 10.3389/fpls.2023.1222596.


Preclinical evaluation of CDK4 phosphorylation predicts high sensitivity of pleural mesotheliomas to CDK4/6 inhibition.

Paternot S, Raspe E, Meiller C, Tarabichi M, Assie J, Libert F Mol Oncol. 2022; 18(4):866-894.

PMID: 36453028 PMC: 10994244. DOI: 10.1002/1878-0261.13351.


Construction of transgenic detection system of L. based on single nucleotide polymorphism chip.

Zhou E, Song N, Xiao Q, Farooq Z, Jia Z, Wen J 3 Biotech. 2021; 12(1):11.

PMID: 34966634 PMC: 8655060. DOI: 10.1007/s13205-021-03062-6.

