» Articles » PMID: 22276688

An Evaluation of Two-channel ChIP-on-chip and DNA Methylation Microarray Normalization Strategies

Overview
Journal BMC Genomics
Publisher Biomed Central
Specialty Genetics
Date 2012 Jan 27
PMID 22276688
Citations 11
Authors
Affiliations
Soon will be listed here.
Abstract

Background: The combination of chromatin immunoprecipitation with two-channel microarray technology enables genome-wide mapping of binding sites of DNA-interacting proteins (ChIP-on-chip) or sites with methylated CpG di-nucleotides (DNA methylation microarray). These powerful tools are the gateway to understanding gene transcription regulation. Since the goals of such studies, the sample preparation procedures, the microarray content and study design are all different from transcriptomics microarrays, the data pre-processing strategies traditionally applied to transcriptomics microarrays may not be appropriate. Particularly, the main challenge of the normalization of "regulation microarrays" is (i) to make the data of individual microarrays quantitatively comparable and (ii) to keep the signals of the enriched probes, representing DNA sequences from the precipitate, as distinguishable as possible from the signals of the un-enriched probes, representing DNA sequences largely absent from the precipitate.

Results: We compare several widely used normalization approaches (VSN, LOWESS, quantile, T-quantile, Tukey's biweight scaling, Peng's method) applied to a selection of regulation microarray datasets, ranging from DNA methylation to transcription factor binding and histone modification studies. Through comparison of the data distributions of control probes and gene promoter probes before and after normalization, and assessment of the power to identify known enriched genomic regions after normalization, we demonstrate that there are clear differences in performance between normalization procedures.

Conclusion: T-quantile normalization applied separately on the channels and Tukey's biweight scaling outperform other methods in terms of the conservation of enriched and un-enriched signal separation, as well as in identification of genomic regions known to be enriched. T-quantile normalization is preferable as it additionally improves comparability between microarrays. In contrast, popular normalization approaches like quantile, LOWESS, Peng's method and VSN normalization alter the data distributions of regulation microarrays to such an extent that using these approaches will impact the reliability of the downstream analysis substantially.

Citing Articles

Simultaneous Improvement in the Precision, Accuracy, and Robustness of Label-free Proteome Quantification by Optimizing Data Manipulation Chains.

Tang J, Fu J, Wang Y, Luo Y, Yang Q, Li B Mol Cell Proteomics. 2019; 18(8):1683-1699.

PMID: 31097671 PMC: 6682996. DOI: 10.1074/mcp.RA118.001169.


Effects of developmental lead exposure on the hippocampal methylome: Influences of sex and timing and level of exposure.

Singh G, Singh V, Wang Z, Voisin G, Lefebvre F, Navenot J Toxicol Lett. 2018; 290:63-72.

PMID: 29571894 PMC: 5952363. DOI: 10.1016/j.toxlet.2018.03.021.


Novel Epigenetic Biomarkers Mediating Bisphenol A Exposure and Metabolic Phenotypes in Female Mice.

Anderson O, Kim J, Peterson K, Sanchez B, Sant K, Sartor M Endocrinology. 2016; 158(1):31-40.

PMID: 27824486 PMC: 5412976. DOI: 10.1210/en.2016-1441.


Comparison of pre-processing methods for multiplex bead-based immunoassays.

Rausch T, Schillert A, Ziegler A, Luking A, Zucht H, Schulz-Knappe P BMC Genomics. 2016; 17(1):601.

PMID: 27515389 PMC: 4982217. DOI: 10.1186/s12864-016-2888-7.


Methylation Landscape of Human Breast Cancer Cells in Response to Dietary Compound Resveratrol.

Medina-Aguilar R, Perez-Plasencia C, Marchat L, Gariglio P, Garcia Mena J, Rodriguez Cuevas S PLoS One. 2016; 11(6):e0157866.

PMID: 27355345 PMC: 4927060. DOI: 10.1371/journal.pone.0157866.


References
1.
Movassagh M, Choy M, Goddard M, Bennett M, Down T, Foo R . Differential DNA methylation correlates with differential expression of angiogenic factors in human heart failure. PLoS One. 2010; 5(1):e8564. PMC: 2797324. DOI: 10.1371/journal.pone.0008564. View

2.
Kepler T, Crosby L, Morgan K . Normalization and analysis of DNA microarray data by self-consistency and local regression. Genome Biol. 2002; 3(7):RESEARCH0037. PMC: 126242. DOI: 10.1186/gb-2002-3-7-research0037. View

3.
Ordway J, Bedell J, Citek R, Nunberg A, Garrido A, Kendall R . Comprehensive DNA methylation profiling in a human cancer genome identifies novel epigenetic targets. Carcinogenesis. 2006; 27(12):2409-23. DOI: 10.1093/carcin/bgl161. View

4.
Lu R, Lee G, Shultz M, Dardick C, Jung K, Phetsom J . Assessing probe-specific dye and slide biases in two-color microarray data. BMC Bioinformatics. 2008; 9:314. PMC: 2496918. DOI: 10.1186/1471-2105-9-314. View

5.
Huber W, von Heydebreck A, Sultmann H, Poustka A, Vingron M . Variance stabilization applied to microarray data calibration and to the quantification of differential expression. Bioinformatics. 2002; 18 Suppl 1:S96-104. DOI: 10.1093/bioinformatics/18.suppl_1.s96. View