
Evaluation of Bias-variance Trade-off for Commonly Used Post-summarizing Normalization Procedures in Large-scale Gene Expression Studies

Overview
Journal PLoS One
Date 2014 Jun 19
PMID 24941114
Citations 8
Abstract

Normalization procedures are widely used in high-throughput genomic data analyses to remove technical noise and variation. They are known to have a profound impact on subsequent differential gene expression analysis. Although some research has evaluated different normalization procedures, few attempts have been made to systematically evaluate their gene detection performance from the bias-variance trade-off point of view, especially under strong gene differentiation effects and large sample sizes. In this paper, we conduct a thorough study of the effects of normalization procedures combined with several commonly used statistical tests and multiple testing procedures (MTPs) under different configurations of effect size and sample size. We conduct a theoretical evaluation based on a random effect model, and use simulations and biological data analyses to verify the results. Based on our findings, we provide practical guidance for selecting a suitable normalization procedure under different scenarios.

Citing Articles

FastMix: a versatile data integration pipeline for cell type-specific biomarker inference.

Zhang Y, Sun H, Mandava A, Aevermann B, Kollmann T, Scheuermann R. Bioinformatics. 2022; 38(20):4735-4744.

PMID: 36018232 PMC: 9801972. DOI: 10.1093/bioinformatics/btac585.


Airway gene-expression classifiers for respiratory syncytial virus (RSV) disease severity in infants.

Wang L, Chu C, McCall M, Slaunwhite C, Holden-Wiltse J, Corbett A. BMC Med Genomics. 2021; 14(1):57.

PMID: 33632195 PMC: 7908785. DOI: 10.1186/s12920-021-00913-2.


Microarray Normalization Revisited for Reproducible Breast Cancer Biomarkers.

Kenn M, Cacsire Castillo-Tong D, Singer C, Cibena M, Kolbl H, Schreiner W. Biomed Res Int. 2020; 2020:1363827.

PMID: 32832541 PMC: 7428878. DOI: 10.1155/2020/1363827.


Defining housekeeping genes suitable for RNA-seq analysis of the human allograft kidney biopsy tissue.

Wang Z, Lyu Z, Pan L, Zeng G, Randhawa P. BMC Med Genomics. 2019; 12(1):86.

PMID: 31208411 PMC: 6580566. DOI: 10.1186/s12920-019-0538-z.


Aims, Study Design, and Enrollment Results From the Assessing Predictors of Infant Respiratory Syncytial Virus Effects and Severity Study.

Walsh E, Mariani T, Chu C, Grier A, Gill S, Qiu X. JMIR Res Protoc. 2019; 8(6):e12907.

PMID: 31199303 PMC: 6595944. DOI: 10.2196/12907.

