» Articles » PMID: 30794641

Batch Adjustment by Reference Alignment (BARA): Improved Prediction Performance in Biological Test Sets with Batch Effects

Overview
Journal PLoS One
Date 2019 Feb 23
PMID 30794641
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

Many biological data acquisition platforms suffer from inadvertent inclusion of biologically irrelevant variance in analyzed data, collectively termed batch effects. Batch effects can lead to difficulties in downstream analysis by lowering the power to detect biologically interesting differences and can in certain instances lead to false discoveries. They are especially troublesome in predictive modelling where samples in training sets and test sets are often completely correlated with batches. In this article, we present BARA, a normalization method for adjusting batch effects in predictive modelling. BARA utilizes a few reference samples to adjust for batch effects in a compressed data space spanned by the training set. We evaluate BARA using a collection of publicly available datasets and three different prediction models, and compare its performance to already existing methods developed for similar purposes. The results show that data normalized with BARA generates high and consistent prediction performances. Further, they suggest that BARA produces reliable performances independent of the examined classifiers. We therefore conclude that BARA has great potential to facilitate the development of predictive assays where test sets and training sets are correlated with batch.

Citing Articles

Circulating Chromosome Conformation Signatures Significantly Enhance PSA Positive Predicting Value and Overall Accuracy for Prostate Cancer Detection.

Pchejetski D, Hunter E, Dezfouli M, Salter M, Powell R, Green J Cancers (Basel). 2023; 15(3).

PMID: 36765779 PMC: 9913359. DOI: 10.3390/cancers15030821.


Methods for predicting vaccine immunogenicity and reactogenicity.

Gonzalez-Dias P, Lee E, Sorgi S, de Lima D, Urbanski A, Silveira E Hum Vaccin Immunother. 2019; 16(2):269-276.

PMID: 31869262 PMC: 7062420. DOI: 10.1080/21645515.2019.1697110.

References
1.
Dhingra N, Shemer A, Da Rosa J, Rozenblit M, Fuentes-Duculan J, Gittler J . Molecular profiling of contact dermatitis skin identifies allergen-dependent differences in immune response. J Allergy Clin Immunol. 2014; 134(2):362-72. DOI: 10.1016/j.jaci.2014.03.009. View

2.
Lambert S, Mladkova N, Gulati A, Hamoudi R, Purdie K, Cerio R . Key differences identified between actinic keratosis and cutaneous squamous cell carcinoma by transcriptome profiling. Br J Cancer. 2013; 110(2):520-9. PMC: 3899778. DOI: 10.1038/bjc.2013.760. View

3.
Wang L, Shen X, Wang Z, Xiao X, Wei P, Wang Q . A molecular signature for the prediction of recurrence in colorectal cancer. Mol Cancer. 2015; 14:22. PMC: 4320628. DOI: 10.1186/s12943-015-0296-2. View

4.
McLerran D, Grizzle W, Feng Z, Thompson I, Bigbee W, Cazares L . SELDI-TOF MS whole serum proteomic profiling with IMAC surface does not reliably detect prostate cancer. Clin Chem. 2007; 54(1):53-60. PMC: 4332515. DOI: 10.1373/clinchem.2007.091496. View

5.
Metzelder S, Michel C, von Bonin M, Rehberger M, Hessmann E, Inselmann S . NFATc1 as a therapeutic target in FLT3-ITD-positive AML. Leukemia. 2015; 29(7):1470-7. DOI: 10.1038/leu.2015.95. View