» Articles » PMID: 39397427

Thinking Points for Effective Batch Correction on Biomedical Data

Overview
Journal Brief Bioinform
Specialty Biology
Date 2024 Oct 14
PMID 39397427
Authors
Affiliations
Soon will be listed here.
Abstract

Batch effects introduce significant variability into high-dimensional data, complicating accurate analysis and leading to potentially misleading conclusions if not adequately addressed. Despite technological and algorithmic advancements in biomedical research, effectively managing batch effects remains a complex challenge requiring comprehensive considerations. This paper underscores the necessity of a flexible and holistic approach for selecting batch effect correction algorithms (BECAs), advocating for proper BECA evaluations and consideration of artificial intelligence-based strategies. We also discuss key challenges in batch effect correction, including the importance of uncovering hidden batch factors and understanding the impact of design imbalance, missing values, and aggressive correction. Our aim is to provide researchers with a robust framework for effective batch effects management and enhancing the reliability of high-dimensional data analyses.

References
1.
Xiong J, Gong F, Ma L, Wan L . scVIC: deep generative modeling of heterogeneity for scRNA-seq data. Bioinform Adv. 2024; 4(1):vbae086. PMC: 11256938. DOI: 10.1093/bioadv/vbae086. View

2.
Parker H, Leek J, Favorov A, Considine M, Xia X, Chavan S . Preserving biological heterogeneity with a permuted surrogate variable analysis for genomics batch correction. Bioinformatics. 2014; 30(19):2757-63. PMC: 4173013. DOI: 10.1093/bioinformatics/btu375. View

3.
Tran H, Ang K, Chevrier M, Zhang X, Lee N, Goh M . A benchmark of batch-effect correction methods for single-cell RNA sequencing data. Genome Biol. 2020; 21(1):12. PMC: 6964114. DOI: 10.1186/s13059-019-1850-9. View

4.
Hui H, Kong W, Peng H, Goh W . The importance of batch sensitization in missing value imputation. Sci Rep. 2023; 13(1):3003. PMC: 9944322. DOI: 10.1038/s41598-023-30084-2. View

5.
Goh W, Wang W, Wong L . Why Batch Effects Matter in Omics Data, and How to Avoid Them. Trends Biotechnol. 2017; 35(6):498-507. DOI: 10.1016/j.tibtech.2017.02.012. View