PyComBat, a Python Tool for Batch Effects Correction in High-throughput Molecular Data Using Empirical Bayes Methods
Overview
Authors
Affiliations
Background: Variability in datasets is not only the product of biological processes: they are also the product of technical biases. ComBat and ComBat-Seq are among the most widely used tools for correcting those technical biases, called batch effects, in, respectively, microarray and RNA-Seq expression data.
Results: In this technical note, we present a new Python implementation of ComBat and ComBat-Seq. While the mathematical framework is strictly the same, we show here that our implementations: (i) have similar results in terms of batch effects correction; (ii) are as fast or faster than the original implementations in R and; (iii) offer new tools for the bioinformatics community to participate in its development. pyComBat is implemented in the Python language and is distributed under GPL-3.0 ( https://www.gnu.org/licenses/gpl-3.0.en.html ) license as a module of the inmoose package. Source code is available at https://github.com/epigenelabs/inmoose and Python package at https://pypi.org/project/inmoose .
Conclusions: We present a new Python implementation of state-of-the-art tools ComBat and ComBat-Seq for the correction of batch effects in microarray and RNA-Seq data. This new implementation, based on the same mathematical frameworks as ComBat and ComBat-Seq, offers similar power for batch effect correction, at reduced computational cost.
AI Model for Predicting Anti-PD1 Response in Melanoma Using Multi-Omics Biomarkers.
Gschwind A, Ossowski S Cancers (Basel). 2025; 17(5).
PMID: 40075562 PMC: 11899402. DOI: 10.3390/cancers17050714.
Exploring NLRP3-related phenotypic fingerprints in human macrophages using Cell Painting assay.
Herring M, Sarndahl E, Kotlyar O, Scherbak N, Engwall M, Karlsson R iScience. 2025; 28(3):111961.
PMID: 40040812 PMC: 11876907. DOI: 10.1016/j.isci.2025.111961.
Ivarsson Orrelid C, Rosberg O, Weiner S, Johansson F, Gobom J, Zetterberg H Fluids Barriers CNS. 2025; 22(1):23.
PMID: 40033432 PMC: 11874791. DOI: 10.1186/s12987-025-00634-z.
Pardo A, Pardo J, VanBuren R bioRxiv. 2025; .
PMID: 40027706 PMC: 11870519. DOI: 10.1101/2025.02.15.638452.
Maarseveen T, Glas H, Veris-van Dieren J, van den Akker E, Knevel R NPJ Digit Med. 2025; 8(1):98.
PMID: 39948271 PMC: 11825706. DOI: 10.1038/s41746-025-01495-4.