» Articles » PMID: 39095341

Evaluating Batch Correction Methods for Image-based Cell Profiling

Overview
Journal Nat Commun
Specialty Biology
Date 2024 Aug 2
PMID 39095341
Authors
Affiliations
Soon will be listed here.
Abstract

High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands of perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, such as predicting drug mechanism of action or gene function. However, batch effects severely limit community-wide efforts to integrate and interpret image-based profiling data collected across different laboratories and equipment. To address this problem, we benchmark ten high-performing single-cell RNA sequencing (scRNA-seq) batch correction techniques, representing diverse approaches, using a newly released Cell Painting dataset, JUMP. We focus on five scenarios with varying complexity, ranging from batches prepared in a single lab over time to batches imaged using different microscopes in multiple labs. We find that Harmony and Seurat RPCA are noteworthy, consistently ranking among the top three methods for all tested scenarios while maintaining computational efficiency. Our proposed framework, benchmark, and metrics can be used to assess new batch correction methods in the future. This work paves the way for improvements that enable the community to make the best use of public Cell Painting data for scientific discovery.

Citing Articles

Reproducible image-based profiling with Pycytominer.

Serrano E, Chandrasekaran S, Bunten D, Brewer K, Tomkinson J, Kern R Nat Methods. 2025; .

PMID: 40032995 DOI: 10.1038/s41592-025-02611-8.


Cell Painting for cytotoxicity and mode-of-action analysis in primary human hepatocytes.

Ewald J, Titterton K, Bauerle A, Beatson A, Boiko D, Cabrera A bioRxiv. 2025; .

PMID: 39896617 PMC: 11785178. DOI: 10.1101/2025.01.22.634152.


A genome-wide atlas of human cell morphology.

Ramezani M, Weisbart E, Bauman J, Singh A, Yong J, Lozada M Nat Methods. 2025; 22(3):621-633.

PMID: 39870862 PMC: 11903339. DOI: 10.1038/s41592-024-02537-7.


Artificial Intelligence and Neuroscience: Transformative Synergies in Brain Research and Clinical Applications.

Onciul R, Tataru C, Dumitru A, Crivoi C, Serban M, Covache-Busuioc R J Clin Med. 2025; 14(2).

PMID: 39860555 PMC: 11766073. DOI: 10.3390/jcm14020550.


Predicting cell morphological responses to perturbations using generative modeling.

Palma A, Theis F, Lotfollahi M Nat Commun. 2025; 16(1):505.

PMID: 39779675 PMC: 11711326. DOI: 10.1038/s41467-024-55707-8.


References
1.
Haghighi M, Caicedo J, Cimini B, Carpenter A, Singh S . High-dimensional gene expression and morphology profiles of cells across 28,000 genetic and chemical perturbations. Nat Methods. 2022; 19(12):1550-1557. PMC: 10012424. DOI: 10.1038/s41592-022-01667-0. View

2.
Johnson W, Li C, Rabinovic A . Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2006; 8(1):118-27. DOI: 10.1093/biostatistics/kxj037. View

3.
Korsunsky I, Millard N, Fan J, Slowikowski K, Zhang F, Wei K . Fast, sensitive and accurate integration of single-cell data with Harmony. Nat Methods. 2019; 16(12):1289-1296. PMC: 6884693. DOI: 10.1038/s41592-019-0619-0. View

4.
Tran H, Ang K, Chevrier M, Zhang X, Lee N, Goh M . A benchmark of batch-effect correction methods for single-cell RNA sequencing data. Genome Biol. 2020; 21(1):12. PMC: 6964114. DOI: 10.1186/s13059-019-1850-9. View

5.
Hao Y, Stuart T, Kowalski M, Choudhary S, Hoffman P, Hartman A . Dictionary learning for integrative, multimodal and scalable single-cell analysis. Nat Biotechnol. 2023; 42(2):293-304. PMC: 10928517. DOI: 10.1038/s41587-023-01767-y. View