» Articles » PMID: 29608177

Batch Effects in Single-cell RNA-sequencing Data Are Corrected by Matching Mutual Nearest Neighbors

Overview
Journal Nat Biotechnol
Specialty Biotechnology
Date 2018 Apr 3
PMID 29608177
Citations 977
Authors
Affiliations
Soon will be listed here.
Abstract

Large-scale single-cell RNA sequencing (scRNA-seq) data sets that are produced in different laboratories and at different times contain batch effects that may compromise the integration and interpretation of the data. Existing scRNA-seq analysis methods incorrectly assume that the composition of cell populations is either known or identical across batches. We present a strategy for batch correction based on the detection of mutual nearest neighbors (MNNs) in the high-dimensional expression space. Our approach does not rely on predefined or equal population compositions across batches; instead, it requires only that a subset of the population be shared between batches. We demonstrate the superiority of our approach compared with existing methods by using both simulated and real scRNA-seq data sets. Using multiple droplet-based scRNA-seq data sets, we demonstrate that our MNN batch-effect-correction method can be scaled to large numbers of cells.

Citing Articles

A single-cell transcriptomic atlas reveals the cell differentiation trajectory and the response to virus invasion in swelling clove of garlic.

Gao S, Li F, Zeng Z, He Q, Mostafa H, Zhang S Hortic Res. 2025; 12(4):uhae365.

PMID: 40070403 PMC: 11894531. DOI: 10.1093/hr/uhae365.


An model for cardiac organoid production: The combined role of geometrical confinement and substrate stiffness.

Santoro R, Piacentini L, Vavassori C, Benzoni P, Colombo G, Banfi C Mater Today Bio. 2025; 31:101566.

PMID: 40061214 PMC: 11889630. DOI: 10.1016/j.mtbio.2025.101566.


Mosquito Cell Atlas: A single-nucleus transcriptomic atlas of the adult mosquito.

Goldman O, DeFoe A, Qi Y, Jiao Y, Weng S, Houri-Zeevi L bioRxiv. 2025; .

PMID: 40060408 PMC: 11888250. DOI: 10.1101/2025.02.25.639765.


MAEST: accurately spatial domain detection in spatial transcriptomics with graph masked autoencoder.

Zhu P, Shu H, Wang Y, Wang X, Zhao Y, Hu J Brief Bioinform. 2025; 26(2).

PMID: 40052440 PMC: 11886571. DOI: 10.1093/bib/bbaf086.


Focal adhesion in the tumour metastasis: from molecular mechanisms to therapeutic targets.

Liu Z, Zhang X, Ben T, Li M, Jin Y, Wang T Biomark Res. 2025; 13(1):38.

PMID: 40045379 PMC: 11884212. DOI: 10.1186/s40364-025-00745-7.


References
1.
Segerstolpe A, Palasantza A, Eliasson P, Andersson E, Andreasson A, Sun X . Single-Cell Transcriptome Profiling of Human Pancreatic Islets in Health and Type 2 Diabetes. Cell Metab. 2016; 24(4):593-607. PMC: 5069352. DOI: 10.1016/j.cmet.2016.08.020. View

2.
Tung P, Blischak J, Hsiao C, Knowles D, Burnett J, Pritchard J . Batch effects and the effective design of single-cell gene expression studies. Sci Rep. 2017; 7:39921. PMC: 5206706. DOI: 10.1038/srep39921. View

3.
Bendall S, Davis K, Amir E, Tadmor M, Simonds E, Chen T . Single-cell trajectory detection uncovers progression and regulatory coordination in human B cell development. Cell. 2014; 157(3):714-25. PMC: 4045247. DOI: 10.1016/j.cell.2014.04.005. View

4.
Angerer P, Haghverdi L, Buttner M, Theis F, Marr C, Buettner F . destiny: diffusion maps for large-scale single-cell data in R. Bioinformatics. 2015; 32(8):1241-3. DOI: 10.1093/bioinformatics/btv715. View

5.
Scialdone A, Tanaka Y, Jawaid W, Moignard V, Wilson N, Macaulay I . Resolving early mesoderm diversification through single-cell expression profiling. Nature. 2016; 535(7611):289-293. PMC: 4947525. DOI: 10.1038/nature18633. View