Benchmarking Computational Doublet-Detection Methods for Single-Cell RNA Sequencing Data
Overview
Cell Biology
Molecular Biology
Authors
Affiliations
In single-cell RNA sequencing (scRNA-seq), doublets form when two cells are encapsulated into one reaction volume. The existence of doublets, which appear to be-but are not-real cells, is a key confounder in scRNA-seq data analysis. Computational methods have been developed to detect doublets in scRNA-seq data; however, the scRNA-seq field lacks a comprehensive benchmarking of these methods, making it difficult for researchers to choose an appropriate method for specific analyses. We conducted a systematic benchmark study of nine cutting-edge computational doublet-detection methods. Our study included 16 real datasets, which contained experimentally annotated doublets, and 112 realistic synthetic datasets. We compared doublet-detection methods regarding detection accuracy under various experimental settings, impacts on downstream analyses, and computational efficiencies. Our results show that existing methods exhibited diverse performance and distinct advantages in different aspects. Overall, the DoubletFinder method has the best detection accuracy, and the cxds method has the highest computational efficiency. A record of this paper's transparent peer review process is included in the Supplemental Information.
Improving doublet cell removal efficiency through multiple algorithm runs.
She Y, Wang C, Zhao Q Comput Struct Biotechnol J. 2025; 27:451-460.
PMID: 39911841 PMC: 11794957. DOI: 10.1016/j.csbj.2025.01.009.
Segmentation aware probabilistic phenotyping of single-cell spatial protein expression data.
Lee Y, Chen E, Chan D, Dinesh A, Afiuni-Zadeh S, Klamann C Nat Commun. 2025; 16(1):389.
PMID: 39755686 PMC: 11700195. DOI: 10.1038/s41467-024-55214-w.
ImageDoubler: image-based doublet identification in single-cell sequencing.
Deng K, Xu X, Zhou M, Li H, Keller E, Shelley G Nat Commun. 2025; 16(1):21.
PMID: 39747095 PMC: 11695948. DOI: 10.1038/s41467-024-55434-0.
Bellavance J, David L, Hildebrand M J Neurosci Res. 2024; 102(12):e70008.
PMID: 39673257 PMC: 11645520. DOI: 10.1002/jnr.70008.
demuxSNP: supervised demultiplexing single-cell RNA sequencing using cell hashing and SNPs.
Lynch M, Wang Y, Ho Sui S, Gatto L, Culhane A Gigascience. 2024; 13.
PMID: 39607981 PMC: 11604057. DOI: 10.1093/gigascience/giae090.