» Articles » PMID: 39404807

MitoSort: Robust Demultiplexing of Pooled Single-cell Genomic Data Using Endogenous Mitochondrial Variants

Overview
Specialty Biology
Date 2024 Oct 15
PMID 39404807
Authors
Affiliations
Soon will be listed here.
Abstract

Multiplexing across donors has emerged as a popular strategy to increase throughput, reduce costs, overcome technical batch effects, and improve doublet detection in single-cell genomic studies. To eliminate additional experimental steps, endogenous nuclear genome variants are used for demultiplexing pooled single-cell RNA sequencing (scRNA-seq) data by several computational tools. However, these tools have limitations when applied to single-cell sequencing methods that do not cover nuclear genomic regions well, such as single-cell assay for transposase-accessible chromatin with sequencing (scATAC-seq). Here, we demonstrate that mitochondrial germline variants are an alternative, robust, and computationally efficient endogenous barcode for sample demultiplexing. We propose MitoSort, a tool that uses mitochondrial germline variants to assign cells to their donor origins and identify cross-genotype doublets in single-cell genomic datasets. We evaluate its performance by using in silico pooled mitochondrial scATAC-seq (mtscATAC-seq) libraries and experimentally multiplexed data with cell hashtags. MitoSort achieves high accuracy and efficiency in genotype clustering and doublet detection for mtscATAC-seq data, addressing the limitations of current computational techniques tailored for scRNA-seq data. Moreover, MitoSort exhibits versatility, and can be applied to various single-cell sequencing approaches beyond mtscATAC-seq provided that the mitochondrial variants are reliably detected. Furthermore, we demonstrate the application of MitoSort in a case study where B cells from eight donors were pooled and assayed by single-cell multi-omics sequencing. Altogether, our results demonstrate the accuracy and efficiency of MitoSort, which enables reliable sample demultiplexing in various single-cell genomic applications. MitoSort is available at https://github.com/tangzhj/MitoSort.

Citing Articles

The impact of ambient contamination on demultiplexing methods for single-nucleus multiome experiments.

Li T, Alvarez M, Liu C, Abuhanna K, Sun Y, Ernst J Res Sq. 2025; .

PMID: 39989953 PMC: 11844637. DOI: 10.21203/rs.3.rs-5977005/v1.


The impact of ambient contamination on demultiplexing methods for single-nucleus multiome experiments.

Li T, Alvarez M, Liu C, Abuhanna K, Sun Y, Ernst J bioRxiv. 2025; .

PMID: 39975005 PMC: 11839078. DOI: 10.1101/2025.02.06.636969.


Immunoregulatory programs in anti-N-methyl-D-aspartate receptor encephalitis identified by single-cell multi-omics analysis.

Li X, Xu Y, Zhang W, Chen Z, Peng D, Ren W Clin Transl Med. 2025; 15(1):e70173.

PMID: 39779473 PMC: 11710936. DOI: 10.1002/ctm2.70173.

References
1.
Neavin D, Senabouth A, Arora H, Lee J, Ripoll-Cladellas A, Franke L . Demuxafy: improvement in droplet assignment by integrating multiple single-cell demultiplexing and doublet detection methods. Genome Biol. 2024; 25(1):94. PMC: 11020463. DOI: 10.1186/s13059-024-03224-8. View

2.
Huang Y, McCarthy D, Stegle O . Vireo: Bayesian demultiplexing of pooled single-cell RNA-seq data without genotype reference. Genome Biol. 2019; 20(1):273. PMC: 6909514. DOI: 10.1186/s13059-019-1865-2. View

3.
Piwecka M, Rajewsky N, Rybak-Wolf A . Single-cell and spatial transcriptomics: deciphering brain complexity in health and disease. Nat Rev Neurol. 2023; 19(6):346-362. PMC: 10191412. DOI: 10.1038/s41582-023-00809-y. View

4.
Lareau C, Dubois S, Buquicchio F, Hsieh Y, Garg K, Kautz P . Single-cell multi-omics of mitochondrial DNA disorders reveals dynamics of purifying selection across human immune cells. Nat Genet. 2023; 55(7):1198-1209. PMC: 10548551. DOI: 10.1038/s41588-023-01433-8. View

5.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A . The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010; 20(9):1297-303. PMC: 2928508. DOI: 10.1101/gr.107524.110. View