Atlas-scale Single-cell Multi-sample Multi-condition Data Integration Using ScMerge2
Overview
Affiliations
The recent emergence of multi-sample multi-condition single-cell multi-cohort studies allows researchers to investigate different cell states. The effective integration of multiple large-cohort studies promises biological insights into cells under different conditions that individual studies cannot provide. Here, we present scMerge2, a scalable algorithm that allows data integration of atlas-scale multi-sample multi-condition single-cell studies. We have generalized scMerge2 to enable the merging of millions of cells from single-cell studies generated by various single-cell technologies. Using a large COVID-19 data collection with over five million cells from 1000+ individuals, we demonstrate that scMerge2 enables multi-sample multi-condition scRNA-seq data integration from multiple cohorts and reveals signatures derived from cell-type expression that are more accurate in discriminating disease progression. Further, we demonstrate that scMerge2 can remove dataset variability in CyTOF, imaging mass cytometry and CITE-seq experiments, demonstrating its applicability to a broad spectrum of single-cell profiling technologies.
Baumann A, Ahmadi N, Wolfien M Methods Mol Biol. 2024; 2883:31-51.
PMID: 39702703 DOI: 10.1007/978-1-0716-4290-0_2.
Capture of Totipotency in Mouse Embryonic Stem Cells in the Absence of Pdzk1.
Zhang W, Zhao Y, Yang Z, Yan J, Wang H, Nie S Adv Sci (Weinh). 2024; 12(6):e2408852.
PMID: 39630006 PMC: 11809344. DOI: 10.1002/advs.202408852.
scCTS: identifying the cell type-specific marker genes from population-level single-cell RNA-seq.
Chen L, Guo Z, Deng T, Wu H Genome Biol. 2024; 25(1):269.
PMID: 39402623 PMC: 11472465. DOI: 10.1186/s13059-024-03410-8.
C-ziptf: stable tensor factorization for zero-inflated multi-dimensional genomics data.
Chafamo D, Shanmugam V, Tokcan N BMC Bioinformatics. 2024; 25(1):323.
PMID: 39369208 PMC: 11456250. DOI: 10.1186/s12859-024-05886-4.
Spatiotemporal metabolomic approaches to the cancer-immunity panorama: a methodological perspective.
Xiao Y, Li Y, Zhao H Mol Cancer. 2024; 23(1):202.
PMID: 39294747 PMC: 11409752. DOI: 10.1186/s12943-024-02113-9.