» Articles » PMID: 37066420

Integrating Multiple Single-cell Multi-omics Samples with Smmit

Overview
Journal bioRxiv
Date 2023 Apr 17
PMID 37066420
Authors
Affiliations
Soon will be listed here.
Abstract

Multi-sample single-cell multi-omics datasets, which simultaneously measure multiple data modalities in the same cells across multiple samples, facilitate the study of gene expression, gene regulatory activities, and protein abundances on a population scale. We developed Smmit, a computational method for integrating data both across samples and modalities. Compared to existing methods, Smmit more effectively removes batch effects while preserving relevant biological information, resulting in superior integration outcomes. Additionally, Smmit is more computationally efficient and builds upon existing computational pipelines, requiring minimal effort for implementation. Smmit is an R software package that is freely available on Github: https://github.com/zji90/Smmit.

References
1.
Luecken M, Buttner M, Chaichoompu K, Danese A, Interlandi M, Mueller M . Benchmarking atlas-level data integration in single-cell genomics. Nat Methods. 2021; 19(1):41-50. PMC: 8748196. DOI: 10.1038/s41592-021-01336-8. View

2.
Stuart T, Srivastava A, Madad S, Lareau C, Satija R . Single-cell chromatin state analysis with Signac. Nat Methods. 2021; 18(11):1333-1341. PMC: 9255697. DOI: 10.1038/s41592-021-01282-5. View

3.
Ma S, Zhang B, LaFave L, Earl A, Chiang Z, Hu Y . Chromatin Potential Identified by Shared Single-Cell Profiling of RNA and Chromatin. Cell. 2020; 183(4):1103-1116.e20. PMC: 7669735. DOI: 10.1016/j.cell.2020.09.056. View

4.
Argelaguet R, Arnol D, Bredikhin D, Deloro Y, Velten B, Marioni J . MOFA+: a statistical framework for comprehensive integration of multi-modal single-cell data. Genome Biol. 2020; 21(1):111. PMC: 7212577. DOI: 10.1186/s13059-020-02015-1. View

5.
. Year in review 2023. Nat Methods. 2024; 21(1):1-2. DOI: 10.1038/s41592-023-02158-6. View