
scMerge Leverages Factor Analysis, Stable Expression, and Pseudoreplication to Merge Multiple Single-cell RNA-seq Datasets

Overview
Specialty: Science
Date: 2019 Apr 28
PMID: 31028141
Citations: 82
Abstract

Concerted examination of multiple collections of single-cell RNA sequencing (RNA-seq) data promises further biological insights that cannot be uncovered with individual datasets. Here we present scMerge, an algorithm that integrates multiple single-cell RNA-seq datasets using factor analysis of stably expressed genes and pseudoreplicates across datasets. Using a large collection of public datasets, we benchmark scMerge against published methods and demonstrate that it consistently provides improved cell type separation by removing unwanted factors; scMerge can also enhance biological discovery through robust data integration, which we show through the inference of development trajectory in a liver dataset collection.
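The abstract names the ingredients (factor analysis, stably expressed genes, pseudoreplicates) but not the exact procedure. As a rough illustration only, the sketch below shows one well-known way such ingredients can be combined: an RUV-III-style correction, in which stably expressed genes serve as negative controls and pseudoreplicate sets define which cells should share the same biology. That scMerge works exactly this way is an assumption here, and the function name `ruv3_correct`, its arguments, and the toy data are all hypothetical.

```python
import numpy as np


def ruv3_correct(Y, M, ctl, k):
    """RUV-III-style removal of k unwanted factors (illustrative sketch).

    Y   : (cells x genes) log-expression matrix.
    M   : (cells x replicate sets) 0/1 matrix; each row has exactly one 1,
          marking the (pseudo)replicate set the cell belongs to.
    ctl : boolean mask over genes marking stably expressed (control) genes.
    k   : number of unwanted factors to estimate and remove.
    """
    Y = np.asarray(Y, dtype=float)
    M = np.asarray(M, dtype=float)
    # Remove replicate-set means: what is left within a set is unwanted variation.
    Y0 = Y - M @ np.linalg.pinv(M) @ Y
    # Leading left singular vectors of the residuals span the unwanted factors.
    U, _, _ = np.linalg.svd(Y0, full_matrices=False)
    alpha = U[:, :k].T @ Y                      # (k x genes) factor loadings
    alpha_c = alpha[:, ctl]                     # loadings on control genes only
    # Regress control-gene expression on the loadings to get per-cell factor values.
    W = Y[:, ctl] @ alpha_c.T @ np.linalg.inv(alpha_c @ alpha_c.T)
    return Y - W @ alpha


# Toy data (assumed, noise-free): 3 pseudoreplicate sets, each seen in 2 batches.
rng = np.random.default_rng(0)
M = np.kron(np.eye(3), np.ones((2, 1)))         # 6 cells x 3 replicate sets
biology = rng.normal(size=(3, 8))
biology[:, :4] = 0.0                            # first 4 genes are stable controls
batch = np.tile([[0.0], [1.0]], (3, 1))         # 6 x 1 batch indicator
loadings = rng.normal(size=(1, 8))
Y = M @ biology + batch @ loadings
ctl = np.arange(8) < 4
corrected = ruv3_correct(Y, M, ctl, k=1)
```

In this noise-free toy case the batch term is removed and `corrected` matches `M @ biology` up to floating-point error; with real data the result depends on the choice of `k`, the quality of the control genes, and how the pseudoreplicate sets are built.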

Citing Articles

Single-Cell Sequencing: Genomic and Transcriptomic Approaches in Cancer Cell Biology.

Ortega-Batista A, Jaen-Alvarado Y, Moreno-Labrador D, Gomez N, Garcia G, Guerrero E. Int J Mol Sci. 2025; 26(5).

PMID: 40076700 PMC: 11901077. DOI: 10.3390/ijms26052074.


Causal differential expression analysis under unmeasured confounders with causarray.

Du J, Shen M, Mathys H, Roeder K. bioRxiv. 2025.

PMID: 39975097 PMC: 11838442. DOI: 10.1101/2025.01.30.635593.


Spatially Resolved Multiomics: Data Analysis from Monoomics to Multiomics.

Huan C, Li J, Li Y, Zhao S, Yang Q, Zhang Z. BME Front. 2025; 6:0084.

PMID: 39810754 PMC: 11725630. DOI: 10.34133/bmef.0084.


Advances and applications in single-cell and spatial genomics.

Wang J, Ye F, Chai H, Jiang Y, Wang T, Ran X. Sci China Life Sci. 2025.

PMID: 39792333 DOI: 10.1007/s11427-024-2770-x.


Recovery of biological signals lost in single-cell batch integration with CellANOVA.

Zhang Z, Mathew D, Lim T, Mason K, Martinez C, Huang S. Nat Biotechnol. 2024.

PMID: 39592777 DOI: 10.1038/s41587-024-02463-1.

