» Articles » PMID: 39400541

Statistical Batch-aware Embedded Integration, Dimension Reduction, and Alignment for Spatial Transcriptomics

Overview
Journal Bioinformatics
Specialty Biology
Date 2024 Oct 14
PMID 39400541
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Spatial transcriptomics (ST) technologies provide richer insights into the molecular characteristics of cells by simultaneously measuring gene expression profiles and their relative locations. However, each slice can only contain limited biological variation, and since there are almost always non-negligible batch effects across different slices, integrating numerous slices to account for batch effects and locations is not straightforward. Performing multi-slice integration, dimensionality reduction, and other downstream analyses separately often results in suboptimal embeddings for technical artifacts and biological variations. Joint modeling integrating these steps can enhance our understanding of the complex interplay between technical artifacts and biological signals, leading to more accurate and insightful results.

Results: In this context, we propose a hierarchical hidden Markov random field model STADIA to reduce batch effects, extract common biological patterns across multiple ST slices, and simultaneously identify spatial domains. We demonstrate the effectiveness of STADIA using five datasets from different species (human and mouse), various organs (brain, skin, and liver), and diverse platforms (10x Visium, ST, and Slice-seqV2). STADIA can capture common tissue structures across multiple slices and preserve slice-specific biological signals. In addition, STADIA outperforms the other three competing methods (PRECAST, fastMNN, and Harmony) in terms of the balance between batch mixing and spatial domain identification, and it demonstrates the advantage of joint modeling when compared to STAGATE and GraphST.

Availability And Implementation: The source code implemented by R is available at https://github.com/zhanglabtools/STADIA and archived with version 1.01 on Zenodo https://zenodo.org/records/13637744.

References
1.
Elosua-Bayes M, Nieto P, Mereu E, Gut I, Heyn H . SPOTlight: seeded NMF regression to deconvolute spatial transcriptomics spots with single-cell transcriptomes. Nucleic Acids Res. 2021; 49(9):e50. PMC: 8136778. DOI: 10.1093/nar/gkab043. View

2.
Andersson A, Lundeberg J . sepal: identifying transcript profiles with spatial patterns by diffusion-based modeling. Bioinformatics. 2021; 37(17):2644-2650. PMC: 8428601. DOI: 10.1093/bioinformatics/btab164. View

3.
Ma Y, Zhou X . Spatially informed cell-type deconvolution for spatial transcriptomics. Nat Biotechnol. 2022; 40(9):1349-1359. PMC: 9464662. DOI: 10.1038/s41587-022-01273-7. View

4.
Zhao E, Stone M, Ren X, Guenthoer J, Smythe K, Pulliam T . Spatial transcriptomics at subspot resolution with BayesSpace. Nat Biotechnol. 2021; 39(11):1375-1384. PMC: 8763026. DOI: 10.1038/s41587-021-00935-2. View

5.
Zhou X, Dong K, Zhang S . Integrating spatial transcriptomics data across different conditions, technologies and developmental stages. Nat Comput Sci. 2024; 3(10):894-906. DOI: 10.1038/s43588-023-00528-w. View