Visualizing the Spatial Gene Expression Organization in the Brain Through Non-linear Similarity Embeddings
The Allen Brain Atlases enable the study of spatially resolved, genome-wide gene expression patterns across the mammalian brain. Several exploratory studies have applied linear dimensionality reduction methods such as Principal Component Analysis (PCA) and classical Multi-Dimensional Scaling (cMDS) to gain insight into the spatial organization of these expression patterns. In this paper, we describe a non-linear embedding technique called Barnes-Hut Stochastic Neighbor Embedding (BH-SNE) that emphasizes the local similarity structure of high-dimensional data points. By applying BH-SNE to the gene expression data from the Allen Brain Atlases, we demonstrate that the 2D non-linear embeddings are consistent between the sagittal and coronal mouse brain atlases and across six human brains. In addition, we quantitatively show that BH-SNE maps separate neuroanatomical regions better than PCA and cMDS. Finally, we assess the effect of higher-order principal components on the global structure of the BH-SNE similarity maps. Based on our observations, we conclude that BH-SNE maps, with or without prior PCA-based dimensionality reduction, provide comprehensive and intuitive insights into both the local and global spatial transcriptome structure of the human and mouse Allen Brain Atlases.
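As a rough illustration of the two variants mentioned above (BH-SNE applied directly, and BH-SNE applied after PCA-based reduction), the following minimal sketch uses scikit-learn's `TSNE` with `method="barnes_hut"`. It is not the authors' pipeline: the helper name `embed_bh_sne`, the synthetic three-cluster data standing in for an expression matrix, and all parameter values (perplexity, number of principal components, seed) are assumptions chosen only for demonstration.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE


def embed_bh_sne(X, n_pcs=None, perplexity=30.0, seed=0):
    """Hypothetical helper: 2D Barnes-Hut t-SNE map of the rows of X.

    If n_pcs is given, X is first projected onto its leading n_pcs
    principal components; otherwise BH-SNE runs on X directly.
    """
    X = np.asarray(X, dtype=float)
    if n_pcs is not None:
        # Optional linear pre-reduction step (assumed value of n_pcs).
        X = PCA(n_components=n_pcs, random_state=seed).fit_transform(X)
    tsne = TSNE(
        n_components=2,
        perplexity=perplexity,
        method="barnes_hut",  # Barnes-Hut approximation of the gradient
        angle=0.5,            # speed/accuracy trade-off of the approximation
        init="pca",
        random_state=seed,
    )
    return tsne.fit_transform(X)


# Synthetic stand-in for an expression matrix (samples x genes):
# three groups of 50 samples each, 40 features. Not real atlas data.
rng = np.random.default_rng(0)
centers = rng.normal(scale=5.0, size=(3, 40))
labels = np.repeat(np.arange(3), 50)
X = centers[labels] + rng.normal(size=(150, 40))

map_raw = embed_bh_sne(X, perplexity=20.0)            # without prior PCA
map_pca = embed_bh_sne(X, n_pcs=10, perplexity=20.0)  # with prior PCA
```

Each returned array has one 2D coordinate per sample, so the two maps can be plotted side by side and colored by `labels` (or, for atlas data, by neuroanatomical region) to compare them visually.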