» Articles » PMID: 36998245

Geographically Weighted Linear Combination Test for Gene-set Analysis of a Continuous Spatial Phenotype As Applied to Intratumor Heterogeneity

Overview
Specialty Cell Biology
Date 2023 Mar 31
PMID 36998245
Authors
Affiliations
Soon will be listed here.
Abstract

The impact of gene-sets on a spatial phenotype is not necessarily uniform across different locations of cancer tissue. This study introduces a computational platform, GWLCT, for combining gene set analysis with spatial data modeling to provide a new statistical test for location-specific association of phenotypes and molecular pathways in spatial single-cell RNA-seq data collected from an input tumor sample. The main advantage of GWLCT consists of an analysis beyond global significance, allowing the association between the gene-set and the phenotype to vary across the tumor space. At each location, the most significant linear combination is found using a geographically weighted shrunken covariance matrix and kernel function. Whether a fixed or adaptive bandwidth is determined based on a cross-validation cross procedure. Our proposed method is compared to the global version of linear combination test (LCT), bulk and random-forest based gene-set enrichment analyses using data created by the Visium Spatial Gene Expression technique on an invasive breast cancer tissue sample, as well as 144 different simulation scenarios. In an illustrative example, the new geographically weighted linear combination test, GWLCT, identifies the cancer hallmark gene-sets that are significantly associated at each location with the five spatially continuous phenotypic contexts in the tumors defined by different well-known markers of cancer-associated fibroblasts. Scan statistics revealed clustering in the number of significant gene-sets. A spatial heatmap of combined significance over all selected gene-sets is also produced. Extensive simulation studies demonstrate that our proposed approach outperforms other methods in the considered scenarios, especially when the spatial association increases. Our proposed approach considers the spatial covariance of gene expression to detect the most significant gene-sets affecting a continuous phenotype. It reveals spatially detailed information in tissue space and can thus play a key role in understanding the contextual heterogeneity of cancer cells.

References
1.
Vogelstein B, Kinzler K . The Path to Cancer --Three Strikes and You're Out. N Engl J Med. 2015; 373(20):1895-8. DOI: 10.1056/NEJMp1508811. View

2.
Puram S, Tirosh I, Parikh A, Patel A, Yizhak K, Gillespie S . Single-Cell Transcriptomic Analysis of Primary and Metastatic Tumor Ecosystems in Head and Neck Cancer. Cell. 2017; 171(7):1611-1624.e24. PMC: 5878932. DOI: 10.1016/j.cell.2017.10.044. View

3.
Raj B, Wagner D, McKenna A, Pandey S, Klein A, Shendure J . Simultaneous single-cell profiling of lineages and cell types in the vertebrate brain. Nat Biotechnol. 2018; 36(5):442-450. PMC: 5938111. DOI: 10.1038/nbt.4103. View

4.
Stahl P, Salmen F, Vickovic S, Lundmark A, Fernandez Navarro J, Magnusson J . Visualization and analysis of gene expression in tissue sections by spatial transcriptomics. Science. 2016; 353(6294):78-82. DOI: 10.1126/science.aaf2403. View

5.
Chien C, Chang C, Tsai C, Chen J . MAVTgsa: an R package for gene set (enrichment) analysis. Biomed Res Int. 2014; 2014:346074. PMC: 4101957. DOI: 10.1155/2014/346074. View