» Articles » PMID: 38605638

SEGCECO: Subgraph Embedding of Gene Expression Matrix for Prediction of CEll-cell COmmunication

Overview
Journal Brief Bioinform
Specialty Biology
Date 2024 Apr 12
PMID 38605638
Authors
Affiliations
Soon will be listed here.
Abstract

Recent advances in single-cell RNA sequencing technology have eased analyses of signaling networks of cells. Recently, cell-cell interaction has been studied based on various link prediction approaches on graph-structured data. These approaches have assumptions about the likelihood of node interaction, thus showing high performance for only some specific networks. Subgraph-based methods have solved this problem and outperformed other approaches by extracting local subgraphs from a given network. In this work, we present a novel method, called Subgraph Embedding of Gene expression matrix for prediction of CEll-cell COmmunication (SEGCECO), which uses an attributed graph convolutional neural network to predict cell-cell communication from single-cell RNA-seq data. SEGCECO captures the latent and explicit attributes of undirected, attributed graphs constructed from the gene expression profile of individual cells. High-dimensional and sparse single-cell RNA-seq data make converting the data into a graphical format a daunting task. We successfully overcome this limitation by applying SoptSC, a similarity-based optimization method in which the cell-cell communication network is built using a cell-cell similarity matrix which is learned from gene expression data. We performed experiments on six datasets extracted from the human and mouse pancreas tissue. Our comparative analysis shows that SEGCECO outperforms latent feature-based approaches, and the state-of-the-art method for link prediction, WLNM, with 0.99 ROC and 99% prediction accuracy. The datasets can be found at https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE84133 and the code is publicly available at Github https://github.com/sheenahora/SEGCECO and Code Ocean https://codeocean.com/capsule/8244724/tree.

References
1.
Ben-Kiki O, Bercovich A, Lifshitz A, Tanay A . Metacell-2: a divide-and-conquer metacell algorithm for scalable scRNA-seq analysis. Genome Biol. 2022; 23(1):100. PMC: 9019975. DOI: 10.1186/s13059-022-02667-1. View

2.
Wolf F, Angerer P, Theis F . SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018; 19(1):15. PMC: 5802054. DOI: 10.1186/s13059-017-1382-0. View

3.
Boisset J, Vivie J, Grun D, Muraro M, Lyubimova A, van Oudenaarden A . Mapping the physical network of cellular interactions. Nat Methods. 2018; 15(7):547-553. DOI: 10.1038/s41592-018-0009-z. View

4.
Jin S, Guerrero-Juarez C, Zhang L, Chang I, Ramos R, Kuan C . Inference and analysis of cell-cell communication using CellChat. Nat Commun. 2021; 12(1):1088. PMC: 7889871. DOI: 10.1038/s41467-021-21246-9. View

5.
Amezquita R, Lun A, Becht E, Carey V, Carpp L, Geistlinger L . Orchestrating single-cell analysis with Bioconductor. Nat Methods. 2019; 17(2):137-145. PMC: 7358058. DOI: 10.1038/s41592-019-0654-x. View