
scEGG: an Exogenous Gene-guided Clustering Method for Single-cell Transcriptomic Data

Overview
Journal Brief Bioinform
Specialty Biology
Date 2024 Sep 30
PMID 39344711
Abstract

Single-cell data analysis has advanced considerably in recent years, particularly in the development of clustering methods. Even so, most algorithms still analyze only the provided single-cell matrix data. In medical contexts, however, single-cell data are often accompanied by a wealth of exogenous information, such as gene networks. Overlooking this information can cause information loss and yield clustering outcomes with little clinical relevance. To address this limitation, we introduce a deep clustering method for single-cell data that leverages exogenous gene information to generate discriminative cell representations. Specifically, an attention-enhanced graph autoencoder captures topological signal patterns among cells. Concurrently, a random walk on an exogenous protein-protein interaction network yields gene embeddings. Finally, clustering integrates and reconstructs the gene-cell cooperative embeddings, producing a discriminative representation. Extensive experiments demonstrate the effectiveness of the proposed method. This research provides enhanced insights into the characteristics of cells, laying a foundation for the early diagnosis and treatment of diseases. The datasets and code are publicly available at https://github.com/DayuHuu/scEGG.
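As a rough illustration of the random-walk step mentioned in the abstract, the sketch below generates uniform random walks over a small protein-protein interaction graph. It is not the authors' implementation: the function name `random_walks`, its parameters, and the toy network `toy_ppi` with its `GENE_*` labels are all hypothetical, and the abstract does not specify the walk strategy or how walks are turned into embeddings.

```python
import random


def random_walks(adjacency, walk_length, walks_per_node, seed=0):
    """Generate uniform random walks over an undirected graph.

    adjacency: dict mapping each node to a list of its neighbours.
    Returns a list of walks, each a list of node names.
    """
    rng = random.Random(seed)  # local generator so runs are reproducible
    walks = []
    for start in sorted(adjacency):
        for _ in range(walks_per_node):
            walk = [start]
            while len(walk) < walk_length:
                neighbours = adjacency[walk[-1]]
                if not neighbours:  # isolated node: stop the walk early
                    break
                walk.append(rng.choice(neighbours))
            walks.append(walk)
    return walks


# Hypothetical toy PPI network; the edges are illustrative, not curated data.
toy_ppi = {
    "GENE_A": ["GENE_B", "GENE_C"],
    "GENE_B": ["GENE_A", "GENE_C"],
    "GENE_C": ["GENE_A", "GENE_B", "GENE_D"],
    "GENE_D": ["GENE_C"],
}

walks = random_walks(toy_ppi, walk_length=5, walks_per_node=3)
```

In walk-based embedding methods generally, sequences like these are treated as "sentences" and passed to a skip-gram model so that genes appearing in similar network neighbourhoods receive similar vectors. Whether scEGG follows exactly that recipe is not stated in the abstract.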
