
Deep Learning Powered Single-cell Clustering Framework with Enhanced Accuracy and Stability

Overview
Journal Sci Rep
Date 2025 Feb 3
PMID 39900656
Abstract

Single-cell RNA sequencing (scRNA-seq) has revolutionized research on cellular diversity. Unsupervised clustering, a key technique in this field, identifies distinct cell types within a population. Graph-based deep clustering methods have shown promise in preserving the structural relationships between cells (nodes) in the data. However, these methods often neglect the inherent distribution of nodes in the graph, which leads to incomplete representations of cell populations. In addition, conventional graph convolutional networks (GCNs) can suffer from oversmoothing, a phenomenon in which the network loses the ability to differentiate between samples with similar expression profiles. To address these limitations, we propose scG-cluster, a deep structural clustering method with two key innovations. (1) Dual-topology adjacency graph: scG-cluster integrates information about node distribution into the traditional adjacency graph used by GCNs, enriching the graph representation by capturing the spatial relationships between cells in addition to their pairwise similarities. (2) Dual-topology adaptive graph convolutional network (TAGCN): the framework employs a TAGCN architecture with residual concatenation. The network uses an attention mechanism to dynamically weight features within the graph, focusing on those most informative for clustering, while the residual connections combat oversmoothing so that the network can still distinguish subtle differences in cell expression profiles. Furthermore, scG-cluster iteratively refines the clustering centers, which improves the stability and accuracy of the final cluster assignments. Extensive evaluations on six diverse scRNA-seq datasets demonstrate that scG-cluster consistently outperforms existing state-of-the-art methods in both clustering accuracy and scalability. Ablation studies validate the significant contributions of both the residual connections and the attention mechanism to the overall performance of the model. The source code for scG-cluster is publicly available at https://github.com/xixi-wq/scG-cluster.
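
The oversmoothing problem and the residual idea mentioned in the abstract can be illustrated with a small, dependency-free sketch. This is not the scG-cluster implementation: the toy graph (two triangles of "cells" joined by one edge), the row-normalized propagation without learned weights or attention, and every function name below are assumptions made purely for illustration. The sketch stacks many propagation steps and measures how far apart node representations remain, with and without concatenating the input features back in at each layer.

```python
def row_normalize(adj):
    """Add self-loops, then divide each row by its sum: D^-1 (A + I)."""
    n = len(adj)
    with_loops = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
                  for i in range(n)]
    return [[value / sum(row) for value in row] for row in with_loops]


def matmul(a, b):
    """Dense matrix product for lists of lists."""
    columns = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in columns]
            for row in a]


def run_layers(a_norm, h0, n_layers, residual):
    """Apply n_layers weight-free propagation steps H <- A_norm H.

    With residual=True, the input features h0 are concatenated onto every
    node's row after each step (a simplified residual concatenation).
    """
    h = h0
    for _ in range(n_layers):
        h = matmul(a_norm, h)
        if residual:
            h = [smoothed + original for smoothed, original in zip(h, h0)]
    return h


def spread(h):
    """Largest per-feature gap between any two nodes (0 means identical)."""
    return max(abs(x - y) for col in zip(*h) for x in col for y in col)


# Hypothetical toy graph: cells 0-2 form one group, cells 3-5 another,
# and a single edge (2, 3) links the two groups.
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
adjacency = [[0.0] * 6 for _ in range(6)]
for i, j in edges:
    adjacency[i][j] = adjacency[j][i] = 1.0

# Each group starts with its own one-hot "expression" signature.
features = [[1.0, 0.0], [1.0, 0.0], [1.0, 0.0],
            [0.0, 1.0], [0.0, 1.0], [0.0, 1.0]]

p = row_normalize(adjacency)
print("spread without residual:", spread(run_layers(p, features, 100, False)))
print("spread with residual:   ", spread(run_layers(p, features, 100, True)))
```

In this toy setting, repeated averaging over neighbors drives all node representations toward the same vector, whereas the concatenated input features keep the two groups separable regardless of depth. A trained network would additionally apply weight matrices and nonlinearities at each layer, which this sketch omits.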
