» Articles » PMID: 38834657

Clustering Single-cell RNA Sequencing Data Via Iterative Smoothing and Self-supervised Discriminative Embedding

Overview
Journal Oncogene
Date 2024 Jun 4
PMID 38834657
Authors
Affiliations
Soon will be listed here.
Abstract

Single-cell transcriptome sequencing (scRNA-seq) is a high-throughput technique used to study gene expression at the single-cell level. Clustering analysis is a commonly used method in scRNA-seq data analysis, helping researchers identify cell types and uncover interactions between cells. However, the choice of a robust similarity metric in the clustering procedure is still an open challenge due to the complex underlying structures of the data and the inherent noise in data acquisition. Here, we propose a deep clustering method for scRNA-seq data called scRISE (scRNA-seq Iterative Smoothing and self-supervised discriminative Embedding model) to resolve this challenge. The model consists of two main modules: an iterative smoothing module based on graph autoencoders designed to denoise the data and refine the pairwise similarity in turn to gradually incorporate cell structural features and enrich the data information; and a self-supervised discriminative embedding module with adaptive similarity threshold for partitioning samples into correct clusters. Our approach has shown improved quality of data representation and clustering on seventeen scRNA-seq datasets against a number of state-of-the-art deep learning clustering methods. Furthermore, utilizing the scRISE method in biological analysis against the HNSCC dataset has unveiled 62 informative genes, highlighting their potential roles as therapeutic targets and biomarkers.

Citing Articles

scDRMAE: integrating masked autoencoder with residual attention networks to leverage omics feature dependencies for accurate cell clustering.

Zhang T, Zhang H, Ren J, Wu Z, Zhao Z, Wang G Bioinformatics. 2024; 40(10).

PMID: 39404795 PMC: 11513018. DOI: 10.1093/bioinformatics/btae599.

References
1.
Choi J, Yarishkin O, Kim E, Bae Y, Kim A, Kim S . TWIK-1/TASK-3 heterodimeric channels contribute to the neurotensin-mediated excitation of hippocampal dentate gyrus granule cells. Exp Mol Med. 2018; 50(11):1-13. PMC: 6230555. DOI: 10.1038/s12276-018-0172-4. View

2.
Wen L, Li G, Huang T, Geng W, Pei H, Yang J . Single-cell technologies: From research to application. Innovation (Camb). 2022; 3(6):100342. PMC: 9637996. DOI: 10.1016/j.xinn.2022.100342. View

3.
Eraslan G, Drokhlyansky E, Anand S, Fiskin E, Subramanian A, Slyper M . Single-nucleus cross-tissue molecular reference maps toward understanding disease gene function. Science. 2022; 376(6594):eabl4290. PMC: 9383269. DOI: 10.1126/science.abl4290. View

4.
Iacono G, Mereu E, Guillaumet-Adkins A, Corominas R, Cusco I, Rodriguez-Esteban G . bigSCale: an analytical framework for big-scale single-cell data. Genome Res. 2018; 28(6):878-890. PMC: 5991513. DOI: 10.1101/gr.230771.117. View

5.
Pal B, Chen Y, Vaillant F, Jamieson P, Gordon L, Rios A . Construction of developmental lineage relationships in the mouse mammary gland by single-cell RNA profiling. Nat Commun. 2017; 8(1):1627. PMC: 5696379. DOI: 10.1038/s41467-017-01560-x. View