Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis
Overview
Authors
Affiliations
Analysis of single-cell multiomics datasets is a novel topic and is considerably challenging because such datasets contain a large number of features with numerous missing values. In this study, we implemented a recently proposed tensor-decomposition (TD)-based unsupervised feature extraction (FE) technique to address this difficult problem. The technique can successfully integrate single-cell multiomics data composed of gene expression, DNA methylation, and accessibility. Although the last two have large dimensions, as many as ten million, containing only a few percentage of nonzero values, TD-based unsupervised FE can integrate three omics datasets without filling in missing values. Together with UMAP, which is used frequently when embedding single-cell measurements into two-dimensional space, TD-based unsupervised FE can produce two-dimensional embedding coincident with classification when integrating single-cell omics datasets. Genes selected based on TD-based unsupervised FE are also significantly related to reasonable biological roles.
Choi J, Park C, Chae H PeerJ. 2024; 12:e17006.
PMID: 38426141 PMC: 10903350. DOI: 10.7717/peerj.17006.
Taguchi Y, Turki T Front Artif Intell. 2023; 6:1237542.
PMID: 37719083 PMC: 10503044. DOI: 10.3389/frai.2023.1237542.
Fujita S, Karasawa Y, Hironaka K, Taguchi Y, Kuroda S PLoS One. 2023; 18(2):e0281594.
PMID: 36791130 PMC: 9931158. DOI: 10.1371/journal.pone.0281594.