» Articles » PMID: 34573424

Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis

Overview
Journal Genes (Basel)
Publisher MDPI
Date 2021 Sep 28
PMID 34573424
Citations 3
Authors
Affiliations
Soon will be listed here.
Abstract

Analysis of single-cell multiomics datasets is a novel topic and is considerably challenging because such datasets contain a large number of features with numerous missing values. In this study, we implemented a recently proposed tensor-decomposition (TD)-based unsupervised feature extraction (FE) technique to address this difficult problem. The technique can successfully integrate single-cell multiomics data composed of gene expression, DNA methylation, and accessibility. Although the last two have large dimensions, as many as ten million, containing only a few percentage of nonzero values, TD-based unsupervised FE can integrate three omics datasets without filling in missing values. Together with UMAP, which is used frequently when embedding single-cell measurements into two-dimensional space, TD-based unsupervised FE can produce two-dimensional embedding coincident with classification when integrating single-cell omics datasets. Genes selected based on TD-based unsupervised FE are also significantly related to reasonable biological roles.

Citing Articles

moSCminer: a cell subtype classification framework based on the attention neural network integrating the single-cell multi-omics dataset on the cloud.

Choi J, Park C, Chae H PeerJ. 2024; 12:e17006.

PMID: 38426141 PMC: 10903350. DOI: 10.7717/peerj.17006.


Application note: TDbasedUFE and TDbasedUFEadv: bioconductor packages to perform tensor decomposition based unsupervised feature extraction.

Taguchi Y, Turki T Front Artif Intell. 2023; 6:1237542.

PMID: 37719083 PMC: 10503044. DOI: 10.3389/frai.2023.1237542.


Features extracted using tensor decomposition reflect the biological features of the temporal patterns of human blood multimodal metabolome.

Fujita S, Karasawa Y, Hironaka K, Taguchi Y, Kuroda S PLoS One. 2023; 18(2):e0281594.

PMID: 36791130 PMC: 9931158. DOI: 10.1371/journal.pone.0281594.

References
1.
Lee J, Hyeon D, Hwang D . Single-cell multiomics: technologies and data analysis methods. Exp Mol Med. 2020; 52(9):1428-1442. PMC: 8080692. DOI: 10.1038/s12276-020-0420-2. View

2.
Downs K, Martin G, Bishop J . Contrasting patterns of myc and N-myc expression during gastrulation of the mouse embryo. Genes Dev. 1989; 3(6):860-9. DOI: 10.1101/gad.3.6.860. View

3.
Huang D, Sherman B, Lempicki R . Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009; 4(1):44-57. DOI: 10.1038/nprot.2008.211. View

4.
Yu C, Cvetesic N, Hisler V, Gupta K, Ye T, Gazdag E . TBPL2/TFIIA complex establishes the maternal transcriptome through oocyte-specific promoter usage. Nat Commun. 2020; 11(1):6439. PMC: 7755920. DOI: 10.1038/s41467-020-20239-4. View

5.
Suzuki T, Abe K, Inoue A, Aoki F . Expression of c-MYC in nuclear speckles during mouse oocyte growth and preimplantation development. J Reprod Dev. 2009; 55(5):491-5. DOI: 10.1262/jrd.09-069a. View