
GraphCpG: Imputation of Single-cell Methylomes Based on Locus-aware Neighboring Subgraphs

Overview
Journal: Bioinformatics
Specialty: Biology
Date: 2023 Aug 30
PMID: 37647650
Abstract

Motivation: Single-cell DNA methylation sequencing assays DNA methylation at single-cell resolution. However, incomplete coverage compromises downstream analyses, underscoring the importance of imputation techniques. As the number of cells in recent large datasets grows, scalable and efficient imputation models are critical for addressing this sparsity in genome-wide analyses.

Results: We propose a novel graph-based deep learning approach that imputes methylation matrices from locus-aware neighboring subgraphs, with locus-aware encoding oriented to a single cell type. Using only the CpG methylation matrix, GraphCpG outperforms previous methods on datasets containing hundreds of cells or more and achieves competitive performance on smaller datasets; the subgraphs of predicted sites can be visualized as retrievable bipartite graphs. Beyond imputation performance that improves with increasing cell number, it significantly reduces computation time and improves downstream analysis.
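
To make the idea of a neighboring subgraph concrete, the following is a minimal Python sketch and not the GraphCpG implementation. It treats a sparse cell-by-site methylation matrix as a bipartite graph whose edges are the observed entries, then extracts the subgraph around one target entry. The function names, the fixed `window` of flanking sites, and the mean-based predictor are all illustrative assumptions; the published method uses a graph-based deep learning model in place of the simple average shown here.

```python
from typing import List, Optional, Tuple

# Rows are cells, columns are CpG sites in genomic order (an assumption).
# 1 = methylated, 0 = unmethylated, None = site not covered in that cell.
Matrix = List[List[Optional[int]]]
Edge = Tuple[int, int, int]  # (cell index, site index, methylation state)


def neighboring_subgraph(
    meth: Matrix, cell: int, site: int, window: int = 1
) -> Tuple[List[int], List[int], List[Edge]]:
    """Extract a bipartite (cell, site) subgraph around one target entry.

    Hypothetical helper. Site nodes are the target site plus up to `window`
    flanking sites on each side. Cell nodes are the target cell plus every
    cell observed at any of those sites. Edges are the observed entries,
    excluding the target entry so its label cannot leak into the input.
    """
    n_cells, n_sites = len(meth), len(meth[0])
    lo, hi = max(0, site - window), min(n_sites, site + window + 1)
    sites = list(range(lo, hi))
    cells = [
        c for c in range(n_cells)
        if c == cell or any(meth[c][s] is not None for s in sites)
    ]
    edges = [
        (c, s, meth[c][s])
        for c in cells
        for s in sites
        if meth[c][s] is not None and (c, s) != (cell, site)
    ]
    return cells, sites, edges


def mean_baseline(edges: List[Edge], site: int) -> Optional[float]:
    """Naive stand-in predictor: mean observed state at the target site."""
    vals = [v for _, s, v in edges if s == site]
    return sum(vals) / len(vals) if vals else None


# Toy matrix: 3 cells x 4 CpG sites with missing entries.
meth: Matrix = [
    [1, None, 0, 1],
    [1, 1, None, 0],
    [None, 1, 1, None],
]

# Impute the uncovered entry at cell 0, site 1.
cells, sites, edges = neighboring_subgraph(meth, cell=0, site=1, window=1)
prediction = mean_baseline(edges, site=1)
```

In this toy case the subgraph contains all three cells and sites 0 to 2, and the prediction for the missing entry is the average of the two other cells observed at site 1. A learned model would instead consume the node and edge lists, which is also what makes the subgraph retrievable for visualization.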

Availability And Implementation: The source code is freely available at https://github.com/yuzhong-deng/graphcpg.git.

Citing Articles

Data-Driven Identification of Early Cancer-Associated Genes via Penalized Trans-Dimensional Hidden Markov Models.

Hajebi Khaniki S, Shokoohi F Biomolecules. 2025; 15(2).

PMID: 40001597 PMC: 11853217. DOI: 10.3390/biom15020294.
