» Articles » PMID: 38290978

A Statistical Learning Method for Simultaneous Copy Number Estimation and Subclone Clustering with Single-cell Sequencing Data

Overview
Journal Genome Res
Specialty Genetics
Date 2024 Jan 30
PMID 38290978
Authors
Affiliations
Soon will be listed here.
Abstract

The availability of single-cell sequencing (SCS) enables us to assess intra-tumor heterogeneity and identify cellular subclones without the confounding effect of mixed cells. Copy number aberrations (CNAs) have been commonly used to identify subclones in SCS data using various clustering methods, as cells comprising a subpopulation are found to share a genetic profile. However, currently available methods may generate spurious results (e.g., falsely identified variants) in the procedure of CNA detection, thereby diminishing the accuracy of subclone identification within a large, complex cell population. In this study, we developed a subclone clustering method based on a fused lasso model, referred to as FLCNA, which can simultaneously detect CNAs in single-cell DNA sequencing (scDNA-seq) data. Spike-in simulations were conducted to evaluate the clustering and CNA detection performance of FLCNA, benchmarking it against existing copy number estimation methods (SCOPE, HMMcopy) in combination with commonly used clustering methods. Application of FLCNA to a scDNA-seq data set of breast cancer revealed different genomic variation patterns in neoadjuvant chemotherapy-treated samples and pretreated samples. We show that FLCNA is a practical and powerful method for subclone identification and CNA detection with scDNA-seq data.

Citing Articles

HapCNV: A Comprehensive Framework for CNV Detection in Low-input DNA Sequencing Data.

Yu X, Qin F, Liu S, Brown N, Lu Q, Cai G bioRxiv. 2025; .

PMID: 39763944 PMC: 11702719. DOI: 10.1101/2024.12.19.629494.


Mutational Landscapes of Normal Skin and Their Potential Implications in the Development of Skin Cancer: A Comprehensive Narrative Review.

Riew T, Kim Y J Clin Med. 2024; 13(16).

PMID: 39200957 PMC: 11355262. DOI: 10.3390/jcm13164815.

References
1.
Cariati F, Borrillo F, Shankar V, Nunziato M, DArgenio V, Tomaiuolo R . Dissecting Intra-Tumor Heterogeneity by the Analysis of Copy Number Variations in Single Cells: The Neuroblastoma Case Study. Int J Mol Sci. 2019; 20(4). PMC: 6412524. DOI: 10.3390/ijms20040893. View

2.
Baslan T, Hicks J . Unravelling biology and shifting paradigms in cancer with single-cell sequencing. Nat Rev Cancer. 2017; 17(9):557-569. DOI: 10.1038/nrc.2017.58. View

3.
Fischer K, Pflugfelder G . Putative Breast Cancer Driver Mutations in TBX3 Cause Impaired Transcriptional Repression. Front Oncol. 2015; 5:244. PMC: 4625211. DOI: 10.3389/fonc.2015.00244. View

4.
Deshwar A, Vembu S, Yung C, Jang G, Stein L, Morris Q . PhyloWGS: reconstructing subclonal composition and evolution from whole-genome sequencing of tumors. Genome Biol. 2015; 16:35. PMC: 4359439. DOI: 10.1186/s13059-015-0602-8. View

5.
Ha G, Roth A, Khattra J, Ho J, Yap D, Prentice L . TITAN: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data. Genome Res. 2014; 24(11):1881-93. PMC: 4216928. DOI: 10.1101/gr.180281.114. View