» Articles » PMID: 38902799

Evaluation of Somatic Copy Number Variation Detection by NGS Technologies and Bioinformatics Tools on a Hyper-diploid Cancer Genome

Abstract

Background: Copy number variation (CNV) is a key genetic characteristic for cancer diagnostics and can be used as a biomarker for the selection of therapeutic treatments. Using data sets established in our previous study, we benchmark the performance of cancer CNV calling by six most recent and commonly used software tools on their detection accuracy, sensitivity, and reproducibility. In comparison to other orthogonal methods, such as microarray and Bionano, we also explore the consistency of CNV calling across different technologies on a challenging genome.

Results: While consistent results are observed for copy gain, loss, and loss of heterozygosity (LOH) calls across sequencing centers, CNV callers, and different technologies, variation of CNV calls are mostly affected by the determination of genome ploidy. Using consensus results from six CNV callers and confirmation from three orthogonal methods, we establish a high confident CNV call set for the reference cancer cell line (HCC1395).

Conclusions: NGS technologies and current bioinformatics tools can offer reliable results for detection of copy gain, loss, and LOH. However, when working with a hyper-diploid genome, some software tools can call excessive copy gain or loss due to inaccurate assessment of genome ploidy. With performance matrices on various experimental conditions, this study raises awareness within the cancer research community for the selection of sequencing platforms, sample preparation, sequencing coverage, and the choice of CNV detection tools.

References
1.
Xiao C, Chen Z, Chen W, Padilla C, Colgan M, Wu W . Personalized genome assembly for accurate cancer somatic mutation discovery using tumor-normal paired reference samples. Genome Biol. 2022; 23(1):237. PMC: 9648002. DOI: 10.1186/s13059-022-02803-x. View

2.
Zaccaria S, Raphael B . Accurate quantification of copy-number aberrations and whole-genome duplications in multi-sample tumor sequencing data. Nat Commun. 2020; 11(1):4301. PMC: 7468132. DOI: 10.1038/s41467-020-17967-y. View

3.
Xiao W, Ren L, Chen Z, Fang L, Zhao Y, Lack J . Toward best practice in cancer mutation detection with whole-genome and whole-exome sequencing. Nat Biotechnol. 2021; 39(9):1141-1150. PMC: 8506910. DOI: 10.1038/s41587-021-00994-5. View

4.
Raine K, Van Loo P, Wedge D, Jones D, Menzies A, Butler A . ascatNgs: Identifying Somatically Acquired Copy-Number Alterations from Whole-Genome Sequencing Data. Curr Protoc Bioinformatics. 2016; 56:15.9.1-15.9.17. PMC: 6097604. DOI: 10.1002/cpbi.17. View

5.
Newell F, Wilmott J, Johansson P, Nones K, Addala V, Mukhopadhyay P . Whole-genome sequencing of acral melanoma reveals genomic complexity and diversity. Nat Commun. 2020; 11(1):5259. PMC: 7567804. DOI: 10.1038/s41467-020-18988-3. View