CNVkit: Genome-Wide Copy Number Detection and Visualization from Targeted DNA Sequencing
Overview
Germline copy number variants (CNVs) and somatic copy number alterations (SCNAs) are of significant importance in syndromic conditions and cancer. Massively parallel sequencing is increasingly used to infer copy number information from variations in the read depth in sequencing data. However, this approach has limitations in the case of targeted re-sequencing, which leaves gaps in coverage between the regions chosen for enrichment and introduces biases related to the efficiency of target capture and library preparation. We present a method for copy number detection, implemented in the software package CNVkit, that uses both the targeted reads and the nonspecifically captured off-target reads to infer copy number evenly across the genome. This combination achieves both exon-level resolution in targeted regions and sufficient resolution in the larger intronic and intergenic regions to identify copy number changes. In particular, we successfully inferred copy number at equivalent to 100-kilobase resolution genome-wide from a platform targeting as few as 293 genes. After normalizing read counts to a pooled reference, we evaluated and corrected for three sources of bias that explain most of the extraneous variability in the sequencing read depth: GC content, target footprint size and spacing, and repetitive sequences. We compared the performance of CNVkit to copy number changes identified by array comparative genomic hybridization. We packaged the components of CNVkit so that it is straightforward to use and provides visualizations, detailed reporting of significant features, and export options for integration into existing analysis pipelines. CNVkit is freely available from https://github.com/etal/cnvkit.
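The normalization and bias-correction steps described above can be illustrated with a minimal sketch. It is not CNVkit's implementation: the function names, the pseudocount, and the fixed GC strata are assumptions made for illustration, and only the GC-content correction is shown (target size and spacing, and repetitive sequences, are omitted). The idea is to build a per-bin pooled reference from normal samples, take the log2 ratio of a test sample against it, and then remove the residual trend associated with GC content.

```python
import math
from statistics import median


def pooled_reference(normal_samples):
    """Per-bin median read depth across several normal samples.

    normal_samples: list of per-sample depth lists, all covering the same bins.
    """
    return [median(bin_depths) for bin_depths in zip(*normal_samples)]


def log2_ratios(sample_depths, reference_depths, pseudocount=0.01):
    """Log2 ratio of each bin's sample depth to the pooled-reference depth.

    The pseudocount (a hypothetical default) avoids log(0) in empty bins.
    """
    return [
        math.log2((s + pseudocount) / (r + pseudocount))
        for s, r in zip(sample_depths, reference_depths)
    ]


def correct_gc_bias(ratios, gc_fractions, n_strata=10):
    """Subtract each GC stratum's median log2 ratio from the bins in it.

    Bins are grouped into n_strata equal-width GC intervals. This is a crude
    stand-in for a smoothed trend fit; with few bins per stratum it can also
    remove real copy number signal.
    """
    strata = [min(int(gc * n_strata), n_strata - 1) for gc in gc_fractions]
    by_stratum = {}
    for stratum, ratio in zip(strata, ratios):
        by_stratum.setdefault(stratum, []).append(ratio)
    offsets = {s: median(values) for s, values in by_stratum.items()}
    return [ratio - offsets[s] for ratio, s in zip(ratios, strata)]
```

In practice the same functions would be applied to both on-target and off-target bins, so that the corrected log2 ratios cover the genome between the enriched regions as well as within them.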