GROM-RD: Resolving Genomic Biases to Improve Read Depth Detection of Copy Number Variants
Overview
Environmental Health
General Medicine
Affiliations
Amplifications or deletions of genome segments, known as copy number variants (CNVs), have been associated with many diseases. Read depth analysis of next-generation sequencing (NGS) is an essential method of detecting CNVs. However, genome read coverage is frequently distorted by various biases of NGS platforms, which reduce predictive capabilities of existing approaches. Additionally, the use of read depth tools has been somewhat hindered by imprecise breakpoint identification. We developed GROM-RD, an algorithm that analyzes multiple biases in read coverage to detect CNVs in NGS data. We found non-uniform variance across distinct GC regions after using existing GC bias correction methods and developed a novel approach to normalize such variance. Although complex and repetitive genome segments complicate CNV detection, GROM-RD adjusts for repeat bias and uses a two-pipeline masking approach to detect CNVs in complex and repetitive segments while improving sensitivity in less complicated regions. To overcome a typical weakness of RD methods, GROM-RD employs a CNV search using size-varying overlapping windows to improve breakpoint resolution. We compared our method to two widely used programs based on read depth methods, CNVnator and RDXplorer, and observed improved CNV detection and breakpoint accuracy for GROM-RD. GROM-RD is available at http://grigoriev.rutgers.edu/software/.
Zhou M, Dong J, Jiang H, Zhao Z, Yuan T Sci Rep. 2025; 15(1):3526.
PMID: 39875521 PMC: 11775105. DOI: 10.1038/s41598-025-88143-9.
On the core segmentation algorithms of copy number variation detection tools.
Zhang Y, Liu W, Duan J Brief Bioinform. 2024; 25(2).
PMID: 38340093 PMC: 10858679. DOI: 10.1093/bib/bbae022.
Zhang T, Dong J, Jiang H, Zhao Z, Zhou M, Yuan T Front Bioeng Biotechnol. 2022; 10:1000638.
PMID: 36532569 PMC: 9751350. DOI: 10.3389/fbioe.2022.1000638.
CONGA: Copy number variation genotyping in ancient genomes and low-coverage sequencing data.
Soylev A, Cokoglu S, Koptekin D, Alkan C, Somel M PLoS Comput Biol. 2022; 18(12):e1010788.
PMID: 36516232 PMC: 9873172. DOI: 10.1371/journal.pcbi.1010788.
KNNCNV: A K-Nearest Neighbor Based Method for Detection of Copy Number Variations Using NGS Data.
Xie K, Liu K, Alvi H, Chen Y, Wang S, Yuan X Front Cell Dev Biol. 2022; 9:796249.
PMID: 35004691 PMC: 8728060. DOI: 10.3389/fcell.2021.796249.