
A Computational Framework Discovers New Copy Number Variants with Functional Importance

Overview
Journal PLoS One
Date 2011 Apr 12
PMID 21479260
Citations 10
Abstract

Structural variants that cause changes in copy number constitute an important component of genomic variability. They account for 0.7% of genomic differences between two individual genomes, of which copy number variants (CNVs) are the largest component. A recent population-based CNV study revealed the need for better characterization of CNVs, especially the small ones (<500 bp). We propose a three-step computational framework (Identification of germline Changes in Copy Number, or IgC2N) to discover and genotype germline CNVs. First, we detect candidate CNV loci by combining information across multiple samples without imposing restrictions on the number of coverage markers or on the variant size. Second, we fine-tune the detection of rare variants and infer the putative copy number classes for each locus. Last, for each variant we combine the relative distance between consecutive copy number classes with genetic information in a novel attempt to estimate the reference model bias. This computational approach is applied to genome-wide data from 1250 HapMap individuals. Novel variants were discovered and characterized in terms of size, minor allele frequency, type of polymorphism (gains, losses or both), and mechanism of formation. Using data generated for a subset of individuals by a 42 million marker platform, we validated the majority of the variants; the highest validation rate (66.7%) was for variants larger than 1 kb. Finally, we queried transcriptomic data from 129 individuals, determined by RNA-sequencing, as further validation and to assess the functional role of the new variants. We investigated the possible enrichment for the variants' regulatory effects and found that smaller variants (<1 kb) are more likely to regulate gene transcripts than larger variants (p-value = 2.04e-08).
Our results support the validity of the computational framework to detect novel variants relevant to disease susceptibility studies and provide evidence of the importance of genetic variants in regulatory network studies.
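
The second and third steps described above (inferring putative copy number classes per locus and measuring the distance between consecutive classes) can be illustrated with a minimal sketch. This is not the IgC2N implementation, which the abstract does not specify. It assumes a hypothetical input of one intensity log-ratio per sample at a single locus, and it swaps in a simple gap-based one-dimensional grouping with an arbitrary `min_gap` threshold. All function names and values are illustrative assumptions.

```python
from statistics import mean


def infer_copy_number_classes(log_ratios, min_gap=0.25):
    """Assign each sample a putative class index at one locus.

    Assumption: samples are sorted by log-ratio and a new class starts
    wherever two neighbouring values differ by more than min_gap.
    This is a stand-in for the (unspecified) class inference step.
    """
    order = sorted(range(len(log_ratios)), key=lambda i: log_ratios[i])
    labels = [0] * len(log_ratios)
    current = 0
    for prev, nxt in zip(order, order[1:]):
        if log_ratios[nxt] - log_ratios[prev] > min_gap:
            current += 1
        labels[nxt] = current
    return labels


def class_centers(log_ratios, labels):
    """Mean log-ratio of each class, ordered from lowest to highest class."""
    if not labels:
        return []
    n_classes = max(labels) + 1
    return [
        mean(v for v, lab in zip(log_ratios, labels) if lab == c)
        for c in range(n_classes)
    ]


def consecutive_distances(centers):
    """Distance between each pair of neighbouring class centers."""
    return [b - a for a, b in zip(centers, centers[1:])]


# Hypothetical log-ratios for seven samples at one candidate locus.
log_ratios = [-1.0, -0.95, 0.02, -0.03, 0.0, 0.55, 0.6]
labels = infer_copy_number_classes(log_ratios)
centers = class_centers(log_ratios, labels)
distances = consecutive_distances(centers)
```

With these made-up values the samples fall into three groups (a loss-like, a reference-like and a gain-like class). How such distances would be combined with genetic information to estimate reference model bias is not described in the abstract and is left out of the sketch.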

Citing Articles

Copy Number Variations and Gene Mutations Identified by Multiplex Ligation-Dependent Probe Amplification in Romanian Chronic Lymphocytic Leukemia Patients.

Balla B, Tripon F, Candea M, Banescu C J Pers Med. 2023; 13(8).

PMID: 37623489 PMC: 10455273. DOI: 10.3390/jpm13081239.


A Mild PUM1 Mutation Is Associated with Adult-Onset Ataxia, whereas Haploinsufficiency Causes Developmental Delay and Seizures.

Gennarino V, Palmer E, McDonell L, Wang L, Adamski C, Koire A Cell. 2018; 172(5):924-936.e11.

PMID: 29474920 PMC: 5832058. DOI: 10.1016/j.cell.2018.02.006.


NUDT21-spanning CNVs lead to neuropsychiatric disease and altered MeCP2 abundance via alternative polyadenylation.

Gennarino V, Alcott C, Chen C, Chaudhury A, Gillentine M, Rosenfeld J Elife. 2015; 4.

PMID: 26312503 PMC: 4586391. DOI: 10.7554/eLife.10782.


In-silico identification and functional validation of allele-dependent AR enhancers.

Garritano S, Romanel A, Ciribilli Y, Bisio A, Gavoci A, Inga A Oncotarget. 2015; 6(7):4816-28.

PMID: 25693204 PMC: 4467117. DOI: 10.18632/oncotarget.3019.


Copy number variation detection using next generation sequencing read counts.

Wang H, Nettleton D, Ying K BMC Bioinformatics. 2014; 15:109.

PMID: 24731174 PMC: 4021345. DOI: 10.1186/1471-2105-15-109.

