» Articles » PMID: 21293372

Mapping Copy Number Variation by Population-scale Genome Sequencing

Abstract

Genomic structural variants (SVs) are abundant in humans, differing from other forms of variation in extent, origin and functional impact. Despite progress in SV characterization, the nucleotide resolution architecture of most SVs remains unknown. We constructed a map of unbalanced SVs (that is, copy number variants) based on whole genome DNA sequencing data from 185 human genomes, integrating evidence from complementary SV discovery approaches with extensive experimental validations. Our map encompassed 22,025 deletions and 6,000 additional SVs, including insertions and tandem duplications. Most SVs (53%) were mapped to nucleotide resolution, which facilitated analysing their origin and functional impact. We examined numerous whole and partial gene deletions with a genotyping approach and observed a depletion of gene disruptions amongst high frequency deletions. Furthermore, we observed differences in the size spectra of SVs originating from distinct formation mechanisms, and constructed a map of SV hotspots formed by common mechanisms. Our analytical framework and SV map serves as a resource for sequencing-based association studies.

Citing Articles

Long-read sequencing of 945 Han individuals identifies structural variants associated with phenotypic diversity and disease susceptibility.

Gong J, Sun H, Wang K, Zhao Y, Huang Y, Chen Q Nat Commun. 2025; 16(1):1494.

PMID: 39929826 PMC: 11811171. DOI: 10.1038/s41467-025-56661-9.


Advancing long-read nanopore genome assembly and accurate variant calling for rare disease detection.

Negi S, Stenton S, Berger S, Canigiula P, McNulty B, Violich I Am J Hum Genet. 2025; 112(2):428-449.

PMID: 39862869 PMC: 11866955. DOI: 10.1016/j.ajhg.2025.01.002.


Diversity and consequences of structural variation in the human genome.

Collins R, Talkowski M Nat Rev Genet. 2025; .

PMID: 39838028 DOI: 10.1038/s41576-024-00808-9.


Replication stress increases de novo CNVs across the malaria parasite genome.

Brown N, Luniewski A, Yu X, Warthan M, Liu S, Zulawinska J bioRxiv. 2025; .

PMID: 39803504 PMC: 11722320. DOI: 10.1101/2024.12.19.629492.


Structural polymorphism and diversity of human segmental duplications.

Jeong H, Dishuck P, Yoo D, Harvey W, Munson K, Lewis A Nat Genet. 2025; 57(2):390-401.

PMID: 39779957 PMC: 11821543. DOI: 10.1038/s41588-024-02051-8.


References
1.
Harrow J, Denoeud F, Frankish A, Reymond A, Chen C, Chrast J . GENCODE: producing a reference annotation for ENCODE. Genome Biol. 2006; 7 Suppl 1:S4.1-9. PMC: 1810553. DOI: 10.1186/gb-2006-7-s1-s4. View

2.
Handsaker R, Korn J, Nemesh J, McCarroll S . Discovery and genotyping of genome structural polymorphism by sequencing on a population scale. Nat Genet. 2011; 43(3):269-76. PMC: 5094049. DOI: 10.1038/ng.768. View

3.
Yoon S, Xuan Z, Makarov V, Ye K, Sebat J . Sensitive and accurate detection of copy number variants using read depth of coverage. Genome Res. 2009; 19(9):1586-92. PMC: 2752127. DOI: 10.1101/gr.092981.109. View

4.
Levy S, Sutton G, Ng P, Feuk L, Halpern A, Walenz B . The diploid genome sequence of an individual human. PLoS Biol. 2007; 5(10):e254. PMC: 1964779. DOI: 10.1371/journal.pbio.0050254. View

5.
Willer C, Speliotes E, Loos R, Li S, Lindgren C, Heid I . Six new loci associated with body mass index highlight a neuronal influence on body weight regulation. Nat Genet. 2008; 41(1):25-34. PMC: 2695662. DOI: 10.1038/ng.287. View