
Systematic Inference of Copy-number Genotypes from Personal Genome Sequencing Data Reveals Extensive Olfactory Receptor Gene Content Diversity

Overview
Specialty Biology
Date 2010 Nov 19
PMID 21085617
Citations 40
Abstract

Copy-number variations (CNVs) are widespread in the human genome, but comprehensive assignment of integer locus copy-numbers (i.e., copy-number genotypes), which for example enables discrimination of homozygous from heterozygous CNVs, has remained challenging. Here we present CopySeq, a novel computational approach with an underlying statistical framework that analyzes the depth-of-coverage of high-throughput DNA sequencing reads, and can incorporate CNV-analysis approaches based on paired-end and breakpoint-junction analysis, to infer locus copy-number genotypes. We benchmarked CopySeq by genotyping 500 chromosome 1 CNV regions in 150 personal genomes sequenced at low coverage. The inferred copy-number genotypes were highly concordant with our own qPCR experiments (Pearson correlation coefficient 0.94) and with the published results of two microarray platforms (95-99% concordance). We further demonstrated the utility of CopySeq for analyzing gene regions enriched for segmental duplications by comprehensively inferring copy-number genotypes in the >800 CNV-enriched human olfactory receptor (OR) gene and pseudogene loci. CopySeq revealed that OR loci display an extensive range of locus copy-numbers across individuals, with zero to two copies at some OR loci and two to nine copies at others. Among genetic variants affecting OR loci, we identified deleterious variants including CNVs and SNPs affecting ~15% and ~20% of the human OR gene repertoire, respectively, implying that genetic variants with a possible impact on smell perception are widespread. Finally, we found that for several OR loci the reference genome appears to represent a minor-frequency variant, implying that the OR repertoire needs revision for future functional studies.
CopySeq can ascertain genomic structural variation in specific gene families as well as at a genome-wide scale, where it may enable the quantitative evaluation of CNVs in genome-wide association studies involving high-throughput sequencing.
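
The depth-of-coverage idea in the abstract can be illustrated with a minimal Python sketch. It is not CopySeq's statistical framework, which the abstract does not specify; it only shows the basic arithmetic of scaling a region's read depth against a baseline assumed to be diploid and rounding to an integer copy-number. The function name `estimate_copy_number`, its parameters, and the example depths are all hypothetical.

```python
from statistics import median


def estimate_copy_number(region_depths, baseline_depths,
                         baseline_ploidy=2, max_copies=12):
    """Naive integer copy-number estimate from read depth.

    Assumption (not from the paper): the baseline depths come from
    regions known to be present in `baseline_ploidy` copies, and read
    depth scales linearly with copy-number.
    """
    if not region_depths or not baseline_depths:
        raise ValueError("both depth lists must be non-empty")

    baseline = median(baseline_depths)
    if baseline <= 0:
        raise ValueError("baseline depth must be positive")

    # Depth ratio relative to the baseline, rescaled to copies per genome.
    ratio = median(region_depths) / baseline
    copies = round(ratio * baseline_ploidy)

    # Clamp to a plausible range; max_copies is an arbitrary cap.
    return max(0, min(max_copies, copies))


if __name__ == "__main__":
    # Hypothetical per-window read depths from a low-coverage genome.
    baseline = [4, 5, 4, 5, 6]
    print(estimate_copy_number([0, 0, 0, 1, 0], baseline))  # homozygous deletion
    print(estimate_copy_number([2, 3, 2, 2, 3], baseline))  # heterozygous deletion
    print(estimate_copy_number([7, 8, 7, 9, 8], baseline))  # duplication
```

Using the median rather than the mean keeps a single outlier window from shifting the estimate. A real genotyper would also need to model sampling noise at low coverage, which this sketch ignores.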

Citing Articles

Copy number variations in autistic children.

Alhazmi S, Alharthi M, Alzahrani M, Alrofaidi A, Basingab F, Almuhammadi A. Biomed Rep. 2024; 21(1):107.

PMID: 38868529 PMC: 11168027. DOI: 10.3892/br.2024.1795.


Human subsistence and signatures of selection on chemosensory genes.

Veilleux C, Garrett E, Pajic P, Saitou M, Ochieng J, Dagsaan L. Commun Biol. 2023; 6(1):683.

PMID: 37400713 PMC: 10317983. DOI: 10.1038/s42003-023-05047-y.


Clinical and molecular characterization of COVID-19 hospitalized patients.

Benetti E, Giliberti A, Emiliozzi A, Valentino F, Bergantini L, Fallerini C. PLoS One. 2020; 15(11):e0242534.

PMID: 33206719 PMC: 7673557. DOI: 10.1371/journal.pone.0242534.


Genome-wide scan for selection signatures reveals novel insights into the adaptive capacity in local North African cattle.

Ben-Jemaa S, Mastrangelo S, Lee S, Lee J, Boussaha M. Sci Rep. 2020; 10(1):19466.

PMID: 33173134 PMC: 7655849. DOI: 10.1038/s41598-020-76576-3.


Genome-wide detection of copy number variations in polled yak using the Illumina BovineHD BeadChip.

Jia C, Wang H, Li C, Wu X, Zan L, Ding X. BMC Genomics. 2019; 20(1):376.

PMID: 31088363 PMC: 6518677. DOI: 10.1186/s12864-019-5759-1.

