
Landscape and Variation of Novel Retroduplications in 26 Human Populations

Overview
Specialty Biology
Date 2017 Jun 30
PMID 28662076
Citations 17
Abstract

Retroduplications come from reverse transcription of mRNAs and their insertion back into the genome. Here, we performed comprehensive discovery and analysis of retroduplications in a large cohort of 2,535 individuals from 26 human populations, as part of 1000 Genomes Phase 3. We developed an integrated approach to discover novel retroduplications combining high-coverage exome and low-coverage whole-genome sequencing data, utilizing information from both exon-exon junctions and discordant paired-end reads. We found 503 parent genes having novel retroduplications absent from the reference genome. Based solely on retroduplication variation, we built phylogenetic trees of human populations; these represent superpopulation structure well and indicate that variable retroduplications are effective population markers. We further identified 43 retroduplication parent genes differentiating superpopulations. This group contains several interesting insertion events, including a SLMO2 retroduplication and insertion into CAV3, which has a potential disease association. We also found retroduplications to be associated with a variety of genomic features: (1) Insertion sites were correlated with regular nucleosome positioning. (2) They, predictably, tend to avoid conserved functional regions, such as exons, but, somewhat surprisingly, also avoid introns. (3) Retroduplications tend to be co-inserted with young L1 elements, indicating recent retrotranspositional activity, and (4) they have a weak tendency to originate from highly expressed parent genes. Our investigation provides insight into the functional impact and association with genomic elements of retroduplications. We anticipate our approach and analytical methodology to have application in a more clinical context, where exome sequencing data is abundant and the discovery of retroduplications can potentially improve the accuracy of SNP calling.
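The exon-exon junction signal mentioned in the abstract can be illustrated with a minimal sketch. Because a retroduplication is copied from spliced mRNA, a genomic DNA read that aligns across two exons of a parent gene while skipping the intron between them is evidence for a retrocopy. The code below is not the authors' pipeline: the function names, the tuple-based read representation, the `tolerance` of 2 bp, and the `min_reads` threshold of 2 are all assumptions chosen for illustration, and the gene names and coordinates are made up.

```python
from collections import Counter


def exon_junctions(exons):
    """Return (donor_end, acceptor_start) pairs for consecutive exons of one gene."""
    ordered = sorted(exons)
    return {(ordered[i][1], ordered[i + 1][0]) for i in range(len(ordered) - 1)}


def count_junction_reads(gene_exons, split_reads, tolerance=2):
    """Count split reads whose gap coincides with an annotated intron.

    gene_exons:  dict mapping gene name -> list of (start, end) exon coordinates
    split_reads: iterable of (gene, left_block_end, right_block_start) tuples,
                 a hypothetical simplification of a split alignment
    tolerance:   allowed offset in bp between read gap and exon boundary (assumed)
    """
    support = Counter()
    for gene, left_end, right_start in split_reads:
        for donor, acceptor in exon_junctions(gene_exons.get(gene, [])):
            if (abs(left_end - donor) <= tolerance
                    and abs(right_start - acceptor) <= tolerance):
                support[gene] += 1
                break  # count each read at most once
    return support


def candidate_parent_genes(support, min_reads=2):
    """Genes with at least min_reads junction-spanning reads (threshold assumed)."""
    return sorted(gene for gene, n in support.items() if n >= min_reads)


# Toy data: coordinates and gene names are invented for the example.
gene_exons = {
    "GENE_A": [(100, 200), (500, 600), (900, 1000)],
    "GENE_B": [(50, 150), (400, 480)],
}
split_reads = [
    ("GENE_A", 200, 500),  # exact match to the exon 1 / exon 2 junction
    ("GENE_A", 601, 899),  # within tolerance of the exon 2 / exon 3 junction
    ("GENE_A", 300, 700),  # gap does not match any annotated intron
    ("GENE_B", 150, 400),  # single supporting read, below the threshold
]

support = count_junction_reads(gene_exons, split_reads)
candidates = candidate_parent_genes(support)
```

In practice, junction evidence alone identifies the parent gene but not where the copy landed; the abstract notes that discordant paired-end reads were used alongside it, which this sketch does not model.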

Citing Articles

Quantitative Analysis of Pseudogene-Associated Errors During Germline Variant Calling.

Podvalnyi A, Kopernik A, Sayganova M, Woroncow M, Zobkova G, Smirnova A Int J Mol Sci. 2025; 26(1).

PMID: 39796219 PMC: 11719938. DOI: 10.3390/ijms26010363.


Interchromosomal Colocalization with Parental Genes Is Linked to the Function and Evolution of Mammalian Retrocopies.

Yan Y, Tian Y, Wu Z, Zhang K, Yang R Mol Biol Evol. 2023; 40(12).

PMID: 38060983 PMC: 10733166. DOI: 10.1093/molbev/msad265.


Discovery of non-reference processed pseudogenes in the Swedish population.

Ten Berk de Boer E, Saether K, Eisfeldt J Front Genet. 2023; 14:1176626.

PMID: 37323659 PMC: 10267823. DOI: 10.3389/fgene.2023.1176626.


Ancient segmentally duplicated LCORL retrocopies in equids.

Batcher K, Varney S, Raudsepp T, Jevit M, Dickinson P, Jagannathan V PLoS One. 2023; 18(6):e0286861.

PMID: 37289743 PMC: 10249811. DOI: 10.1371/journal.pone.0286861.


Recent, full-length gene retrocopies are common in canids.

Batcher K, Varney S, York D, Blacksmith M, Kidd J, Rebhun R Genome Res. 2022; 32(8):1602-1611.

PMID: 35961775 PMC: 9435743. DOI: 10.1101/gr.276828.122.

