Recent, Full-length Gene Retrocopies Are Common in Canids
Overview
Authors
Affiliations
Gene retrocopies arise from the reverse transcription and insertion into the genome of processed mRNA transcripts. Although many retrocopies have acquired mutations that render them functionally inactive, most mammals retain active LINE-1 sequences capable of producing new retrocopies. New retrocopies, referred to as retro copy number variants (retroCNVs), may not be identified by standard variant calling techniques in high-throughput sequencing data. Although multiple functional retroCNVs have been associated with skeletal dysplasias in dogs, the full landscape of canid retroCNVs has not been characterized. Here, retroCNV discovery was performed on a whole-genome sequencing data set of 293 canids from 76 breeds. We identified retroCNV parent genes via the presence of mRNA-specific 30-mers, and then identified retroCNV insertion sites through discordant read analysis. In total, we resolved insertion sites for 1911 retroCNVs from 1179 parent genes, 1236 of which appeared identical to their parent genes. Dogs had on average 54.1 total retroCNVs and 1.4 private retroCNVs. We found evidence of expression in testes for 12% (14/113) of the retroCNVs identified in six Golden Retrievers, including four chimeric transcripts, and 97 retroCNVs also had significantly elevated across dog breeds, possibly indicating selection. We applied our approach to a subset of human genomes and detected an average of 4.2 retroCNVs per sample, highlighting a 13-fold relative increase of retroCNV frequency in dogs. Particularly in canids, retroCNVs are a largely unexplored source of genetic variation that can contribute to genome plasticity and that should be considered when investigating traits and diseases.
Large-scale genomic analysis of the domestic dog informs biological discovery.
Buckley R, Ostrander E Genome Res. 2024; 34(6):811-821.
PMID: 38955465 PMC: 11293549. DOI: 10.1101/gr.278569.123.
Duplications and Retrogenes Are Numerous and Widespread in Modern Canine Genomic Assemblies.
Nguyen A, Blacksmith M, Kidd J Genome Biol Evol. 2024; 16(7).
PMID: 38946312 PMC: 11259980. DOI: 10.1093/gbe/evae142.
Yan Y, Tian Y, Wu Z, Zhang K, Yang R Mol Biol Evol. 2023; 40(12).
PMID: 38060983 PMC: 10733166. DOI: 10.1093/molbev/msad265.
Special Issue: "Canine Genetics 2".
Leeb T Genes (Basel). 2023; 14(10).
PMID: 37895280 PMC: 10606197. DOI: 10.3390/genes14101930.
Current Classification of Canine Muscular Dystrophies and Identification of New Variants.
Shelton G, Minor K, Friedenberg S, Cullen J, Guo L, Mickelson J Genes (Basel). 2023; 14(8).
PMID: 37628610 PMC: 10454810. DOI: 10.3390/genes14081557.