Segmental Duplications in the Human Genome Reveal Details of Pseudogene Formation
Overview
Duplicated pseudogenes in the human genome are disabled copies of functioning parent genes. They result from block duplication events that have occurred throughout evolutionary history. Relatively recent duplications (with sequence similarity ≥90% and length ≥1 kb) are termed segmental duplications (SDs); here, we analyze the interrelationship of SDs and pseudogenes. We present a decision-tree approach that classifies pseudogenes based on their characteristics, and those of their parents, in relation to SDs. The classification identifies 140 novel pseudogenes and enables improved annotation of the 3172 pseudogenes located in SDs. In particular, it reveals that many pseudogenes in SDs likely did not arise directly from parent genes but are the result of a multi-step process. In these cases, the initial duplication or retrotransposition of a parent gene gives rise to a 'parent pseudogene', and further duplication of that parent pseudogene then creates duplicated-duplicated or duplicated-processed pseudogenes, respectively. Moreover, we can precisely identify these parent pseudogenes by their overlap with ancestral SD loci. Finally, comparing the nucleotide substitutions per site in a pseudogene with those in its surrounding SD region allows us to estimate the time difference between the duplication and disablement events. This comparison suggests that most duplicated pseudogenes in SDs were likely disabled around the time of the original duplication.
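The two quantitative ideas in the abstract, the SD thresholds and the comparison of substitutions per site, can be illustrated with a minimal Python sketch. Only the thresholds (≥90% similarity, ≥1 kb) come from the text. The function names, the half-open coordinate convention, and the Jukes-Cantor (JC69) correction are assumptions made for illustration; the abstract does not state which substitution model or overlap rule the authors used.

```python
import math

# Thresholds quoted in the abstract for a segmental duplication (SD).
SD_MIN_IDENTITY = 0.90    # sequence similarity >= 90%
SD_MIN_LENGTH_BP = 1000   # length >= 1 kb


def is_segmental_duplication(identity, length_bp):
    """Return True if a duplicated block meets the SD thresholds above."""
    return identity >= SD_MIN_IDENTITY and length_bp >= SD_MIN_LENGTH_BP


def intervals_overlap(start_a, end_a, start_b, end_b):
    """Overlap test for half-open intervals [start, end) on one chromosome.

    The half-open convention is an assumption of this sketch.
    """
    return start_a < end_b and start_b < end_a


def jc69_distance(seq_a, seq_b):
    """Estimate substitutions per site between two aligned sequences.

    Uses the Jukes-Cantor correction K = -3/4 * ln(1 - 4p/3), where p is
    the fraction of mismatched sites. Columns with gaps or ambiguous
    bases are skipped. JC69 is an illustrative choice, not necessarily
    the model used in the paper.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    valid = set("ACGT")
    compared = 0
    mismatches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in valid and b in valid:
            compared += 1
            if a != b:
                mismatches += 1
    if compared == 0:
        raise ValueError("no comparable sites in the alignment")
    p = mismatches / compared
    if p >= 0.75:
        return math.inf  # correction is undefined at saturation
    return -0.75 * math.log(1.0 - (4.0 / 3.0) * p)


if __name__ == "__main__":
    # Toy aligned sequences (hypothetical, for illustration only).
    pseudogene, parent = "ACGTACGTAC", "ACGTACGAAC"
    sd_copy_1, sd_copy_2 = "TTGACCGTAA", "TTGACCGTAT"
    k_pseudogene = jc69_distance(pseudogene, parent)
    k_sd = jc69_distance(sd_copy_1, sd_copy_2)
    print("pseudogene vs parent:", round(k_pseudogene, 4))
    print("SD copy vs SD copy:  ", round(k_sd, 4))
    print("difference:          ", round(k_pseudogene - k_sd, 4))
```

In this sketch the difference between the two distances is only a rough proxy for the interval between duplication and disablement. Converting it to time would require a substitution rate, which the abstract does not give.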