Millions of Years of Evolution Preserved: A Comprehensive Catalog of the Processed Pseudogenes in the Human Genome

Overview
Journal: Genome Res
Specialty: Genetics
Date: 2003 Dec 6
PMID: 14656962
Citations: 202
Abstract

Processed pseudogenes were created by reverse-transcription of mRNAs; they provide snapshots of ancient genes existing millions of years ago in the genome. To find them in the present-day human genome, we developed a pipeline using features such as intron-absence, frame-disruption, polyadenylation, and truncation. This has enabled us to identify in recent genome drafts approximately 8000 processed pseudogenes (distributed from http://pseudogene.org). Overall, processed pseudogenes are very similar to their closest corresponding human gene, being 94% complete in coding regions, with sequence similarity of 75% for amino acids and 86% for nucleotides. Their chromosomal distribution appears random and dispersed, with the numbers on chromosomes proportional to length, suggesting sustained "bombardment" over evolution. However, the distribution does vary with GC-content: processed pseudogenes occur mostly in intermediate GC-content regions. This is similar to Alus but contrasts with functional genes and L1-repeats. Pseudogenes, moreover, have age profiles similar to Alus. The number of pseudogenes associated with a given gene follows a power-law relationship, with a few genes giving rise to many pseudogenes and most giving rise to few. The prevalence of processed pseudogenes agrees well with germ-line gene expression. Highly expressed ribosomal proteins account for approximately 20% of the total. Other notables include cyclophilin-A, keratin, GAPDH, and cytochrome c.
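The four features named in the abstract (intron-absence, frame-disruption, polyadenylation, truncation) can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the `Candidate` record, the function names, the 20-nucleotide window, and the 0.8 adenine threshold are all hypothetical choices made for illustration. The sketch also assumes the alignment starts in frame and that gaps (`-`) mark deletions relative to the parent coding sequence.

```python
from dataclasses import dataclass

STOP_CODONS = {"TAA", "TAG", "TGA"}


@dataclass
class Candidate:
    """Hypothetical record: one genomic hit aligned to a parent gene's coding sequence."""
    aligned_cds: str         # genomic sequence aligned to the parent CDS, '-' marks a deleted base
    parent_cds_length: int   # length of the parent coding sequence, in nucleotides
    n_alignment_blocks: int  # 1 means the hit is contiguous (no intron-sized interruptions)
    downstream: str          # genomic sequence immediately 3' of the hit


def has_frame_disruption(aligned_cds: str) -> bool:
    """True if an internal gap run is not a multiple of 3, or an internal stop codon occurs."""
    core = aligned_cds.strip("-")  # leading/trailing gaps are truncation, not frameshifts
    run = 0
    for ch in core:
        if ch == "-":
            run += 1
        else:
            if run % 3 != 0:
                return True  # deletion length breaks the reading frame
            run = 0
    seq = core.replace("-", "").upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - 2, 3)]
    # The final codon is skipped because a terminal stop is expected there.
    return any(codon in STOP_CODONS for codon in codons[:-1])


def has_polya_tail(downstream: str, window: int = 20, min_fraction: float = 0.8) -> bool:
    """True if the first `window` bases after the hit are mostly adenine (assumed thresholds)."""
    head = downstream[:window].upper()
    return len(head) > 0 and head.count("A") / len(head) >= min_fraction


def describe(c: Candidate) -> dict:
    """Summarize the four features for one candidate."""
    aligned_bases = len(c.aligned_cds.replace("-", ""))
    return {
        "intronless": c.n_alignment_blocks == 1,
        "frame_disrupted": has_frame_disruption(c.aligned_cds),
        "polyadenylated": has_polya_tail(c.downstream),
        "coverage": aligned_bases / c.parent_cds_length,  # below 1.0 indicates truncation
    }


# Toy input, invented for illustration only (not real genomic data).
toy = Candidate(
    aligned_cds="ATGGCCTAGGAA-TTTTGA",
    parent_cds_length=24,
    n_alignment_blocks=1,
    downstream="A" * 18 + "GATC",
)
print(describe(toy))
```

A real pipeline would derive these fields from genome-to-protein or genome-to-mRNA alignments and would also have to handle insertions; the sketch only shows how the four signals differ from one another.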

Citing Articles

Identification of Retrocopies in Lepidoptera and Impact on Domestication of Silkworm.

Bie L, Sun J, Wang Y, Wang C Genes (Basel). 2025; 15(12).

PMID: 39766908 PMC: 11675541. DOI: 10.3390/genes15121641.


L1-ORF1p nucleoprotein can rapidly assume distinct conformations and simultaneously bind more than one nucleic acid.

Cashen B, Naufer M, Morse M, McCauley M, Rouzina I, Jones C Nucleic Acids Res. 2024; 52(22):14013-14029.

PMID: 39565204 PMC: 11662928. DOI: 10.1093/nar/gkae1141.


The reconstruction of evolutionary dynamics of processed pseudogenes indicates deep silencing of "retrobiome" in naked mole rat.

Kogan V, Molodtsov I, Fleyshman D, Leontieva O, Koman I, Gudkov A Proc Natl Acad Sci U S A. 2024; 121(45):e2313581121.

PMID: 39467133 PMC: 11551321. DOI: 10.1073/pnas.2313581121.


Exploring the evolving roles and clinical significance of circRNAs in head and neck squamous cell carcinoma.

Lei P, Guo Q, Hao J, Liu H, Chen Y, Wu F J Cancer. 2024; 15(12):3984-3994.

PMID: 38911371 PMC: 11190751. DOI: 10.7150/jca.96614.


Exploring the enigma: history, present, and future of long non-coding RNAs in cancer.

Naseer Q, Malik A, Zhang F, Chen S Discov Oncol. 2024; 15(1):214.

PMID: 38847897 PMC: 11161455. DOI: 10.1007/s12672-024-01077-y.

