
A Genome-wide Survey of Human Pseudogenes

Overview
Journal: Genome Res
Specialty: Genetics
Date: 2003 Dec 6
PMID: 14656963
Citations: 120
Abstract

We screened all intergenic regions in the human genome to identify pseudogenes with a combination of homology searches and a functionality test using the ratio of replacement to silent nucleotide substitutions (KA/KS). We identified 19,724 regions, of which 95% ± 3% are estimated to evolve neutrally and thus are likely to encode pseudogenes. Half of these have no detectable truncation in their pseudocoding regions and therefore are not identifiable by methods that require the presence of truncations to prove nonfunctionality. A comparative analysis with the mouse genome showed that 70% of these pseudogenes have a retrotranspositional origin (processed), and the rest arose by segmental duplication (nonprocessed). Although the spread of both types of pseudogenes correlates with chromosome size, nonprocessed pseudogenes appear to be enriched in regions with high gene density. It is likely that the human pseudogenes identified here represent only a small fraction of the total, which probably exceeds the number of genes.
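The KA/KS functionality test mentioned in the abstract can be illustrated with a small sketch. The code below is not the authors' pipeline; it is a simplified counting scheme in the spirit of Nei and Gojobori, written only to show how a replacement-to-silent ratio is obtained from two aligned coding sequences. The function names (`synonymous_sites`, `jukes_cantor`, `ka_ks`) are hypothetical, and several simplifications are assumptions of this sketch: codon pairs that differ at more than one position are skipped, changes to stop codons are counted as replacement changes, and the Jukes-Cantor correction is applied to the raw proportions. A ratio near 1 is what neutral evolution would be expected to produce, whereas functional genes under purifying selection typically show a ratio well below 1.

```python
from itertools import product
from math import log

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def synonymous_sites(codon):
    """Number of silent sites in a codon (0 to 3).

    Each of the nine single-base changes contributes 1/3 of a site
    if it leaves the amino acid unchanged. Changes to a stop codon
    count as replacement changes (an assumption of this sketch).
    """
    same = 0
    for pos in range(3):
        for base in BASES:
            if base != codon[pos]:
                mutant = codon[:pos] + base + codon[pos + 1:]
                if CODON_TABLE[mutant] == CODON_TABLE[codon]:
                    same += 1
    return same / 3.0


def jukes_cantor(p):
    """Jukes-Cantor distance for a proportion of differences p.

    Returns None when the correction is undefined (p >= 0.75).
    """
    if p == 0:
        return 0.0
    if p >= 0.75:
        return None
    return -0.75 * log(1.0 - 4.0 * p / 3.0)


def ka_ks(seq_a, seq_b):
    """Return (KA, KS, KA/KS) for two aligned, in-frame sequences.

    Simplification: codon pairs with gaps, ambiguous bases, stop codons,
    or more than one differing position are skipped entirely.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    syn_sites = nonsyn_sites = 0.0
    syn_diffs = nonsyn_diffs = 0
    for i in range(0, len(seq_a) - len(seq_a) % 3, 3):
        ca, cb = seq_a[i:i + 3].upper(), seq_b[i:i + 3].upper()
        if ca not in CODON_TABLE or cb not in CODON_TABLE:
            continue
        if "*" in (CODON_TABLE[ca], CODON_TABLE[cb]):
            continue
        n_diff = sum(x != y for x, y in zip(ca, cb))
        if n_diff > 1:
            continue
        s = (synonymous_sites(ca) + synonymous_sites(cb)) / 2.0
        syn_sites += s
        nonsyn_sites += 3.0 - s
        if n_diff == 1:
            if CODON_TABLE[ca] == CODON_TABLE[cb]:
                syn_diffs += 1
            else:
                nonsyn_diffs += 1
    if syn_sites == 0 or nonsyn_sites == 0:
        return None, None, None
    ka = jukes_cantor(nonsyn_diffs / nonsyn_sites)
    ks = jukes_cantor(syn_diffs / syn_sites)
    ratio = ka / ks if ka is not None and ks else None
    return ka, ks, ratio


if __name__ == "__main__":
    # Toy alignment with a single silent change (GGG -> GGA, both glycine).
    print(ka_ks("GGG" * 10, "GGA" + "GGG" * 9))
```

For real data, an established implementation (for example a maximum-likelihood or full Nei-Gojobori estimator that handles multiple differences per codon) would be the safer choice; the sketch only makes the definition of the ratio concrete.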

Citing Articles

Loss to gain: pseudogenes in microorganisms, focusing on eubacteria, and their biological significance.

Yang Y, Wang P, El Qaidi S, Hardwidge P, Huang J, Zhu G. Appl Microbiol Biotechnol. 2024; 108(1):328.

PMID: 38717672 PMC: 11078800. DOI: 10.1007/s00253-023-12971-w.


Functional Characterization of a Phf8 Processed Pseudogene in the Mouse Genome.

St-Germain J, Khan M, Bavykina V, Desmarais R, Scott M, Boissonneault G. Genes (Basel). 2023; 14(1).

PMID: 36672913 PMC: 9859284. DOI: 10.3390/genes14010172.


CRISPR/Cas9-induced gene conversion between paralogs.

Yanovsky-Dagan S, Frumkin A, Lupski J, Harel T. HGG Adv. 2022; 3(2):100092.

PMID: 35199044 PMC: 8844715. DOI: 10.1016/j.xhgg.2022.100092.


PΨFinder: a practical tool for the identification and visualization of novel pseudogenes in DNA sequencing data.

Abrahamsson S, Eiengard F, Rohlin A, Lopez M. BMC Bioinformatics. 2022; 23(1):59.

PMID: 35114952 PMC: 8812246. DOI: 10.1186/s12859-022-04583-4.


Pseudogene-mediated DNA demethylation leads to oncogene activation.

Kwon J, Liu Y, Gao C, Bassal M, Jones A, Yang J. Sci Adv. 2021; 7(40):eabg1695.

PMID: 34597139 PMC: 10938534. DOI: 10.1126/sciadv.abg1695.


References
1. Mounsey A, Bauer P, Hope I. Evidence suggesting that a fifth of annotated Caenorhabditis elegans genes may be pseudogenes. Genome Res. 2002; 12(5):770-5. PMC: 186578. DOI: 10.1101/gr.208802.

2. Birney E, Durbin R. Dynamite: a flexible code generating language for dynamic programming methods used in sequence comparison. Proc Int Conf Intell Syst Mol Biol. 1997; 5:56-64.

3. Zhang Z, Harrison P, Gerstein M. Identification and analysis of over 2000 ribosomal protein pseudogenes in the human genome. Genome Res. 2002; 12(10):1466-82. PMC: 187539. DOI: 10.1101/gr.331902.

4. Prince V, Pickett F. Splitting pairs: the diverging fates of duplicated genes. Nat Rev Genet. 2002; 3(11):827-37. DOI: 10.1038/nrg928.

5. Waterston R, Lindblad-Toh K, Birney E, Rogers J, Abril J, Agarwal P. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002; 420(6915):520-62. DOI: 10.1038/nature01262.