Reevaluating Human Gene Annotation: a Second-generation Analysis of Chromosome 22

Overview
Journal Genome Res
Specialty Genetics
Date 2003 Jan 17
PMID 12529303
Citations 28
Abstract

We report a second-generation gene annotation of human chromosome 22. Using expressed sequence databases, comparative sequence analysis, and experimental verification, we have extended genes, fused previously fragmented structures, and identified new genes. The total exon length of the annotation increased by 74% over our previously published annotation, and the annotation now includes 546 protein-coding genes and 234 pseudogenes. Thirty-two potential protein-coding annotations are partial copies of other genes and may represent duplications on an evolutionary path to change or loss of function. We also identified 31 non-protein-coding transcripts, including 16 possible antisense RNAs. By extrapolation, we estimate that the human genome contains 29,000-36,000 protein-coding genes, 21,300 pseudogenes, and 1,500 antisense RNAs. We suggest that our revised annotation criteria provide a paradigm for future annotation of the human genome.
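The abstract gives chromosome 22 counts and genome-wide estimates but not the scaling factors that connect them. As a rough illustration only, the sketch below back-calculates the share of each genome-wide total that chromosome 22 would hold if each estimate were a simple proportional scale-up of the chromosome 22 count. That proportionality is an assumption made here, not the authors' stated method, as is the pairing of the 16 possible antisense RNAs with the antisense estimate. The names `CHR22_COUNTS`, `GENOME_ESTIMATES`, and `implied_fractions` are hypothetical.

```python
# Counts and estimates quoted in the abstract. Treating each genome-wide
# estimate as a proportional scale-up of the chromosome 22 count is an
# ASSUMPTION made for illustration, not the authors' stated method.
CHR22_COUNTS = {"protein_coding": 546, "pseudogene": 234, "antisense": 16}
GENOME_ESTIMATES = {
    "protein_coding": (29_000, 36_000),  # (low, high) estimate
    "pseudogene": (21_300, 21_300),      # single point estimate
    "antisense": (1_500, 1_500),         # single point estimate
}


def implied_fraction(chr22_count, genome_estimate):
    """Share of the genome-wide total that chromosome 22 would hold."""
    return chr22_count / genome_estimate


def implied_fractions():
    """Return {category: (smallest_share, largest_share)}."""
    result = {}
    for category, count in CHR22_COUNTS.items():
        low, high = GENOME_ESTIMATES[category]
        # A larger genome-wide estimate implies a smaller chromosome 22 share.
        result[category] = (
            implied_fraction(count, high),
            implied_fraction(count, low),
        )
    return result


if __name__ == "__main__":
    for category, (smallest, largest) in implied_fractions().items():
        print(f"{category}: {smallest:.2%} to {largest:.2%}")
```

Under this assumption, the implied share for protein-coding genes comes out larger than the share for pseudogenes or antisense RNAs. That suggests the protein-coding estimate was not scaled in the same way, although the abstract alone does not say how it was derived.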

Citing Articles

Significant Association Between Adiponutrin and Hepatocellular Carcinoma Risk.

Li H, Liu F, Zhu H, Zhou X, Lu J, Chang H. Medicine (Baltimore). 2015; 94(47):e2019.

PMID: 26632699 PMC: 5058968. DOI: 10.1097/MD.0000000000002019.


Developmental transcriptome analysis of human erythropoiesis.

Shi L, Lin Y, Sierant M, Zhu F, Cui S, Guan Y. Hum Mol Genet. 2014; 23(17):4528-42.

PMID: 24781209 PMC: 4119405. DOI: 10.1093/hmg/ddu167.


Evidence for transcript networks composed of chimeric RNAs in human cells.

Djebali S, Lagarde J, Kapranov P, Lacroix V, Borel C, Mudge J. PLoS One. 2012; 7(1):e28213.

PMID: 22238572 PMC: 3251577. DOI: 10.1371/journal.pone.0028213.


The solute carrier families have a remarkably long evolutionary history with the majority of the human families present before divergence of Bilaterian species.

Hoglund P, Nordstrom K, Schioth H, Fredriksson R. Mol Biol Evol. 2010; 28(4):1531-41.

PMID: 21186191 PMC: 3058773. DOI: 10.1093/molbev/msq350.


The characteristics of human genes: analysis of human chromosome 22.

Dunham I, Beare D, Collins J. Comp Funct Genomics. 2008; 4(6):635-46.

PMID: 18629020 PMC: 2447302. DOI: 10.1002/cfg.335.

