Reevaluating Human Gene Annotation: a Second-generation Analysis of Chromosome 22
Overview
Authors
Affiliations
We report a second-generation gene annotation of human chromosome 22. Using expressed sequence databases, comparative sequence analysis, and experimental verification, we have extended genes, fused previously fragmented structures, and identified new genes. The total length in exons of annotation was increased by 74% over our previously published annotation and includes 546 protein-coding genes and 234 pseudogenes. Thirty-two potential protein-coding annotations are partial copies of other genes, and may represent duplications on an evolutionary path to change or loss of function. We also identified 31 non-protein-coding transcripts, including 16 possible antisense RNAs. By extrapolation, we estimate the human genome contains 29,000-36,000 protein-coding genes, 21,300 pseudogenes, and 1500 antisense RNAs. We suggest that our revised annotation criteria provide a paradigm for future annotation of the human genome.
Significant Association Between Adiponutrin and Hepatocellular Carcinoma Risk.
Li H, Liu F, Zhu H, Zhou X, Lu J, Chang H Medicine (Baltimore). 2015; 94(47):e2019.
PMID: 26632699 PMC: 5058968. DOI: 10.1097/MD.0000000000002019.
Developmental transcriptome analysis of human erythropoiesis.
Shi L, Lin Y, Sierant M, Zhu F, Cui S, Guan Y Hum Mol Genet. 2014; 23(17):4528-42.
PMID: 24781209 PMC: 4119405. DOI: 10.1093/hmg/ddu167.
Evidence for transcript networks composed of chimeric RNAs in human cells.
Djebali S, Lagarde J, Kapranov P, Lacroix V, Borel C, Mudge J PLoS One. 2012; 7(1):e28213.
PMID: 22238572 PMC: 3251577. DOI: 10.1371/journal.pone.0028213.
Hoglund P, Nordstrom K, Schioth H, Fredriksson R Mol Biol Evol. 2010; 28(4):1531-41.
PMID: 21186191 PMC: 3058773. DOI: 10.1093/molbev/msq350.
The characteristics of human genes: analysis of human chromosome 22.
Dunham I, Beare D, Collins J Comp Funct Genomics. 2008; 4(6):635-46.
PMID: 18629020 PMC: 2447302. DOI: 10.1002/cfg.335.