Metrics for the Human Proteome Project 2013-2014 and Strategies for Finding Missing Proteins
Overview
Authors
Affiliations
One year ago the Human Proteome Project (HPP) leadership designated the baseline metrics for the Human Proteome Project to be based on neXtProt with a total of 13,664 proteins validated at protein evidence level 1 (PE1) by mass spectrometry, antibody-capture, Edman sequencing, or 3D structures. Corresponding chromosome-specific data were provided from PeptideAtlas, GPMdb, and Human Protein Atlas. This year, the neXtProt total is 15,646 and the other resources, which are inputs to neXtProt, have high-quality identifications and additional annotations for 14,012 in PeptideAtlas, 14,869 in GPMdb, and 10,976 in HPA. We propose to remove 638 genes from the denominator that are "uncertain" or "dubious" in Ensembl, UniProt/SwissProt, and neXtProt. That leaves 3844 "missing proteins", currently having no or inadequate documentation, to be found from a new denominator of 19,490 protein-coding genes. We present those tabulations and web links and discuss current strategies to find the missing proteins.
Expanding and Enriching the LncRNA Gene-Disease Landscape Using the GeneCaRNA Database.
Aggarwal S, Rosenblum C, Gould M, Ziman S, Barshir R, Zelig O Biomedicines. 2024; 12(6).
PMID: 38927512 PMC: 11202217. DOI: 10.3390/biomedicines12061305.
Protocol for Increasing the Sensitivity of MS-Based Protein Detection in Human Chorionic Villi.
Shkrigunov T, Pogodin P, Zgoda V, Larina O, Kisrieva Y, Klimenko M Curr Issues Mol Biol. 2022; 44(5):2069-2088.
PMID: 35678669 PMC: 9164042. DOI: 10.3390/cimb44050140.
Enhanced Validation of Antibodies Enables the Discovery of Missing Proteins.
Sivertsson A, Lindstrom E, Oksvold P, Katona B, Hikmet F, Vuu J J Proteome Res. 2020; 19(12):4766-4781.
PMID: 33170010 PMC: 7723238. DOI: 10.1021/acs.jproteome.0c00486.
A high-stringency blueprint of the human proteome.
Adhikari S, Nice E, Deutsch E, Lane L, Omenn G, Pennington S Nat Commun. 2020; 11(1):5301.
PMID: 33067450 PMC: 7568584. DOI: 10.1038/s41467-020-19045-9.
Omenn G, Lane L, Overall C, Cristea I, Corrales F, Lindskog C J Proteome Res. 2020; 19(12):4735-4746.
PMID: 32931287 PMC: 7718309. DOI: 10.1021/acs.jproteome.0c00485.