UniProt: the Universal Protein Knowledgebase in 2021

Overview
Specialty: Biochemistry
Date: 2020 Nov 25
PMID: 33237286
Citations: 2981
Abstract

The aim of the UniProt Knowledgebase is to provide users with a comprehensive, high-quality and freely accessible set of protein sequences annotated with functional information. In this article, we describe significant updates that we have made over the last two years to the resource. The number of sequences in UniProtKB has risen to approximately 190 million, despite continued work to reduce sequence redundancy at the proteome level. We have adopted new methods of assessing proteome completeness and quality. We continue to extract detailed annotations from the literature to add to reviewed entries and supplement these in unreviewed entries with annotations provided by automated systems such as the newly implemented Association-Rule-Based Annotator (ARBA). We have developed a credit-based publication submission interface to allow the community to contribute publications and annotations to UniProt entries. We describe how UniProtKB responded to the COVID-19 pandemic through expert curation of relevant entries that were rapidly made available to the research community through a dedicated portal. UniProt resources are available under a CC-BY (4.0) license via the web at https://www.uniprot.org/.

Citing Articles

Therapeutic Mechanisms of Medicine Food Homology Plants in Alzheimer's Disease: Insights from Network Pharmacology, Machine Learning, and Molecular Docking.

Wen S, Han Y, Li Y, Zhan D. Int J Mol Sci. 2025; 26(5).

PMID: 40076742 PMC: 11899993. DOI: 10.3390/ijms26052121.


Rampant Interkingdom Horizontal Gene Transfer in Pezizomycotina? An Updated Inspection of Anomalous Phylogenies.

Aguirre-Carvajal K, Cardenas S, Munteanu C, Armijos-Jaramillo V. Int J Mol Sci. 2025; 26(5).

PMID: 40076423 PMC: 11898892. DOI: 10.3390/ijms26051795.


XGBMUT: Predicting the Functional Impact of Missense Mutations Using an Extreme Gradient Boost Classifier.

Pereira G, Da Conceicao L, Abrahim-Vieira B, Rodrigues C, Cabral L, Coelho R. ACS Omega. 2025; 10(8):8349-8360.

PMID: 40060867 PMC: 11886911. DOI: 10.1021/acsomega.4c10179.


Exploring the active ingredients and potential mechanisms of Pingchan granules in Parkinson's disease treatment through network pharmacology and transcriptomics.

Xu Q, Wang Y, Wang C, Jiang S, Zhang B, Tian J. Sci Rep. 2025; 15(1):7847.

PMID: 40050654 PMC: 11885611. DOI: 10.1038/s41598-025-91344-x.


Cellular Activity of CQWW Nullomer-Derived Peptides.

Shave S, Isaksson R, Pham N, Elliott R, Dawson J, Soudant J. ACS Omega. 2025; 10(7):6794-6800.

PMID: 40028100 PMC: 11865978. DOI: 10.1021/acsomega.4c08860.

