» Articles » PMID: 26994912

GO Annotation in InterPro: Why Stability Does Not Indicate Accuracy in a Sea of Changing Annotations

Overview
Specialty Biology
Date 2016 Mar 21
PMID 26994912
Citations 10
Authors
Affiliations
Soon will be listed here.
Abstract

The removal of annotation from biological databases is often perceived as an indicator of erroneous annotation. As a corollary, annotation stability is considered to be a measure of reliability. However, diverse data-driven events can affect the stability of annotations in both primary protein sequence databases and the protein family databases that are built upon the sequence databases and used to help annotate them. Here, we describe some of these events and their consequences for the InterPro database, and demonstrate that annotation removal or reassignment is not always linked to incorrect annotation by the curator. Database URL: http://www.ebi.ac.uk/interpro.

Citing Articles

Defining the condensate landscape of fusion oncoproteins.

Tripathi S, Shirnekhi H, Gorman S, Chandra B, Baggett D, Park C Nat Commun. 2023; 14(1):6008.

PMID: 37770423 PMC: 10539325. DOI: 10.1038/s41467-023-41655-2.


Whole-Genome Sequence Analysis of an Endophytic Fungus sp. SPS-2 and Its Biosynthetic Potential of Bioactive Secondary Metabolites.

Tao J, Bai X, Zeng M, Li M, Hu Z, Hua Y Microorganisms. 2022; 10(9).

PMID: 36144391 PMC: 9503250. DOI: 10.3390/microorganisms10091789.


Microbial Strategies for Survival in the Glass Sponge .

Bayer K, Busch K, Kenchington E, Beazley L, Franzenburg S, Michels J mSystems. 2020; 5(4).

PMID: 32788407 PMC: 7426153. DOI: 10.1128/mSystems.00473-20.


Computational discovery of direct associations between GO terms and protein domains.

Alborzi S, Ritchie D, Devignes M BMC Bioinformatics. 2018; 19(Suppl 14):413.

PMID: 30453875 PMC: 6245584. DOI: 10.1186/s12859-018-2380-2.


InterPro in 2019: improving coverage, classification and access to protein sequence annotations.

Mitchell A, Attwood T, Babbitt P, Blum M, Bork P, Bridge A Nucleic Acids Res. 2018; 47(D1):D351-D360.

PMID: 30398656 PMC: 6323941. DOI: 10.1093/nar/gky1100.


References
1.
Sigrist C, de Castro E, Cerutti L, Cuche B, Hulo N, Bridge A . New and continuing developments at PROSITE. Nucleic Acids Res. 2012; 41(Database issue):D344-7. PMC: 3531220. DOI: 10.1093/nar/gks1067. View

2.
Attwood T, Coletta A, Muirhead G, Pavlopoulou A, Philippou P, Popov I . The PRINTS database: a fine-grained protein sequence annotation and analysis resource--its status in 2012. Database (Oxford). 2012; 2012:bas019. PMC: 3326521. DOI: 10.1093/database/bas019. View

3.
Weits D, Giuntoli B, Kosmacz M, Parlanti S, Hubberten H, Riegler H . Plant cysteine oxidases control the oxygen-dependent branch of the N-end-rule pathway. Nat Commun. 2014; 5:3425. PMC: 3959200. DOI: 10.1038/ncomms4425. View

4.
Letunic I, Doerks T, Bork P . SMART: recent updates, new developments and status in 2015. Nucleic Acids Res. 2014; 43(Database issue):D257-60. PMC: 4384020. DOI: 10.1093/nar/gku949. View

5.
Mitchell A, Chang H, Daugherty L, Fraser M, Hunter S, Lopez R . The InterPro protein families database: the classification resource after 15 years. Nucleic Acids Res. 2014; 43(Database issue):D213-21. PMC: 4383996. DOI: 10.1093/nar/gku1243. View