Automatization and Self-maintenance of the O-GlcNAcome Catalog: a Smart Scientific Database
Post-translational modifications (PTMs) are ubiquitous and essential for protein function and signaling, which motivates the need for sustainable, open models of web databases. The highly conserved O-GlcNAcylation is one of the most recently discovered PTMs and is investigated by a growing community. Historically, details about O-GlcNAcylated proteins and sites were dispersed across the literature and across web databases that were not O-GlcNAc-focused, rapidly became outdated or are now defunct. In a first effort to fill this gap, we recently published a human O-GlcNAcome catalog with a basic web interface. Based on the enthusiasm generated by this first resource, we extended our O-GlcNAcome catalog to include data from 42 distinct organisms and released the O-GlcNAc Database v1.2. In this version, more than 14 500 O-GlcNAcylated proteins and 11 000 O-GlcNAcylation sites are referenced from the curation of 2200 publications. In this article, we also present the extensive features of the O-GlcNAc Database, including the user-friendly interface, back-end and client-server interactions. We particularly emphasize our workflow, which involves a mostly automatized and self-maintained database and includes machine learning approaches for text mining. We hope that this software model will be useful beyond the O-GlcNAc community for setting up new smart scientific online databases in a short period of time. Indeed, this database system can be administered with little to no programming skills and is meant to be an example of a useful, sustainable and cost-efficient resource that relies exclusively on free open-source software elements (www.oglcnac.mcw.edu).