Critical Analysis of CCSD Data Quality
Overview
Medical Informatics
Authors
Affiliations
Systematization and classification of carbohydrates contribute greatly to development of modern biomedical sciences. CCSD (CarbBank) data constitute the significant part of nearly all existing carbohydrate databases. However, these data have not been verified from their original deposit. During the expansion of Bacterial Carbohydrate Structure Database (BCSDB) project, we checked CCSD data quality and found that about 35% of records contained errors. The CCSD data cannot be used without manual verification, while CCSD errors migrate from database to database.
Toukach P, Egorova K Sci Data. 2022; 9(1):131.
PMID: 35354826 PMC: 8968703. DOI: 10.1038/s41597-022-01186-9.
PubChem chemical structure standardization.
Hahnke V, Kim S, Bolton E J Cheminform. 2018; 10(1):36.
PMID: 30097821 PMC: 6086778. DOI: 10.1186/s13321-018-0293-8.
Carbohydrate structure database merged from bacterial, archaeal, plant and fungal parts.
Toukach P, Egorova K Nucleic Acids Res. 2015; 44(D1):D1229-36.
PMID: 26286194 PMC: 4702937. DOI: 10.1093/nar/gkv840.
Qrator: a web-based curation tool for glycan structures.
Eavenson M, Kochut K, Miller J, Ranzinger R, Tiemeyer M, Aoki K Glycobiology. 2014; 25(1):66-73.
PMID: 25165068 PMC: 4245907. DOI: 10.1093/glycob/cwu090.
Using databases and web resources for glycomics research.
Aoki-Kinoshita K Mol Cell Proteomics. 2013; 12(4):1036-45.
PMID: 23325765 PMC: 3617328. DOI: 10.1074/mcp.R112.026252.