» Articles » PMID: 23221299

Targeted Journal Curation As a Method to Improve Data Currency at the Comparative Toxicogenomics Database

Overview
Specialty Biology
Date 2012 Dec 11
PMID 23221299
Citations 8
Authors
Affiliations
Soon will be listed here.
Abstract

The Comparative Toxicogenomics Database (CTD) is a public resource that promotes understanding about the effects of environmental chemicals on human health. CTD biocurators read the scientific literature and manually curate a triad of chemical-gene, chemical-disease and gene-disease interactions. Typically, articles for CTD are selected using a chemical-centric approach by querying PubMed to retrieve a corpus containing the chemical of interest. Although this technique ensures adequate coverage of knowledge about the chemical (i.e. data completeness), it does not necessarily reflect the most current state of all toxicological research in the community at large (i.e. data currency). Keeping databases current with the most recent scientific results, as well as providing a rich historical background from legacy articles, is a challenging process. To address this issue of data currency, CTD designed and tested a journal-centric approach of curation to complement our chemical-centric method. We first identified priority journals based on defined criteria. Next, over 7 weeks, three biocurators reviewed 2425 articles from three consecutive years (2009-2011) of three targeted journals. From this corpus, 1252 articles contained relevant data for CTD and 52 752 interactions were manually curated. Here, we describe our journal selection process, two methods of document delivery for the biocurators and the analysis of the resulting curation metrics, including data currency, and both intra-journal and inter-journal comparisons of research topics. Based on our results, we expect that curation by select journals can (i) be easily incorporated into the curation pipeline to complement our chemical-centric approach; (ii) build content more evenly for chemicals, genes and diseases in CTD (rather than biasing data by chemicals-of-interest); (iii) reflect developing areas in environmental health and (iv) improve overall data currency for chemicals, genes and diseases. Database URL: http://ctdbase.org/

Citing Articles

Integrating AI-powered text mining from PubTator into the manual curation workflow at the Comparative Toxicogenomics Database.

Wiegers T, Davis A, Wiegers J, Sciaky D, Barkalow F, Wyatt B Database (Oxford). 2025; 2025.

PMID: 39982792 PMC: 11844237. DOI: 10.1093/database/baaf013.


Comparative Toxicogenomics Database (CTD): update 2023.

Davis A, Wiegers T, Johnson R, Sciaky D, Wiegers J, Mattingly C Nucleic Acids Res. 2022; 51(D1):D1257-D1262.

PMID: 36169237 PMC: 9825590. DOI: 10.1093/nar/gkac833.


Comparative Toxicogenomics Database (CTD): update 2021.

Davis A, Grondin C, Johnson R, Sciaky D, Wiegers J, Wiegers T Nucleic Acids Res. 2020; 49(D1):D1138-D1143.

PMID: 33068428 PMC: 7779006. DOI: 10.1093/nar/gkaa891.


Leveraging the Comparative Toxicogenomics Database to Fill in Knowledge Gaps for Environmental Health: A Test Case for Air Pollution-induced Cardiovascular Disease.

Davis A, Wiegers T, Grondin C, Johnson R, Sciaky D, Wiegers J Toxicol Sci. 2020; 177(2):392-404.

PMID: 32663284 PMC: 7548289. DOI: 10.1093/toxsci/kfaa113.


The Comparative Toxicogenomics Database: update 2019.

Davis A, Grondin C, Johnson R, Sciaky D, McMorran R, Wiegers J Nucleic Acids Res. 2018; 47(D1):D948-D954.

PMID: 30247620 PMC: 6323936. DOI: 10.1093/nar/gky868.


References
1.
Davis A, Wiegers T, Rosenstein M, Mattingly C . MEDIC: a practical disease vocabulary used at the Comparative Toxicogenomics Database. Database (Oxford). 2012; 2012:bar065. PMC: 3308155. DOI: 10.1093/database/bar065. View

2.
Mattingly C, Rosenstein M, Davis A, Colby G, Forrest Jr J, Boyer J . The comparative toxicogenomics database: a cross-species resource for building chemical-gene interaction networks. Toxicol Sci. 2006; 92(2):587-95. PMC: 1586111. DOI: 10.1093/toxsci/kfl008. View

3.
Hirschman J, Berardini T, Drabkin H, Howe D . A MOD(ern) perspective on literature curation. Mol Genet Genomics. 2010; 283(5):415-25. PMC: 2854346. DOI: 10.1007/s00438-010-0525-8. View

4.
Dowell K, McAndrews-Hill M, Hill D, Drabkin H, Blake J . Integrating text mining into the MGI biocuration workflow. Database (Oxford). 2010; 2009:bap019. PMC: 2797454. DOI: 10.1093/database/bap019. View

5.
Bunt S, Grumbling G, Field H, Marygold S, Brown N, Millburn G . Directly e-mailing authors of newly published papers encourages community curation. Database (Oxford). 2012; 2012:bas024. PMC: 3342516. DOI: 10.1093/database/bas024. View