PMID: 36409836

Making Common Fund Data More Findable: Catalyzing a Data Ecosystem

Abstract

The Common Fund Data Ecosystem (CFDE) has created a flexible system of data federation that enables researchers to discover datasets from across the US National Institutes of Health Common Fund without requiring that data owners move, reformat, or rehost those data. This system is centered on a catalog that integrates detailed descriptions of biomedical datasets from individual Common Fund Programs' Data Coordination Centers (DCCs) into a uniform metadata model that can then be indexed and searched from a centralized portal. This Crosscut Metadata Model (C2M2) supports the wide variety of data types and metadata terms used by individual DCCs and can readily describe nearly all forms of biomedical research data. We detail its use to ingest and index data from 11 DCCs.

Citing Articles

NCI Cancer Research Data Commons: Lessons Learned and Future State.

Kim E, Davidsen T, Davis-Dusenbery B, Baumann A, Maggio A, Chen Z. Cancer Res. 2024; 84(9):1404-1409.

PMID: 38488510 PMC: 11063686. DOI: 10.1158/0008-5472.CAN-23-2730.


NCI Cancer Research Data Commons: Core Standards and Services.

Brady A, Charbonneau A, Grossman R, Creasy H, Renner R, Pihl T. Cancer Res. 2024; 84(9):1384-1387.

PMID: 38488505 PMC: 11067691. DOI: 10.1158/0008-5472.CAN-23-2655.


The DO-KB Knowledgebase: a 20-year journey developing the disease open science ecosystem.

Baron J, Johnson C, Schor M, Olley D, Nickel L, Felix V. Nucleic Acids Res. 2023; 52(D1):D1305-D1314.

PMID: 37953304 PMC: 10767934. DOI: 10.1093/nar/gkad1051.


Maximizing the utility of public data.

Ahmed M, Kim H, Kim D. Front Genet. 2023; 14:1106631.

PMID: 37065493 PMC: 10102460. DOI: 10.3389/fgene.2023.1106631.


Ten lessons for data sharing with a data commons.

Grossman R. Sci Data. 2023; 10(1):120.

PMID: 36878917 PMC: 9988927. DOI: 10.1038/s41597-023-02029-x.

