» Articles » PMID: 38488510

NCI Cancer Research Data Commons: Lessons Learned and Future State

Abstract

More than ever, scientific progress in cancer research hinges on our ability to combine datasets and extract meaningful interpretations to better understand diseases and ultimately inform the development of better treatments and diagnostic tools. To enable the successful sharing and use of big data, the NCI developed the Cancer Research Data Commons (CRDC), providing access to a large, comprehensive, and expanding collection of cancer data. The CRDC is a cloud-based data science infrastructure that eliminates the need for researchers to download and store large-scale datasets by allowing them to perform analysis where data reside. Over the past 10 years, the CRDC has made significant progress in providing access to data and tools along with training and outreach to support the cancer research community. In this review, we provide an overview of the history and the impact of the CRDC to date, lessons learned, and future plans to further promote data sharing, accessibility, interoperability, and reuse. See related articles by Brady et al., p. 1384, Wang et al., p. 1388, and Pot et al., p. 1396.

Citing Articles

Robust Cluster Prediction Across Data Types Validates Association of Sex and Therapy Response in GBM.

Gibbs D, Cioffi G, Aguilar B, Waite K, Pan E, Mandel J Cancers (Basel). 2025; 17(3).

PMID: 39941811 PMC: 11815886. DOI: 10.3390/cancers17030445.


Usage of the National Cancer Institute Cancer Research Data Commons by Researchers: A Scoping Review of the Literature.

Chen Z, Kim E, Davidsen T, Barnholtz-Sloan J JCO Clin Cancer Inform. 2024; 8:e2400116.

PMID: 39536277 PMC: 11575903. DOI: 10.1200/CCI.24.00116.


NCI's Proteomic Data Commons: A Cloud-Based Proteomics Repository Empowering Comprehensive Cancer Analysis through Cross-Referencing with Genomic and Imaging Data.

Thangudu R, Holck M, Singhal D, Pilozzi A, Edwards N, Rudnick P Cancer Res Commun. 2024; 4(9):2480-2488.

PMID: 39225545 PMC: 11413857. DOI: 10.1158/2767-9764.CRC-24-0243.


NCI Cancer Research Data Commons: Resources to Share Key Cancer Data.

Wang Z, Davidsen T, Kuffel G, Addepalli K, Bell A, Casas-Silva E Cancer Res. 2024; 84(9):1388-1395.

PMID: 38488507 PMC: 11063687. DOI: 10.1158/0008-5472.CAN-23-2468.


NCI Cancer Research Data Commons: Core Standards and Services.

Brady A, Charbonneau A, Grossman R, Creasy H, Renner R, Pihl T Cancer Res. 2024; 84(9):1384-1387.

PMID: 38488505 PMC: 11067691. DOI: 10.1158/0008-5472.CAN-23-2655.


References
1.
Fedorov A, R Longabaugh W, Pot D, Clunie D, Pieper S, Aerts H . NCI Imaging Data Commons. Cancer Res. 2021; 81(16):4188-4193. PMC: 8373794. DOI: 10.1158/0008-5472.CAN-21-0950. View

2.
Lau J, Lehnert E, Sethi A, Malhotra R, Kaushik G, Onder Z . The Cancer Genomics Cloud: Collaborative, Reproducible, and Democratized-A New Paradigm in Large-Scale Computational Research. Cancer Res. 2017; 77(21):e3-e6. PMC: 5832960. DOI: 10.1158/0008-5472.CAN-17-0387. View

3.
Wang Z, Davidsen T, Kuffel G, Addepalli K, Bell A, Casas-Silva E . NCI Cancer Research Data Commons: Resources to Share Key Cancer Data. Cancer Res. 2024; 84(9):1388-1395. PMC: 11063687. DOI: 10.1158/0008-5472.CAN-23-2468. View

4.
Wilkinson M, Dumontier M, Aalbersberg I, Appleton G, Axton M, Baak A . The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. 2016; 3:160018. PMC: 4792175. DOI: 10.1038/sdata.2016.18. View

5.
Brady A, Charbonneau A, Grossman R, Creasy H, Renner R, Pihl T . NCI Cancer Research Data Commons: Core Standards and Services. Cancer Res. 2024; 84(9):1384-1387. PMC: 11067691. DOI: 10.1158/0008-5472.CAN-23-2655. View