» Articles » PMID: 37352010

A Case for Accelerating Standards to Achieve the FAIR Principles of Environmental Health Research Experimental Data

Overview
Date 2023 Jun 23
PMID 37352010
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Funding agencies, publishers, and other stakeholders are pushing environmental health science investigators to improve data sharing; to promote the findable, accessible, interoperable, and reusable (FAIR) principles; and to increase the rigor and reproducibility of the data collected. Accomplishing these goals will require significant cultural shifts surrounding data management and strategies to develop robust and reliable resources that bridge the technical challenges and gaps in expertise.

Objective: In this commentary, we examine the current state of managing data and metadata-referred to collectively as (meta)data-in the experimental environmental health sciences. We introduce new tools and resources based on experiments to serve as examples for the broader field.

Methods: We discuss previous and ongoing efforts to improve (meta)data collection and curation. These include global efforts by the Functional Genomics Data Society to develop metadata collection tools such as the Investigation, Study, Assay (ISA) framework, and the Center for Expanded Data Annotation and Retrieval. We also conduct a case study of data deposited in the Gene Expression Omnibus that demonstrates the current state of environmental health data and highlights the value of using the tools we propose to support data deposition.

Discussion: The environmental health science community has played a key role in efforts to achieve the goals of the FAIR guiding principles and is well positioned to advance them further. We present a proposed framework to further promote these objectives and minimize the obstacles between data producers and data scientists to maximize the return on research investments. https://doi.org/10.1289/EHP11484.

Citing Articles

A minimal metadata set (MNMS) to repurpose nonclinical in vivo data for biomedical research.

Moresis A, Restivo L, Bromilow S, Flik G, Rosati G, Scorrano F Lab Anim (NY). 2024; 53(3):67-79.

PMID: 38438748 PMC: 10912024. DOI: 10.1038/s41684-024-01335-0.

References
1.
Wallach J, Boyack K, Ioannidis J . Reproducible research practices, transparency, and open access data in the biomedical literature, 2015-2017. PLoS Biol. 2018; 16(11):e2006930. PMC: 6245499. DOI: 10.1371/journal.pbio.2006930. View

2.
Grossman R, Abel B, Angiuoli S, Barrett J, Bassett D, Bramlett K . Collaborating to Compete: Blood Profiling Atlas in Cancer (BloodPAC) Consortium. Clin Pharmacol Ther. 2017; 101(5):589-592. PMC: 5525192. DOI: 10.1002/cpt.666. View

3.
Nigam R, Munzenmaier D, Worthey E, Dwinell M, Shimoyama M, Jacob H . Rat Strain Ontology: structured controlled vocabulary designed to facilitate access to strain data at RGD. J Biomed Semantics. 2013; 4(1):36. PMC: 4177145. DOI: 10.1186/2041-1480-4-36. View

4.
Heacock M, Amolegbe S, Skalla L, Trottier B, Carlin D, Henry H . Sharing SRP data to reduce environmentally associated disease and promote transdisciplinary research. Rev Environ Health. 2020; 35(2):111-122. DOI: 10.1515/reveh-2019-0089. View

5.
Bhhatarai B, Walters W, Hop C, Lanza G, Ekins S . Opportunities and challenges using artificial intelligence in ADME/Tox. Nat Mater. 2019; 18(5):418-422. PMC: 6594826. DOI: 10.1038/s41563-019-0332-5. View