
Populating the Data Ark: An Attempt to Retrieve, Preserve, and Liberate Data from the Most Highly-cited Psychology and Psychiatry Articles

Overview
Journal: PLoS One
Date: 2018 Aug 3
PMID: 30071110
Citations: 23
Abstract

The vast majority of scientific articles published to date have not been accompanied by publication of the underlying research data. This state of affairs precludes the routine re-use and re-analysis of research data, undermining the efficiency of the scientific enterprise and compromising the credibility of claims that cannot be independently verified. It may be especially important to make data available for the most influential studies that have provided a foundation for subsequent research and theory development. Therefore, we launched an initiative, the Data Ark, to examine whether we could retrospectively enhance the preservation and accessibility of important scientific data. Here we report the outcome of our efforts to retrieve, preserve, and liberate data from 111 of the most highly-cited articles published in psychology and psychiatry between 2006 and 2011 (n = 48) and between 2014 and 2016 (n = 63). Most data sets were not made available (76/111, 68%, 95% CI [60, 77]), some were made available only with restrictions (20/111, 18%, 95% CI [10, 27]), and few were made available in a completely unrestricted form (15/111, 14%, 95% CI [5, 22]). Where extant data sharing systems were in place, they usually (17/22, 77%, 95% CI [54, 91]) did not allow unrestricted access. Authors reported several barriers to data sharing, including issues related to data ownership and ethical concerns. The Data Ark initiative could help preserve and liberate important scientific data, surface barriers to data sharing, and advance community discussions on data stewardship.
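The proportions reported in the abstract can be checked with a few lines of arithmetic. The sketch below recomputes the percentages from the stated counts and attaches a Wilson score interval to each. The Wilson method is an assumption made here for illustration: the abstract does not say how its confidence intervals were computed, so the intervals this sketch produces need not match the reported ones exactly. The function name `wilson_interval` is hypothetical.

```python
import math


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion (no continuity correction).

    Assumption: this method is chosen for illustration only; the abstract
    does not state which interval method the authors used.
    """
    p = successes / n
    denom = 1 + z**2 / n
    center = (p + z**2 / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return center - half_width, center + half_width


# Counts taken directly from the abstract.
counts = {"not available": 76, "restricted": 20, "unrestricted": 15}
total = sum(counts.values())  # the three categories cover all 111 articles

for label, k in counts.items():
    low, high = wilson_interval(k, total)
    print(f"{label}: {k}/{total} = {100 * k / total:.0f}% "
          f"(Wilson 95% CI {100 * low:.0f} to {100 * high:.0f})")
```

The same function applies to the 17 of 22 existing data sharing systems that did not allow unrestricted access, by calling `wilson_interval(17, 22)`.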

Citing Articles

How will we prepare for an uncertain future? The value of open data and code for unborn generations facing climate change.

Gomes D Proc Biol Sci. 2025; 292(2040):20241515.

PMID: 39933586 PMC: 11813590. DOI: 10.1098/rspb.2024.1515.


Expanding the data Ark: an attempt to make the data from highly cited social science papers publicly available.

Dulitzki C, Crane S, Hardwicke T, Ioannidis J R Soc Open Sci. 2024; 11(5):240016.

PMID: 39076822 PMC: 11285638. DOI: 10.1098/rsos.240016.


Perceptions and Opinions Towards Data-Sharing: A Survey of Addiction Journal Editorial Board Members.

Anderson J, Johnson A, Rauh S, Johnson B, Bouvette M, Pinero I J Sci Pract Integr. 2024; 2022.

PMID: 38804666 PMC: 11129878. DOI: 10.35122/001c.35597.


Industry Involvement and Transparency in the Most Cited Clinical Trials, 2019-2022.

Siena L, Papamanolis L, Siebert M, Bellomo R, Ioannidis J JAMA Netw Open. 2023; 6(11):e2343425.

PMID: 37962883 PMC: 10646728. DOI: 10.1001/jamanetworkopen.2023.43425.


Care to share? Experimental evidence on code sharing behavior in the social sciences.

Krahmer D, Schachtele L, Schneck A PLoS One. 2023; 18(8):e0289380.

PMID: 37549146 PMC: 10406284. DOI: 10.1371/journal.pone.0289380.

