PMID: 36402838

Unifying the Identification of Biomedical Entities with the Bioregistry

Abstract

The standardized identification of biomedical entities is a cornerstone of interoperability, reuse, and data integration in the life sciences. Several registries have been developed to catalog resources maintaining identifiers for biomedical entities such as small molecules, proteins, cell lines, and clinical trials. However, existing registries have struggled to provide sufficient coverage and metadata standards that meet the evolving needs of modern life sciences researchers. Here, we introduce the Bioregistry, an integrative, open, community-driven metaregistry that synthesizes and substantially expands upon 23 existing registries. The Bioregistry addresses the need for a sustainable registry by leveraging public infrastructure and automation, and employing a progressive governance model centered around open code and open data to foster community contribution. The Bioregistry can be used to support the standardized annotation of data, models, ontologies, and scientific literature, thereby promoting their interoperability and reuse. The Bioregistry can be accessed through https://bioregistry.io and its source code and data are available under the MIT and CC0 Licenses at https://github.com/biopragmatics/bioregistry.

Citing Articles

The Proteomics Standards Initiative Standardized Formats for Spectral Libraries and Fragment Ion Peak Annotations: mzSpecLib and mzPAF.

Klein J, Lam H, Mak T, Bittremieux W, Perez-Riverol Y, Gabriels R. Anal Chem. 2024; 96(46):18491-18501.

PMID: 39514576 PMC: 11579979. DOI: 10.1021/acs.analchem.4c04091.


A framework for integrating biomedical knowledge in Wikidata with open biological and biomedical ontologies and MeSH keywords.

Turki H, Chebil K, Dossou B, Emezue C, Owodunni A, Hadj Taieb M. Heliyon. 2024; 10(19):e38448.

PMID: 39403518 PMC: 11471508. DOI: 10.1016/j.heliyon.2024.e38448.


Beyond protein lists: AI-assisted interpretation of proteomic investigations in the context of evolving scientific knowledge.

Gyori B, Vitek O. Nat Methods. 2024; 21(8):1387-1389.

PMID: 39122950 DOI: 10.1038/s41592-024-02324-4.


A Practical Approach to Using the Genomic Standards Consortium MIxS Reporting Standard for Comparative Genomics and Metagenomics.

Eloe-Fadrosh E, Mungall C, Miller M, Smith M, Patil S, Kelliher J. Methods Mol Biol. 2024; 2802:587-609.

PMID: 38819573 DOI: 10.1007/978-1-0716-3838-5_20.


The O3 guidelines: open data, open code, and open infrastructure for sustainable curated scientific resources.

Hoyt C, Gyori B. Sci Data. 2024; 11(1):547.

PMID: 38811583 PMC: 11136952. DOI: 10.1038/s41597-024-03406-w.

