PMID: 34697637

OBO Foundry in 2021: Operationalizing Open Data Principles to Evaluate Ontologies

Abstract

Biological ontologies are used to organize, curate and interpret the vast quantities of data arising from biological experiments. While this works well when using a single ontology, integrating multiple ontologies can be problematic, as they are developed independently, which can lead to incompatibilities. The Open Biological and Biomedical Ontologies (OBO) Foundry was created to address this by facilitating the development, harmonization, application and sharing of ontologies, guided by a set of overarching principles. One challenge in reaching these goals was that the OBO principles were not originally encoded in a precise fashion, and interpretation was subjective. Here, we show how we have addressed this by formally encoding the OBO principles as operational rules and implementing a suite of automated validation checks and a dashboard for objectively evaluating each ontology's compliance with each principle. This entailed a substantial effort to curate metadata across all ontologies and to coordinate with individual stakeholders. We have applied these checks across the full OBO suite of ontologies, revealing areas where individual ontologies require changes to conform to our principles. Our work demonstrates how a sizable, federated community can be organized and evaluated on objective criteria that help improve overall quality and interoperability, which is vital for the sustenance of the OBO project and towards the overall goals of making data Findable, Accessible, Interoperable, and Reusable (FAIR).

Database URL: http://obofoundry.org/

Citing Articles

A semantic approach to mapping the Provenance Ontology to Basic Formal Ontology.

Prudhomme T, De Colle G, Liebers A, Sculley A, Xie P, Cohen S. Sci Data. 2025; 12(1):282.

PMID: 39962095 PMC: 11833102. DOI: 10.1038/s41597-025-04580-1.


Increased discoverability of rare disease datasets through knowledge graph integration.

Braun I, Hartley E, Olson D, Matentzoglu N, Schaper K, Walls R. JAMIA Open. 2025; 8(1):ooaf001.

PMID: 39926165 PMC: 11806703. DOI: 10.1093/jamiaopen/ooaf001.


The Representational Challenge of Integration and Interoperability in Transformed Health Ecosystems.

Blobel B, Oemig F, Ruotsalainen P, Brochhausen M, Sexton K, Giacomini M. J Pers Med. 2025; 15(1).

PMID: 39852197 PMC: 11766756. DOI: 10.3390/jpm15010004.


A change language for ontologies and knowledge graphs.

Hegde H, Vendetti J, Goutte-Gattat D, Caufield J, Graybeal J, Harris N. Database (Oxford). 2025; 2025.

PMID: 39841813 PMC: 11753292. DOI: 10.1093/database/baae133.


Standardized pipelines support and facilitate integration of diverse datasets at the Rat Genome Database.

Smith J, Tutaj M, Thota J, Lamers L, Gibson A, Kundurthi A. Database (Oxford). 2025; 2025.

PMID: 39841812 PMC: 11753291. DOI: 10.1093/database/baae132.

