PMID: 37953304

The DO-KB Knowledgebase: a 20-year Journey Developing the Disease Open Science Ecosystem

Abstract

In 2003, the Human Disease Ontology (DO, https://disease-ontology.org/) was established at Northwestern University. In the intervening 20 years, the DO has expanded to become a highly utilized disease knowledge resource. Serving as the nomenclature and classification standard for human diseases, the DO provides a stable, etiology-based structure integrating mechanistic drivers of human disease. Over the past two decades, the DO has grown from a collection of clinical vocabularies into an expertly curated semantic resource of over 11,300 common and rare diseases, linking disease concepts through more than 37,000 vocabulary cross mappings (v2023-08-08). Here, we introduce the recently launched DO Knowledgebase (DO-KB), which expands the DO's representation of the diseaseome and enhances the findability, accessibility, interoperability and reusability (FAIR) of disease data through a new SPARQL service and a new Faceted Search Interface. The DO-KB is an integrated data system, built upon the DO's semantic disease knowledge backbone, with resources that expose and connect the DO's semantic knowledge with disease-related data across Open Linked Data resources. This update describes efforts to assess the DO's global impact and improvements to data quality and content, with emphasis on changes in the last two years.

Citing Articles

A network medicine approach to investigating ME/CFS pathogenesis in severely ill patients: a pilot study.

Hung L, Wu C, Chang C, Li P, Hicks K, Dibble J. Front Hum Neurosci. 2025; 19:1509346.

PMID: 39996021 PMC: 11847890. DOI: 10.3389/fnhum.2025.1509346.


Simplicity within biological complexity.

Przulj N, Malod-Dognin N. Bioinform Adv. 2025; 5(1):vbae164.

PMID: 39927291 PMC: 11805345. DOI: 10.1093/bioadv/vbae164.


AMEND 2.0: module identification and multi-omic data integration with multiplex-heterogeneous graphs.

Boyd S, Slawson C, Thompson J. BMC Bioinformatics. 2025; 26(1):39.

PMID: 39910456 PMC: 11800622. DOI: 10.1186/s12859-025-06063-x.


Standardized pipelines support and facilitate integration of diverse datasets at the Rat Genome Database.

Smith J, Tutaj M, Thota J, Lamers L, Gibson A, Kundurthi A. Database (Oxford). 2025; 2025.

PMID: 39841812 PMC: 11753291. DOI: 10.1093/database/baae132.


Disease Network-Based Approaches to Study Comorbidity in Heart Failure: Current State and Future Perspectives.

Gomez-Ochoa S, Lanzer J, Levinson R. Curr Heart Fail Rep. 2024; 22(1):6.

PMID: 39725810 PMC: 11671564. DOI: 10.1007/s11897-024-00693-7.

