» Articles » PMID: 30294517

Knowledge-based Biomedical Data Science

Overview
Journal EPJ Data Sci
Date 2018 Oct 9
PMID 30294517
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

Computational manipulation of knowledge is an important, and often under-appreciated, aspect of biomedical Data Science. The first Data Science initiative from the US National Institutes of Health was entitled "Big Data to Knowledge (BD2K)." The main emphasis of the more than $200M allocated to that program has been on "Big Data;" the "Knowledge" component has largely been the implicit assumption that the work will lead to new biomedical knowledge. However, there is long-standing and highly productive work in knowledge representation and reasoning, and computational processing of knowledge has a role in the world of Data Science. Knowledge-based biomedical Data Science involves the design and implementation of computer systems that about biomedicine. There are many ways in which a computational approach might act as if it knew something: for example, it might be able to answer a natural language question about a biomedical topic, or pass an exam; it might be able to use existing biomedical knowledge to rank or evaluate hypotheses; it might explain or interpret data in light of prior knowledge, either in a Bayesian or other sort of framework. These are all examples of automated reasoning that act on computational representations of knowledge. After a brief survey of existing approaches to knowledge-based data science, this position paper argues that such research is ripe for expansion, and expanded application.

Citing Articles

Knowledge-based approaches to drug discovery for rare diseases.

Alves V, Korn D, Pervitsky V, Thieme A, Capuzzi S, Baker N Drug Discov Today. 2021; 27(2):490-502.

PMID: 34718207 PMC: 9124594. DOI: 10.1016/j.drudis.2021.10.014.


Knowledge-Based Biomedical Data Science.

Callahan T, Tripodi I, Pielke-Lombardo H, Hunter L Annu Rev Biomed Data Sci. 2021; 3:23-41.

PMID: 33954284 PMC: 8095730. DOI: 10.1146/annurev-biodatasci-010820-091627.


COVID-19 Knowledge Extractor (COKE): A Tool and a Web Portal to Extract Drug - Target Protein Associations from the CORD-19 Corpus of Scientific Publications on COVID-19.

Korn D, Pervitsky V, Bobrowski T, Alves V, Schmitt C, Bizon C ChemRxiv. 2020; .

PMID: 33269341 PMC: 7709174. DOI: 10.26434/chemrxiv.13289222.


Pathway information extracted from 25 years of pathway figures.

Hanspers K, Riutta A, Summer-Kutmon M, Pico A Genome Biol. 2020; 21(1):273.

PMID: 33168034 PMC: 7649569. DOI: 10.1186/s13059-020-02181-2.

References
1.
Suthram S, Dudley J, Chiang A, Chen R, Hastie T, Butte A . Network-based elucidation of human disease similarities reveals common functional modules enriched for pluripotent drug targets. PLoS Comput Biol. 2010; 6(2):e1000662. PMC: 2816673. DOI: 10.1371/journal.pcbi.1000662. View

2.
Holford M, Krauthammer M . Mutadelic: mutation analysis using description logic inferencing capabilities. Bioinformatics. 2015; 31(23):3742-7. PMC: 6078193. DOI: 10.1093/bioinformatics/btv467. View

3.
Jansen K, Kim T, Coenen A, Saba V, Hardiker N . Harmonising Nursing Terminologies Using a Conceptual Framework. Stud Health Technol Inform. 2016; 225:471-5. View

4.
. Expansion of the Gene Ontology knowledgebase and resources. Nucleic Acids Res. 2016; 45(D1):D331-D338. PMC: 5210579. DOI: 10.1093/nar/gkw1108. View

5.
Leach S, Tipney H, Feng W, Baumgartner W, Kasliwal P, Schuyler R . Biomedical discovery acceleration, with applications to craniofacial development. PLoS Comput Biol. 2009; 5(3):e1000215. PMC: 2653649. DOI: 10.1371/journal.pcbi.1000215. View