» Articles » PMID: 28983419

Predicate Oriented Pattern Analysis for Biomedical Knowledge Discovery

Overview
Date 2017 Oct 7
PMID 28983419
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

In the current biomedical data movement, numerous efforts have been made to convert and normalize a large number of traditional structured and unstructured data (e.g., EHRs, reports) to semi-structured data (e.g., RDF, OWL). With the increasing number of semi-structured data coming into the biomedical community, data integration and knowledge discovery from heterogeneous domains become important research problem. In the application level, detection of related concepts among medical ontologies is an important goal of life science research. It is more crucial to figure out how different concepts are related within a single ontology or across multiple ontologies by analysing predicates in different knowledge bases. However, the world today is one of information explosion, and it is extremely difficult for biomedical researchers to find existing or potential predicates to perform linking among cross domain concepts without any support from schema pattern analysis. Therefore, there is a need for a mechanism to do predicate oriented pattern analysis to partition heterogeneous ontologies into closer small topics and do query generation to discover cross domain knowledge from each topic. In this paper, we present such a model that predicates oriented pattern analysis based on their close relationship and generates a similarity matrix. Based on this similarity matrix, we apply an innovated unsupervised learning algorithm to partition large data sets into smaller and closer topics and generate meaningful queries to fully discover knowledge over a set of interlinked data sources. We have implemented a prototype system named BmQGen and evaluate the proposed model with colorectal surgical cohort from the Mayo Clinic.

Citing Articles

Constructing co-occurrence network embeddings to assist association extraction for COVID-19 and other coronavirus infectious diseases.

Oniani D, Jiang G, Liu H, Shen F J Am Med Inform Assoc. 2020; 27(8):1259-1267.

PMID: 32458963 PMC: 7314034. DOI: 10.1093/jamia/ocaa117.


Detecting Lifestyle Risk Factors for Chronic Kidney Disease With Comorbidities: Association Rule Mining Analysis of Web-Based Survey Data.

Peng S, Shen F, Wen A, Wang L, Fan Y, Liu X J Med Internet Res. 2019; 21(12):e14204.

PMID: 31821152 PMC: 6930505. DOI: 10.2196/14204.


Detection of Surgical Site Infection Utilizing Automated Feature Generation in Clinical Notes.

Shen F, Larson D, Naessens J, Habermann E, Liu H, Sohn S J Healthc Inform Res. 2019; 3(3):267-282.

PMID: 31728432 PMC: 6855398. DOI: 10.1007/s41666-018-0042-9.


Self-management interventions for chronic kidney disease: a systematic review and meta-analysis.

Peng S, He J, Huang J, Lun L, Zeng J, Zeng S BMC Nephrol. 2019; 20(1):142.

PMID: 31027481 PMC: 6486699. DOI: 10.1186/s12882-019-1309-y.


Incorporating Knowledge-Driven Insights into a Collaborative Filtering Model to Facilitate the Differential Diagnosis of Rare Diseases.

Shen F, Liu H AMIA Annu Symp Proc. 2019; 2018:1505-1514.

PMID: 30815196 PMC: 6371266.


References
1.
Dembele D, Kastner P . Fuzzy C-means method for clustering microarray data. Bioinformatics. 2003; 19(8):973-80. DOI: 10.1093/bioinformatics/btg119. View

2.
Kuhn M, Szklarczyk D, Franceschini A, von Mering C, Jensen L, Bork P . STITCH 3: zooming in on protein-chemical interactions. Nucleic Acids Res. 2011; 40(Database issue):D876-80. PMC: 3245073. DOI: 10.1093/nar/gkr1011. View

3.
Kinnings S, Liu N, Buchmeier N, Tonge P, Xie L, Bourne P . Drug discovery using chemical systems biology: repositioning the safe medicine Comtan to treat multi-drug and extensively drug resistant tuberculosis. PLoS Comput Biol. 2009; 5(7):e1000423. PMC: 2699117. DOI: 10.1371/journal.pcbi.1000423. View

4.
Johnson S . Hierarchical clustering schemes. Psychometrika. 1967; 32(3):241-54. DOI: 10.1007/BF02289588. View

5.
Garcia-Serna R, Ursu O, Oprea T, Mestres J . iPHACE: integrative navigation in pharmacological space. Bioinformatics. 2010; 26(7):985-6. PMC: 2844997. DOI: 10.1093/bioinformatics/btq061. View