» Articles » PMID: 27629872

An Information Model for Computable Cancer Phenotypes

Overview
Publisher Biomed Central
Date 2016 Sep 16
PMID 27629872
Citations 16
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Standards, methods, and tools supporting the integration of clinical data and genomic information are an area of significant need and rapid growth in biomedical informatics. Integration of cancer clinical data and cancer genomic information poses unique challenges, because of the high volume and complexity of clinical data, as well as the heterogeneity and instability of cancer genome data when compared with germline data. Current information models of clinical and genomic data are not sufficiently expressive to represent individual observations and to aggregate those observations into longitudinal summaries over the course of cancer care. These models are acutely needed to support the development of systems and tools for generating the so called clinical "deep phenotype" of individual cancer patients, a process which remains almost entirely manual in cancer research and precision medicine.

Methods: Reviews of existing ontologies and interviews with cancer researchers were used to inform iterative development of a cancer phenotype information model. We translated a subset of the Fast Healthcare Interoperability Resources (FHIR) models into the OWL 2 Description Logic (DL) representation, and added extensions as needed for modeling cancer phenotypes with terms derived from the NCI Thesaurus. Models were validated with domain experts and evaluated against competency questions.

Results: The DeepPhe Information model represents cancer phenotype data at increasing levels of abstraction from mention level in clinical documents to summaries of key events and findings. We describe the model using breast cancer as an example, depicting methods to represent phenotypic features of cancers, tumors, treatment regimens, and specific biologic behaviors that span the entire course of a patient's disease.

Conclusions: We present a multi-scale information model for representing individual document mentions, document level classifications, episodes along a disease course, and phenotype summarization, linking individual observations to high-level summaries in support of subsequent integration and analysis.

Citing Articles

HL7 FHIR-based tools and initiatives to support clinical research: a scoping review.

Duda S, Kennedy N, Conway D, Cheng A, Nguyen V, Zayas-Caban T J Am Med Inform Assoc. 2022; 29(9):1642-1653.

PMID: 35818340 PMC: 9382376. DOI: 10.1093/jamia/ocac105.


A cancer graph: a lung cancer property graph database in Neo4j.

Tuck D BMC Res Notes. 2022; 15(1):45.

PMID: 35164854 PMC: 8842806. DOI: 10.1186/s13104-022-05912-9.


CQL4NLP: Development and Integration of FHIR NLP Extensions in Clinical Quality Language for EHR-driven Phenotyping.

Wen A, Rasmussen L, Stone D, Liu S, Kiefer R, Adekkanattu P AMIA Jt Summits Transl Sci Proc. 2021; 2021:624-633.

PMID: 34457178 PMC: 8378647.


Formal representation of patients' care context data: the path to improving the electronic health record.

Colicchio T, Dissanayake P, Cimino J J Am Med Inform Assoc. 2020; 27(11):1648-1657.

PMID: 32935127 PMC: 7671623. DOI: 10.1093/jamia/ocaa134.


Interactive Exploration of Longitudinal Cancer Patient Histories Extracted From Clinical Text.

Yuan Z, Finan S, Warner J, Savova G, Hochheiser H JCO Clin Cancer Inform. 2020; 4:412-420.

PMID: 32383981 PMC: 7265796. DOI: 10.1200/CCI.19.00115.


References
1.
Sojic A, Kutz O . Open biomedical pluralism: formalising knowledge about breast cancer phenotypes. J Biomed Semantics. 2012; 3 Suppl 2:S3. PMC: 3448532. DOI: 10.1186/2041-1480-3-S2-S3. View

2.
Komatsoulis G, Warzel D, Hartel F, Shanbhag K, Chilukuri R, Fragoso G . caCORE version 3: Implementation of a model driven, service-oriented architecture for semantic interoperability. J Biomed Inform. 2007; 41(1):106-23. PMC: 2254758. DOI: 10.1016/j.jbi.2007.03.009. View

3.
Denny J, Ritchie M, Basford M, Pulley J, Bastarache L, Brown-Gentry K . PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations. Bioinformatics. 2010; 26(9):1205-10. PMC: 2859132. DOI: 10.1093/bioinformatics/btq126. View

4.
Hiatt R, Tai C, Blayney D, Deapen D, Hogarth M, Kizer K . Leveraging state cancer registries to measure and improve the quality of cancer care: a potential strategy for California and beyond. J Natl Cancer Inst. 2015; 107(5). DOI: 10.1093/jnci/djv047. View

5.
Alterovitz G, Warner J, Zhang P, Chen Y, Ullman-Cullere M, Kreda D . SMART on FHIR Genomics: facilitating standardized clinico-genomic apps. J Am Med Inform Assoc. 2015; 22(6):1173-8. PMC: 11737838. DOI: 10.1093/jamia/ocv045. View