» Articles » PMID: 30741241

Cancer Phenotype Development: A Literature Review

Overview
Publisher IOS Press
Date 2019 Feb 12
PMID 30741241
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

EHR-based, computable phenotypes can be leveraged by healthcare organizations and researchers to improve the cohort identification process. The ability to identify patient cohorts using aspects of care and outcomes based on clinical characteristics or diagnostic conditions and/or risk factors presents opportunities to researchers targeting specific populations for drug development and disease interventions. The objective of this review was to summarize the literature describing the development and use of phenotypes for cohort identification of cancer patients. A survey of the literature indexed in PubMed was performed to identify studies using EHR-based phenotypes for use in cancer studies. Specific search criteria were formulated by leveraging a phenotype identification guideline developed by the Phenotypes, Data Standards, and Data Quality Core of the NIH Health Care Systems Research Collaboratory. The final set of articles was examined further to identify 1) the cancer of interest and 2) the different approaches used for phenotype development, validation and implementation. The articles reviewed were specific to breast cancer, colorectal cancer, ovarian cancer, and lung cancer. The approaches taken for phenotype development and validation varied slightly among the relevant publications. Four studies relied on chart review, three utilized machine learning techniques, one took an ontological approach, and one utilized natural language processing (NLP).

Citing Articles

Are ICD codes reliable for observational studies? Assessing coding consistency for data quality.

Nelson S, Yin Y, Trujillo Rivera E, Shao Y, Ma P, Tuttle M Digit Health. 2024; 10:20552076241297056.

PMID: 39493629 PMC: 11528819. DOI: 10.1177/20552076241297056.


Challenges and Opportunities for Data Science in Women's Health.

Edwards T, Greene C, Piekos J, Hellwege J, Hampton G, Jasper E Annu Rev Biomed Data Sci. 2023; 6:23-45.

PMID: 37040736 PMC: 10877578. DOI: 10.1146/annurev-biodatasci-020722-105958.


Electronic Medical Record-Based Case Phenotyping for the Charlson Conditions: Scoping Review.

Lee S, Doktorchik C, Martin E, DSouza A, Eastwood C, Shaheen A JMIR Med Inform. 2021; 9(2):e23934.

PMID: 33522976 PMC: 7884219. DOI: 10.2196/23934.


Natural Language Processing of Serum Protein Electrophoresis Reports in the Veterans Affairs Health Care System.

Ryu J, Zimolzak A JCO Clin Cancer Inform. 2020; 4:749-756.

PMID: 32813561 PMC: 7477876. DOI: 10.1200/CCI.19.00167.