» Articles » PMID: 26342218

Desiderata for Computable Representations of Electronic Health Records-driven Phenotype Algorithms

Abstract

Background: Electronic health records (EHRs) are increasingly used for clinical and translational research through the creation of phenotype algorithms. Currently, phenotype algorithms are most commonly represented as noncomputable descriptive documents and knowledge artifacts that detail the protocols for querying diagnoses, symptoms, procedures, medications, and/or text-driven medical concepts, and are primarily meant for human comprehension. We present desiderata for developing a computable phenotype representation model (PheRM).

Methods: A team of clinicians and informaticians reviewed common features for multisite phenotype algorithms published in PheKB.org and existing phenotype representation platforms. We also evaluated well-known diagnostic criteria and clinical decision-making guidelines to encompass a broader category of algorithms.

Results: We propose 10 desired characteristics for a flexible, computable PheRM: (1) structure clinical data into queryable forms; (2) recommend use of a common data model, but also support customization for the variability and availability of EHR data among sites; (3) support both human-readable and computable representations of phenotype algorithms; (4) implement set operations and relational algebra for modeling phenotype algorithms; (5) represent phenotype criteria with structured rules; (6) support defining temporal relations between events; (7) use standardized terminologies and ontologies, and facilitate reuse of value sets; (8) define representations for text searching and natural language processing; (9) provide interfaces for external software algorithms; and (10) maintain backward compatibility.

Conclusion: A computable PheRM is needed for true phenotype portability and reliability across different EHR products and healthcare systems. These desiderata are a guide to inform the establishment and evolution of EHR phenotype algorithm authoring platforms and languages.

Citing Articles

Methods for identifying health status from routinely collected health data: An overview.

Liu M, Deng K, Wang M, He Q, Xu J, Li G Integr Med Res. 2025; 14(1):101100.

PMID: 39897572 PMC: 11786076. DOI: 10.1016/j.imr.2024.101100.


Toward a Computable Phenotype for Determining Eligibility of Lung Cancer Screening Using Electronic Health Records.

Yang S, Huang Y, Lou X, Lyu T, Wei R, Mehta H JCO Clin Cancer Inform. 2025; 9():e2400139.

PMID: 39818952 PMC: 11748906. DOI: 10.1200/CCI.24.00139.


Health equity innovation in precision medicine: data stewardship and agency to expand representation in clinicogenomics.

Silva P, Rahimzadeh V, Powell R, Husain J, Grossman S, Hansen A Health Res Policy Syst. 2024; 22(1):170.

PMID: 39695714 PMC: 11657299. DOI: 10.1186/s12961-024-01258-9.


Developing an automated algorithm for identification of children and adolescents with diabetes using electronic health records from the OneFlorida+ clinical research network.

Li P, Spector E, Alkhuzam K, Patel R, Donahoo W, Bost S Diabetes Obes Metab. 2024; 27(1):102-110.

PMID: 39344840 PMC: 11620941. DOI: 10.1111/dom.15987.


Using electronic health records for clinical pharmacology research: Challenges and considerations.

Jafari E, Blackman M, Karnes J, Van Driest S, Crawford D, Choi L Clin Transl Sci. 2024; 17(7):e13871.

PMID: 38943244 PMC: 11213823. DOI: 10.1111/cts.13871.


References
1.
Kizer K . Establishing health care performance standards in an era of consumerism. JAMA. 2001; 286(10):1213-7. DOI: 10.1001/jama.286.10.1213. View

2.
Aronson A . Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program. Proc AMIA Symp. 2002; :17-21. PMC: 2243666. View

3.
Chapman W, Bridewell W, Hanbury P, Cooper G, Buchanan B . A simple algorithm for identifying negated findings and diseases in discharge summaries. J Biomed Inform. 2002; 34(5):301-10. DOI: 10.1006/jbin.2001.1029. View

4.
Helleman J, Goossen W . Modeling nursing care in health level 7 reference information model. Comput Inform Nurs. 2003; 21(1):37-45. DOI: 10.1097/00024665-200301000-00012. View

5.
Payne P, Johnson S, Starren J, Tilson H, Dowdy D . Breaking the translational barriers: the value of integrating biomedical informatics and translational research. J Investig Med. 2005; 53(4):192-200. DOI: 10.2310/6650.2005.00402. View