» Articles » PMID: 32969760

Opportunities, Pitfalls, and Alternatives in Adapting Electronic Health Records for Health Services Research

Overview
Publisher Sage Publications
Date 2020 Sep 24
PMID 32969760
Citations 12
Authors
Affiliations
Soon will be listed here.
Abstract

Electronic health records (EHRs) offer the potential to study large numbers of patients but are designed for clinical practice, not research. Despite the increasing availability of EHR data, their use in research comes with its own set of challenges. In this article, we describe some important considerations and potential solutions for commonly encountered problems when working with large-scale, EHR-derived data for health services and community-relevant health research. Specifically, using EHR data requires the researcher to define the relevant patient subpopulation, reliably identify the primary care provider, recognize the EHR as containing episodic (i.e., unstructured longitudinal) data, account for changes in health system composition and treatment options over time, understand that the EHR is not always well-organized and accurate, design methods to identify the same patient across multiple health systems, account for the enormous size of the EHR, and consider barriers to data access. Associations found in the EHR may be nonrepresentative of associations in the general population, but a clear understanding of the EHR-based associations can be enormously valuable to the process of improving outcomes for patients in learning health care systems. In the context of building 2 large-scale EHR-derived data sets for health services research, we describe the potential pitfalls of EHR data and propose some solutions for those planning to use EHR data in their research. As ever greater amounts of clinical data are amassed in the EHR, use of these data for research will become increasingly common and important. Attention to the intricacies of EHR data will allow for more informed analysis and interpretation of results from EHR-based data sets.

Citing Articles

From Basic to Extra Features: Hypergraph Transformer Pretrain-then-Finetuning for Balanced Clinical Predictions on EHR.

Xu R, Lu Y, Liu C, Chen Y, Sun Y, Hu X Proc Mach Learn Res. 2025; 248:182-197.

PMID: 40041452 PMC: 11876795.


Neighbourhood socioeconomic position, prenatal care and fulfilment of postpartum permanent contraception: Findings from a multisite cohort study.

Berg K, Bullington B, Gunzler D, Miller E, Boozer M, Serna T Reprod Female Child Health. 2024; 3(1).

PMID: 38737484 PMC: 11087039. DOI: 10.1002/rfc2.64.


Data gaps and opportunities for modeling cancer health equity.

Trentham-Dietz A, Corley D, Del Vecchio N, Greenlee R, Haas J, Hubbard R J Natl Cancer Inst Monogr. 2023; 2023(62):246-254.

PMID: 37947335 PMC: 11009506. DOI: 10.1093/jncimonographs/lgad025.


Challenges and Opportunities for Data Science in Women's Health.

Edwards T, Greene C, Piekos J, Hellwege J, Hampton G, Jasper E Annu Rev Biomed Data Sci. 2023; 6:23-45.

PMID: 37040736 PMC: 10877578. DOI: 10.1146/annurev-biodatasci-020722-105958.


A Path to Real-World Evidence in Critical Care Using Open-Source Data Harmonization Tools.

Heavner S, Anderson W, Kashyap R, Dasher P, Mathe E, Merson L Crit Care Explor. 2023; 5(4):e0893.

PMID: 37025303 PMC: 10072311. DOI: 10.1097/CCE.0000000000000893.


References
1.
Weiskopf N, Weng C . Methods and dimensions of electronic health record data quality assessment: enabling reuse for clinical research. J Am Med Inform Assoc. 2012; 20(1):144-51. PMC: 3555312. DOI: 10.1136/amiajnl-2011-000681. View

2.
Mai P, Wideroff L, Greene M, Graubard B . Prevalence of family history of breast, colorectal, prostate, and lung cancer in a population-based study. Public Health Genomics. 2010; 13(7-8):495-503. PMC: 3025897. DOI: 10.1159/000294469. View

3.
Rassen J, Bartels D, Schneeweiss S, Patrick A, Murk W . Measuring prevalence and incidence of chronic conditions in claims and electronic health record databases. Clin Epidemiol. 2018; 11:1-15. PMC: 6301730. DOI: 10.2147/CLEP.S181242. View

4.
Manuel D, Rosella L, Stukel T . Importance of accurately identifying disease in studies using electronic health records. BMJ. 2010; 341:c4226. DOI: 10.1136/bmj.c4226. View

5.
Verheij R, Curcin V, Delaney B, McGilchrist M . Possible Sources of Bias in Primary Care Electronic Health Record Data Use and Reuse. J Med Internet Res. 2018; 20(5):e185. PMC: 5997930. DOI: 10.2196/jmir.9134. View