» Articles » PMID: 38870031

Validation of Multi-State EHR-Based Network for Disease Surveillance (MENDS) Data and Implications for Improving Data Quality and Representativeness

Overview
Specialty Public Health
Date 2024 Jun 13
PMID 38870031
Authors
Affiliations
Soon will be listed here.
Abstract

Introduction: Surveillance modernization efforts emphasize the potential use of electronic health record (EHR) data to inform public health surveillance and prevention. However, EHR data streams vary widely in their completeness, accuracy, and representativeness.

Methods: We developed a validation process for the Multi-State EHR-Based Network for Disease Surveillance (MENDS) pilot project to identify and resolve data quality issues that could affect chronic disease prevalence estimates. We examined MENDS validation processes from December 2020 through August 2023 across 5 data-contributing organizations and outlined steps to resolve data quality issues.

Results: We identified gaps in the EHR databases of data contributors and in the processes to extract, map, integrate, and analyze their EHR data. Examples of source-data problems included missing data on race and ethnicity and zip codes. Examples of data processing problems included duplicate or missing patient records, lower-than-expected volumes of data, use of multiple fields for a single data type, and implausible values.

Conclusion: Validation protocols identified critical errors in both EHR source data and in the processes used to transform these data for analysis. Our experience highlights the value and importance of data validation to improve data quality and the accuracy of surveillance estimates that use EHR data. The validation process and lessons learned can be applied broadly to other EHR-based surveillance efforts.

Citing Articles

Demonstrated Progress and Future Promise of Chronic Disease Data Modernization.

Turner K, Hohman K Prev Chronic Dis. 2024; 21:E86.

PMID: 39481011 PMC: 11567503. DOI: 10.5888/pcd21.240396.

References
1.
Tsiampalis T, Panagiotakos D . Methodological issues of the electronic health records' use in the context of epidemiological investigations, in light of missing data: a review of the recent literature. BMC Med Res Methodol. 2023; 23(1):180. PMC: 10410989. DOI: 10.1186/s12874-023-02004-5. View

2.
Ansari B, Hart-Malloy R, Rosenberg E, Trigg M, Martin E . Modeling the Potential Impact of Missing Race and Ethnicity Data in Infectious Disease Surveillance Systems on Disparity Measures: Scenario Analysis of Different Imputation Strategies. JMIR Public Health Surveill. 2022; 8(11):e38037. PMC: 9685511. DOI: 10.2196/38037. View

3.
Proumen R, Connolly H, Debick N, Hopkins R . Assessing the accuracy of electronic health record gender identity and REaL data at an academic medical center. BMC Health Serv Res. 2023; 23(1):884. PMC: 10463428. DOI: 10.1186/s12913-023-09825-6. View

4.
Yoon J, Mizrahi M, Ghalaty N, Jarvinen T, Ravi A, Brune P . EHR-Safe: generating high-fidelity and privacy-preserving synthetic electronic health records. NPJ Digit Med. 2023; 6(1):141. PMC: 10421926. DOI: 10.1038/s41746-023-00888-7. View

5.
Johnson J, Moore B, Hwang E, Hickner A, Yeo H . The accuracy of race & ethnicity data in US based healthcare databases: A systematic review. Am J Surg. 2023; 226(4):463-470. DOI: 10.1016/j.amjsurg.2023.05.011. View