Challenges with Quality of Race and Ethnicity Data in Observational Databases

Overview
Date 2019 Aug 1
PMID 31365089
Citations 90
Abstract

Objective: We sought to assess the quality of race and ethnicity information in observational health databases, including electronic health records (EHRs), and to propose patient self-recording as an improvement strategy.

Materials and Methods: We assessed completeness of race and ethnicity information in large observational health databases in the United States (Healthcare Cost and Utilization Project and Optum Labs), and at a single healthcare system in New York City serving a racially and ethnically diverse population. We compared race and ethnicity data collected via administrative processes with data recorded directly by respondents via paper surveys (National Health and Nutrition Examination Survey and Hospital Consumer Assessment of Healthcare Providers and Systems). Respondent-recorded data were considered the gold standard for the collection of race and ethnicity information.

Results: Among the 160 million patients from the Healthcare Cost and Utilization Project and Optum Labs datasets, race or ethnicity was unknown for 25%. Among the 2.4 million patients in the single New York City healthcare system's EHR, race or ethnicity was unknown for 57%. However, when patients directly recorded their race and ethnicity, 86% provided clinically meaningful information, and 66% of patients reported information that was discrepant with the EHR.

Discussion: Race and ethnicity data are critical to support precision medicine initiatives and to determine healthcare disparities; however, the quality of this information in observational databases is concerning. Patient self-recording through the use of patient-facing tools can substantially increase the quality of the information while engaging patients in their health.

Conclusions: Patient self-recording may improve the completeness of race and ethnicity information.

Citing Articles

Bias in Prediction Models to Identify Patients With Colorectal Cancer at High Risk for Readmission After Resection.

Lucas M, Schootman M, Laryea J, Orcutt S, Li C, Ying J. JCO Clin Cancer Inform. 2025; 8.

PMID: 39831110 PMC: 11741203. DOI: 10.1200/CCI.23.00194.


Evaluating racial and ethnic disparities in antibiotic treatment for pneumonia patients in a major academic health system.

Evans D, Fortin-Leung K, Kumar V, Ma Y, Asrani R, Wiley Z. Antimicrob Steward Healthc Epidemiol. 2025; 4(1):e221.

PMID: 39758876 PMC: 11696603. DOI: 10.1017/ash.2024.472.


Equity in cancer care: mixed methods clinical utility analysis of the Nursing Equity Assessment Tool (NEAT) to identify disadvantage in newly diagnosed cancer patients.

Chung H, Crone E, Gough K, Hyatt A, Milne D, Krishnasamy M. Support Care Cancer. 2024; 33(1):60.

PMID: 39738715 PMC: 11683018. DOI: 10.1007/s00520-024-09094-x.


Relationships Between Hearing-Related and Health-Related Variables in Academic Progress of Children With Unilateral Hearing Loss.

Picou E, Davis H, Tang L, Bastarache L, Tharpe A. J Speech Lang Hear Res. 2024; 68(1):364-376.

PMID: 39671254 PMC: 11842058. DOI: 10.1044/2024_JSLHR-24-00133.


Contextualized race and ethnicity annotations for clinical text from MIMIC-III.

Bear Dont Walk 4th O, Pichon A, Reyes Nieva H, Sun T, Li J, Joseph J. Sci Data. 2024; 11(1):1332.

PMID: 39638783 PMC: 11621419. DOI: 10.1038/s41597-024-04183-2.

