» Articles » PMID: 38787295

Examining Linguistic Differences in Electronic Health Records for Diverse Patients With Diabetes: Natural Language Processing Analysis

Overview
Journal JMIR Med Inform
Publisher JMIR Publications
Date 2024 May 24
PMID 38787295
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Individuals from minoritized racial and ethnic backgrounds experience pernicious and pervasive health disparities that have emerged, in part, from clinician bias.

Objective: We used a natural language processing approach to examine whether linguistic markers in electronic health record (EHR) notes differ based on the race and ethnicity of the patient. To validate this methodological approach, we also assessed the extent to which clinicians perceive linguistic markers to be indicative of bias.

Methods: In this cross-sectional study, we extracted EHR notes for patients who were aged 18 years or older; had more than 5 years of diabetes diagnosis codes; and received care between 2006 and 2014 from family physicians, general internists, or endocrinologists practicing in an urban, academic network of clinics. The race and ethnicity of patients were defined as White non-Hispanic, Black non-Hispanic, or Hispanic or Latino. We hypothesized that Sentiment Analysis and Social Cognition Engine (SEANCE) components (ie, negative adjectives, positive adjectives, joy words, fear and disgust words, politics words, respect words, trust verbs, and well-being words) and mean word count would be indicators of bias if racial differences emerged. We performed linear mixed effects analyses to examine the relationship between the outcomes of interest (the SEANCE components and word count) and patient race and ethnicity, controlling for patient age. To validate this approach, we asked clinicians to indicate the extent to which they thought variation in the use of SEANCE language domains for different racial and ethnic groups was reflective of bias in EHR notes.

Results: We examined EHR notes (n=12,905) of Black non-Hispanic, White non-Hispanic, and Hispanic or Latino patients (n=1562), who were seen by 281 physicians. A total of 27 clinicians participated in the validation study. In terms of bias, participants rated negative adjectives as 8.63 (SD 2.06), fear and disgust words as 8.11 (SD 2.15), and positive adjectives as 7.93 (SD 2.46) on a scale of 1 to 10, with 10 being extremely indicative of bias. Notes for Black non-Hispanic patients contained significantly more negative adjectives (coefficient 0.07, SE 0.02) and significantly more fear and disgust words (coefficient 0.007, SE 0.002) than those for White non-Hispanic patients. The notes for Hispanic or Latino patients included significantly fewer positive adjectives (coefficient -0.02, SE 0.007), trust verbs (coefficient -0.009, SE 0.004), and joy words (coefficient -0.03, SE 0.01) than those for White non-Hispanic patients.

Conclusions: This approach may enable physicians and researchers to identify and mitigate bias in medical interactions, with the goal of reducing health disparities stemming from bias.

Citing Articles

Identifying stigmatizing and positive/preferred language in obstetric clinical notes using natural language processing.

Scroggins J, Hulchafo I, Harkins S, Scharp D, Moen H, Davoudi A J Am Med Inform Assoc. 2024; 32(2):308-317.

PMID: 39569431 PMC: 11756426. DOI: 10.1093/jamia/ocae290.

References
1.
Penner L, Dovidio J, West T, Gaertner S, Albrecht T, Dailey R . Aversive Racism and Medical Interactions with Black Patients: A Field Study. J Exp Soc Psychol. 2010; 46(2):436-440. PMC: 2835170. DOI: 10.1016/j.jesp.2009.11.004. View

2.
Dunphy D, Stone P, Smith M . The general inquirer: further developments in a computer system for content analysis of verbal data in the social sciences. Behav Sci. 1965; 10(4):468-80. View

3.
Goddu A, OConor K, Lanzkron S, Saheed M, Saha S, Peek M . Do Words Matter? Stigmatizing Language and the Transmission of Bias in the Medical Record. J Gen Intern Med. 2018; 33(5):685-691. PMC: 5910343. DOI: 10.1007/s11606-017-4289-2. View

4.
Hall W, Chapman M, Lee K, Merino Y, Thomas T, Payne B . Implicit Racial/Ethnic Bias Among Health Care Professionals and Its Influence on Health Care Outcomes: A Systematic Review. Am J Public Health. 2015; 105(12):e60-76. PMC: 4638275. DOI: 10.2105/AJPH.2015.302903. View

5.
Siminoff L, Graham G, Gordon N . Cancer communication patterns and the influence of patient characteristics: disparities in information-giving and affective behaviors. Patient Educ Couns. 2006; 62(3):355-60. DOI: 10.1016/j.pec.2006.06.011. View