» Articles » PMID: 38896066

Identification of Hepatic Steatosis Among Persons with and Without HIV Using Natural Language Processing

Abstract

Background: Steatotic liver disease (SLD) is a growing phenomenon, and our understanding of its determinants has been limited by our ability to identify it clinically. Natural language processing (NLP) can potentially identify hepatic steatosis systematically within large clinical repositories of imaging reports. We validated the performance of an NLP algorithm for the identification of SLD in clinical imaging reports and applied this tool to a large population of people with and without HIV.

Methods: Patients were included in the analysis if they enrolled in the Veterans Aging Cohort Study between 2001 and 2017, had an imaging report inclusive of the liver, and had ≥2 years of observation before the imaging study. SLD was considered present when reports contained the terms "fatty," "steatosis," "steatotic," or "steatohepatitis." The performance of the SLD NLP algorithm was compared to a clinical review of 800 reports. We then applied the NLP algorithm to the first eligible imaging study and compared patient characteristics by SLD and HIV status.

Results: NLP achieved 100% sensitivity and 88.5% positive predictive value for the identification of SLD. When applied to 26,706 eligible Veterans Aging Cohort Study patient imaging reports, SLD was identified in 72.2% and did not significantly differ by HIV status. SLD was associated with a higher prevalence of metabolic comorbidities, alcohol use disorder, and hepatitis B and C, but not HIV infection.

Conclusions: While limited to those undergoing radiologic study, the NLP algorithm accurately identified SLD in people with and without HIV and offers a valuable tool to evaluate the determinants and consequences of hepatic steatosis.

Citing Articles

Automated identification of incidental hepatic steatosis on Emergency Department imaging using large language models.

Vong T, Rizer N, Jain V, Thompson V, Dredze M, Klein E Hepatol Commun. 2025; 9(3).

PMID: 39969431 PMC: 11841845. DOI: 10.1097/HC9.0000000000000638.

References
1.
Younossi Z, Koenig A, Abdelatif D, Fazel Y, Henry L, Wymer M . Global epidemiology of nonalcoholic fatty liver disease-Meta-analytic assessment of prevalence, incidence, and outcomes. Hepatology. 2015; 64(1):73-84. DOI: 10.1002/hep.28431. View

2.
Gaslightwala I, Bini E . Impact of human immunodeficiency virus infection on the prevalence and severity of steatosis in patients with chronic hepatitis C virus infection. J Hepatol. 2006; 44(6):1026-32. DOI: 10.1016/j.jhep.2006.02.009. View

3.
Eyre H, Chapman A, Peterson K, Shi J, Alba P, Jones M . Launching into clinical space with medspaCy: a new clinical text processing toolkit in Python. AMIA Annu Symp Proc. 2022; 2021:438-447. PMC: 8861690. View

4.
Natarajan Y, Kramer J, Yu X, Li L, Thrift A, El-Serag H . Risk of Cirrhosis and Hepatocellular Cancer in Patients With NAFLD and Normal Liver Enzymes. Hepatology. 2020; 72(4):1242-1252. PMC: 8318072. DOI: 10.1002/hep.31157. View

5.
Price J, Wang R, Seaberg E, Budoff M, Kingsley L, Palella F . The Association of Inflammatory Markers With Nonalcoholic Fatty Liver Disease Differs by Human Immunodeficiency Virus Serostatus. Open Forum Infect Dis. 2017; 4(3):ofx153. PMC: 5601080. DOI: 10.1093/ofid/ofx153. View