» Articles » PMID: 39309369

Use of Machine Learning to Predict Onset of NAFLD in an All-Comers Cohort-Development and Validation in 2 Large Asian Cohorts

Overview
Journal Gastro Hep Adv
Specialty Gastroenterology
Date 2024 Sep 23
PMID 39309369
Authors
Affiliations
Soon will be listed here.
Abstract

Background And Aims: Nonalcoholic fatty liver disease (NAFLD) is one of the most common liver diseases. There are no universally accepted models that accurately predict time to onset of NAFLD. Machine learning (ML) models may allow prediction of such time-to-event (ie, survival) outcomes. This study aims to develop and independently validate ML-derived models to allow personalized prediction of time to onset of NAFLD in individuals who have no NAFLD at baseline.

Methods: The development dataset comprised 25,599 individuals from a South Korean NAFLD registry. A random 70:30 split divided it into training and internal validation sets. ML survival models (random survival forest, extra survival trees) were fitted, with time to NAFLD diagnosis in months as the target variable and routine anthropometric and laboratory parameters as predictors. The independent validation dataset comprised 16,173 individuals from a Chinese open dataset. Models were evaluated using the concordance index (c-index) and Brier score on both the internal and independent validation sets.

Results: The datasets (development vs independent validation) had 1,331,107 vs 543,874 person months of follow-up, NAFLD incidence of 25.7% (6584 individuals) vs 14.4% (2322 individuals), and median time to NAFLD onset of 60 (interquartile range 38-75) vs 24 (interquartile range 13-37) months, respectively. The ML models achieved a good c-index of >0.7 in the validation cohort-random survival forest 0.751 (95% confidence interval 0.742-0.759), extra survival trees 0.752 (95% confidence interval 0.744-0.762).

Conclusion: ML models can predict time-to-onset of NAFLD based on routine patient data. They can be used by clinicians to deliver personalized predictions to patients, which may facilitate patient counseling and clinical decision making on interval imaging timing.

References
1.
Sanyal D, Mukherjee P, Raychaudhuri M, Ghosh S, Mukherjee S, Chowdhury S . Profile of liver enzymes in non-alcoholic fatty liver disease in patients with impaired glucose tolerance and newly detected untreated type 2 diabetes. Indian J Endocrinol Metab. 2015; 19(5):597-601. PMC: 4566337. DOI: 10.4103/2230-8210.163172. View

2.
Lee C, Choe E, Choi J, Hwang Y, Lee Y, Park B . Health and Prevention Enhancement (H-PEACE): a retrospective, population-based cohort study conducted at the Seoul National University Hospital Gangnam Center, Korea. BMJ Open. 2018; 8(4):e019327. PMC: 5914782. DOI: 10.1136/bmjopen-2017-019327. View

3.
Sun D, Wu S, Liu W, Wang L, Chen Y, Zhang D . Association of low-density lipoprotein cholesterol within the normal range and NAFLD in the non-obese Chinese population: a cross-sectional and longitudinal study. BMJ Open. 2016; 6(12):e013781. PMC: 5168665. DOI: 10.1136/bmjopen-2016-013781. View

4.
Liang Y, Chen H, Liu Y, Hou X, Wei L, Bao Y . Association of MAFLD With Diabetes, Chronic Kidney Disease, and Cardiovascular Disease: A 4.6-Year Cohort Study in China. J Clin Endocrinol Metab. 2021; 107(1):88-97. PMC: 8684479. DOI: 10.1210/clinem/dgab641. View

5.
Cheung A, Ahmed A . Nonalcoholic Fatty Liver Disease and Chronic Kidney Disease: A Review of Links and Risks. Clin Exp Gastroenterol. 2021; 14:457-465. PMC: 8607580. DOI: 10.2147/CEG.S226130. View