» Articles » PMID: 39135800

Machine Learning-based Prediction Model for the Efficacy and Safety of Statins

Overview
Journal Front Pharmacol
Date 2024 Aug 13
PMID 39135800
Authors
Affiliations
Soon will be listed here.
Abstract

Objective: The appropriate use of statins plays a vital role in reducing the risk of atherosclerotic cardiovascular disease (ASCVD). However, due to changes in diet and lifestyle, there has been a significant increase in the number of individuals with high cholesterol levels. Therefore, it is crucial to ensure the rational use of statins. Adverse reactions associated with statins, including liver enzyme abnormalities and statin-associated muscle symptoms (SAMS), have impacted their widespread utilization. In this study, we aimed to develop a predictive model for statin efficacy and safety based on real-world clinical data using machine learning techniques.

Methods: We employed various data preprocessing techniques, such as improved random forest imputation and Borderline SMOTE oversampling, to handle the dataset. Boruta method was utilized for feature selection, and the dataset was divided into training and testing sets in a 7:3 ratio. Five algorithms, including logistic regression, naive Bayes, decision tree, random forest, and gradient boosting decision tree, were used to construct the predictive models. Ten-fold cross-validation and bootstrapping sampling were performed for internal and external validation. Additionally, SHAP (SHapley Additive exPlanations) was employed for feature interpretability. Ultimately, an accessible web-based platform for predicting statin efficacy and safety was established based on the optimal predictive model.

Results: The random forest algorithm exhibited the best performance among the five algorithms. The predictive models for LDL-C target attainment (AUC = 0.883, Accuracy = 0.868, Precision = 0.858, Recall = 0.863, F1 = 0.860, AUPRC = 0.906, MCC = 0.761), liver enzyme abnormalities (AUC = 0.964, Accuracy = 0.964, Precision = 0.967, Recall = 0.963, F1 = 0.965, AUPRC = 0.978, MCC = 0.938), and muscle pain/Creatine kinase (CK) abnormalities (AUC = 0.981, Accuracy = 0.980, Precision = 0.987, Recall = 0.975, F1 = 0.981, AUPRC = 0.987, MCC = 0.965) demonstrated favorable performance. The most important features of LDL-C target attainment prediction model was cerebral infarction, TG, PLT and HDL. The most important features of liver enzyme abnormalities model was CRP, CK and number of oral medications. Similarly, AST, ALT, PLT and number of oral medications were found to be important features for muscle pain/CK abnormalities. Based on the best-performing predictive model, a user-friendly web application was designed and implemented.

Conclusion: This study presented a machine learning-based predictive model for statin efficacy and safety. The platform developed can assist in guiding statin therapy decisions and optimizing treatment strategies. Further research and application of the model are warranted to improve the utilization of statin therapy.

References
1.
Georgoulis M, Chrysohoou C, Georgousopoulou E, Damigou E, Skoumas I, Pitsavos C . Long-term prognostic value of LDL-C, HDL-C, lp(a) and TG levels on cardiovascular disease incidence, by body weight status, dietary habits and lipid-lowering treatment: the ATTICA epidemiological cohort study (2002-2012). Lipids Health Dis. 2022; 21(1):141. PMC: 9762061. DOI: 10.1186/s12944-022-01747-2. View

2.
Greenland P, Lloyd-Jones D . Role of Coronary Artery Calcium Testing for Risk Assessment in Primary Prevention of Atherosclerotic Cardiovascular Disease: A Review. JAMA Cardiol. 2021; 7(2):219-224. DOI: 10.1001/jamacardio.2021.3948. View

3.
Wang Q, Zhong Y, Yuan J, Shao L, Zhang J, Tang L . Targeting therapy of hepatocellular carcinoma with doxorubicin prodrug PDOX increases anti-metastatic effect and reduces toxicity: a preclinical study. J Transl Med. 2013; 11:192. PMC: 3765954. DOI: 10.1186/1479-5876-11-192. View

4.
Sun B, Yew P, Chi C, Song M, Loth M, Zhang R . Development and application of pharmacological statin-associated muscle symptoms phenotyping algorithms using structured and unstructured electronic health records data. JAMIA Open. 2023; 6(4):ooad087. PMC: 10597587. DOI: 10.1093/jamiaopen/ooad087. View

5.
Li S, Liu H, Guo Y, Zhu C, Wu N, Xu R . Improvement of evaluation in Chinese patients with atherosclerotic cardiovascular disease using the very-high-risk refinement: a population-based study. Lancet Reg Health West Pac. 2021; 17:100286. PMC: 8551815. DOI: 10.1016/j.lanwpc.2021.100286. View