» Articles » PMID: 36949434

A Study on Predicting the Length of Hospital Stay for Chinese Patients with Ischemic Stroke Based on the XGBoost Algorithm

Overview
Publisher Biomed Central
Date 2023 Mar 23
PMID 36949434
Authors
Affiliations
Soon will be listed here.
Abstract

Background: The incidence of stroke is a challenge in China, as stroke imposes a heavy burden on families, national health services, social services, and the economy. The length of hospital stay (LOS) is an essential indicator of utilization of medical services and is usually used to assess the efficiency of hospital management and patient quality of care. This study established a prediction model based on a machine learning algorithm to predict ischemic stroke patients' LOS.

Methods: A total of 18,195 ischemic stroke patients' electronic medical records and 28 attributes were extracted from electronic medical records in a large comprehensive hospital in China. The prediction of LOS was regarded as a multi classification problem, and LOS was divided into three categories: 1-7 days, 8-14 days and more than 14 days. After preprocessing the data and feature selection, the XGBoost algorithm was used to build a machine learning model. Ten fold cross-validation was used for model validation. The accuracy (ACC), recall rate (RE) and F1 measure were used to evaluate the performance of the prediction model of LOS of ischemic stroke patients. Finally, the XGBoost algorithm was used to identify and remove irrelevant features by ranking all attributes based on feature importance.

Results: Compared with the naive Bayesian algorithm, logistic region algorithm, decision tree classifier algorithm and ADaBoost classifier algorithm, the XGBoot algorithm has higher ACC, RE and F1 measure. The average ACC, RE and F1 measure were 0.89, 0.89 and 0.89 under the 10-fold cross-validation. According to the analysis of the importance of features, the LOS of ischemic stroke patients was affected by demographic characteristics, past medical history, admission examination features, and operation characteristics. Finally, the features in terms of hemiplegia aphasia, MRS, NIHSS, TIA, Operation or not, coma index etc. were found to be the top features in importance in predicting the LOS of ischemic stroke patients.

Conclusions: The XGBoost algorithm was an appropriate machine learning method for predicting the LOS of patients with ischemic stroke. Based on the prediction model, an intelligent medical management prediction system could be developed to predict the LOS based on ischemic stroke patients' electronic medical records.

Citing Articles

Prediction of poststroke independent walking using machine learning: a retrospective study.

Tang Z, Su W, Liu T, Lu H, Liu Y, Li H BMC Neurol. 2024; 24(1):332.

PMID: 39256684 PMC: 11385990. DOI: 10.1186/s12883-024-03849-z.


Integrated single cell-RNA sequencing and Mendelian randomization for ischemic stroke and metabolic syndrome.

Li J, Shen S, Yu C, Sun S, Zheng P iScience. 2024; 27(7):110240.

PMID: 39021802 PMC: 11253530. DOI: 10.1016/j.isci.2024.110240.


Application of machine learning models on predicting the length of hospital stay in fragility fracture patients.

Lai C, Mok P, Chau W, Law S BMC Med Inform Decis Mak. 2024; 24(1):26.

PMID: 38291406 PMC: 10826155. DOI: 10.1186/s12911-024-02417-2.


Supervised machine learning algorithms to predict the duration and risk of long-term hospitalization in HIV-infected individuals: a retrospective study.

Li J, Hao Y, Liu Y, Wu L, Liang H, Ni L Front Public Health. 2024; 11:1282324.

PMID: 38249414 PMC: 10796994. DOI: 10.3389/fpubh.2023.1282324.


Development of a Machine Learning Model to Predict the Use of Surgery in Patients With Rheumatoid Arthritis.

Baxter N, Lin C, Wallace B, Chen J, Kuo C, Chung K Arthritis Care Res (Hoboken). 2023; 76(5):636-643.

PMID: 38155538 PMC: 11039369. DOI: 10.1002/acr.25287.

References
1.
. Global, regional, and national age-sex specific mortality for 264 causes of death, 1980-2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet. 2017; 390(10100):1151-1210. PMC: 5605883. DOI: 10.1016/S0140-6736(17)32152-9. View

2.
Luo L, Li J, Lian S, Zeng X, Sun L, Li C . Using machine learning approaches to predict high-cost chronic obstructive pulmonary disease patients in China. Health Informatics J. 2019; 26(3):1577-1598. DOI: 10.1177/1460458219881335. View

3.
Wang W, Jiang B, Sun H, Ru X, Sun D, Wang L . Prevalence, Incidence, and Mortality of Stroke in China: Results from a Nationwide Population-Based Survey of 480 687 Adults. Circulation. 2017; 135(8):759-771. DOI: 10.1161/CIRCULATIONAHA.116.025250. View

4.
Feigin V, Nguyen G, Cercy K, Johnson C, Alam T, Parmar P . Global, Regional, and Country-Specific Lifetime Risks of Stroke, 1990 and 2016. N Engl J Med. 2018; 379(25):2429-2437. PMC: 6247346. DOI: 10.1056/NEJMoa1804492. View

5.
Badaki-Makun O, Casella J, Tackett S, Tao X, Chamberlain J . Association of Antibiotic Choice With Hospital Length of Stay and Risk Factors for Readmission in Patients With Sickle Cell Disease and Acute Chest Syndrome: An Observational Cohort Study. Ann Emerg Med. 2020; 76(3S):S37-S45. DOI: 10.1016/j.annemergmed.2020.08.011. View