» Articles » PMID: 38058450

A Spatiotemporal XGBoost Model for PM Concentration Prediction and Its Application in Shanghai

Overview
Journal Heliyon
Specialty Social Sciences
Date 2023 Dec 7
PMID 38058450
Authors
Affiliations
Soon will be listed here.
Abstract

This paper innovatively constructed an analytical and forecasting framework to predict PM concentration levels for 16 municipal districts in Shanghai. By means of XGBoost parameters adjustment, empirical mode decomposition, and model fusion, improvements are made on XGBoost prediction accuracy and stability so that prediction deviation at extreme points can be avoided. The main findings of this paper can be summarized as follows: 1) Compared with the original model, the goodness of fit of the modified XGBoost model on the test set increased by 17 %, and the root mean square error decreased by 28 %; 2) The variation of PM concentration in Shanghai has a significant seasonal (cyclical) effect, and its fluctuation period is 3 months (a quarter). In winter, the frequency of extreme value points is significantly higher than that in other seasons; 3) In terms of spatial distribution, the PM concentration in the central city of Shanghai is higher than that in the rural areas, and the PM concentration gradually decreases from center city to the surrounding areas. The innovation and contribution of this paper can be summarized as follows: 1) EEMD algorithm verified by SSA was used to decompose the original model without reconstructing all subsequences and get the best weighing among each part of the hybrid model by using variable weight assignment; 2) The city was cut into pieces according to administrative districts in avoid of the duplicate analysis when utilizing advised Kriging interpolation; 3) IDW method was applied to verified Kriging interpolation to increase the accuracy; 4) The latitude and longitude were innovatively converted into the arc length of the corresponding spherical surface; 5) Hierarchical analysis method was used to obtain the order of importance among the PM monitoring stations, which could improve the accuracy and achieve dimension reduction.

Citing Articles

Seasonal and Meteorological Drivers of Hand, Foot, and Mouth Disease Outbreaks Using Data-Driven Machine Learning Models.

Lonlab P, Anupong S, Jainonthee C, Chadsuthi S Trop Med Infect Dis. 2025; 10(2).

PMID: 39998051 PMC: 11860531. DOI: 10.3390/tropicalmed10020048.

References
1.
Just A, De Carli M, Shtein A, Dorman M, Lyapustin A, Kloog I . Correcting Measurement Error in Satellite Aerosol Optical Depth with Machine Learning for Modeling PM in the Northeastern USA. Remote Sens (Basel). 2019; 10(5). PMC: 6497138. DOI: 10.3390/rs10050803. View

2.
Li X, Peng L, Yao X, Cui S, Hu Y, You C . Long short-term memory neural network for air pollutant concentration predictions: Method development and evaluation. Environ Pollut. 2017; 231(Pt 1):997-1004. DOI: 10.1016/j.envpol.2017.08.114. View

3.
Gui K, Che H, Zeng Z, Wang Y, Zhai S, Wang Z . Construction of a virtual PM observation network in China based on high-density surface meteorological observations using the Extreme Gradient Boosting model. Environ Int. 2020; 141:105801. DOI: 10.1016/j.envint.2020.105801. View

4.
Wang J, He L, Lu X, Zhou L, Tang H, Yan Y . A full-coverage estimation of PM concentrations using a hybrid XGBoost-WD model and WRF-simulated meteorological fields in the Yangtze River Delta Urban Agglomeration, China. Environ Res. 2021; 203:111799. DOI: 10.1016/j.envres.2021.111799. View

5.
Zhang Y, Zhang R, Ma Q, Wang Y, Wang Q, Huang Z . A feature selection and multi-model fusion-based approach of predicting air quality. ISA Trans. 2019; 100:210-220. DOI: 10.1016/j.isatra.2019.11.023. View