A Machine Learning Model to Estimate Ambient PM Concentrations in Industrialized Highveld Region of South Africa
Overview
Authors
Affiliations
Exposure to fine particulate matter (PM) has been linked to a substantial disease burden globally, yet little has been done to estimate the population health risks of PM in South Africa due to the lack of high-resolution PM exposure estimates. We developed a random forest model to estimate daily PM concentrations at 1 km resolution in and around industrialized Gauteng Province, South Africa, by combining satellite aerosol optical depth (AOD), meteorology, land use, and socioeconomic data. We then compared PM concentrations in the study domain before and after the implementation of the new national air quality standards. We aimed to test whether machine learning models are suitable for regions with sparse ground observations such as South Africa and which predictors played important roles in PM modeling. The cross-validation R and Root Mean Square Error of our model was 0.80 and 9.40 μg/m, respectively. Satellite AOD, seasonal indicator, total precipitation, and population were among the most important predictors. Model-estimated PM levels successfully captured the temporal pattern recorded by ground observations. Spatially, the highest annual PM concentration appeared in central and northern Gauteng, including northern Johannesburg and the city of Tshwane. Since the 2016 changes in national PM standards, PM concentrations have decreased in most of our study region, although levels in Johannesburg and its surrounding areas have remained relatively constant. This is anadvanced PM model for South Africa with high prediction accuracy at the daily level and at a relatively high spatial resolution. Our study provided a reference for predictor selection, and our results can be used for a variety of purposes, including epidemiological research, burden of disease assessments, and policy evaluation.
Zhu Q, Zhang D, Wang W, DSouza R, Zhang H, Yang B Nat Ment Health. 2024; 2(4):379-387.
PMID: 39568497 PMC: 11575985. DOI: 10.1038/s44220-024-00210-8.
Prediction of atmospheric PM level by machine learning techniques in Isfahan, Iran.
Mohammadi F, Teiri H, Hajizadeh Y, Abdolahnejad A, Ebrahimi A Sci Rep. 2024; 14(1):2109.
PMID: 38267539 PMC: 10808097. DOI: 10.1038/s41598-024-52617-z.
Machine Learning Research Trends in Africa: A 30 Years Overview with Bibliometric Analysis Review.
Ezugwu A, Oyelade O, Ikotun A, Agushaka J, Ho Y Arch Comput Methods Eng. 2023; :1-31.
PMID: 37359741 PMC: 10148585. DOI: 10.1007/s11831-023-09930-z.
Effects of short-term PM exposure on blood lipids among 197,957 people in eastern China.
Liu Q, Wang Z, Lu J, Li Z, Martinez L, Tao B Sci Rep. 2023; 13(1):4505.
PMID: 36934119 PMC: 10024762. DOI: 10.1038/s41598-023-31513-y.
Tong C, Shi Z, Shi W, Zhang A Geohealth. 2022; 6(9):e2022GH000669.
PMID: 36101834 PMC: 9453924. DOI: 10.1029/2022GH000669.