Monitoring Seasonal Influenza Epidemics by Using Internet Search Data with an Ensemble Penalized Regression Model
Affiliations
Seasonal influenza epidemics cause serious public health problems in China. Search queries-based surveillance was recently proposed to complement traditional monitoring approaches of influenza epidemics. However, developing robust techniques of search query selection and enhancing predictability for influenza epidemics remains a challenge. This study aimed to develop a novel ensemble framework to improve penalized regression models for detecting influenza epidemics by using Baidu search engine query data from China. The ensemble framework applied a combination of bootstrap aggregating (bagging) and rank aggregation method to optimize penalized regression models. Different algorithms including lasso, ridge, elastic net and the algorithms in the proposed ensemble framework were compared by using Baidu search engine queries. Most of the selected search terms captured the peaks and troughs of the time series curves of influenza cases. The predictability of the conventional penalized regression models were improved by the proposed ensemble framework. The elastic net regression model outperformed the compared models, with the minimum prediction errors. We established a Baidu search engine queries-based surveillance model for monitoring influenza epidemics, and the proposed model provides a useful tool to support the public health response to influenza and other infectious diseases.
McClymont H, Lambert S, Barr I, Vardoulakis S, Bambrick H, Hu W J Epidemiol Glob Health. 2024; 14(3):645-657.
PMID: 39141074 PMC: 11442909. DOI: 10.1007/s44197-024-00272-y.
Wei S, Lin S, Wenjing Z, Shaoxia S, Yuejie Y, Yujie H BMC Public Health. 2024; 24(1):513.
PMID: 38369456 PMC: 10875817. DOI: 10.1186/s12889-024-17978-0.
Yang J, Zhou J, Luo T, Xie Y, Wei Y, Mai H Environ Health Prev Med. 2023; 28:68.
PMID: 37926526 PMC: 10636285. DOI: 10.1265/ehpm.23-00141.
Analysis of COVID-19 outbreak in Hubei province based on Tencent's location big data.
Hua L, Ran R, Li T Front Public Health. 2023; 11:1029385.
PMID: 37304123 PMC: 10251770. DOI: 10.3389/fpubh.2023.1029385.
Parums D Med Sci Monit. 2023; 29:e941209.
PMID: 37259578 PMC: 10240961. DOI: 10.12659/MSM.941209.