Monitoring Seasonal Influenza Epidemics by Using Internet Search Data with an Ensemble Penalized Regression Model

Overview
Journal Sci Rep
Specialty Science
Date 2017 Apr 20
PMID 28422149
Citations 21
Abstract

Seasonal influenza epidemics cause serious public health problems in China. Search query-based surveillance was recently proposed to complement traditional approaches to monitoring influenza epidemics. However, developing robust techniques for search query selection and improving the predictability of influenza epidemics remain a challenge. This study aimed to develop a novel ensemble framework that improves penalized regression models for detecting influenza epidemics by using Baidu search engine query data from China. The ensemble framework combined bootstrap aggregating (bagging) with a rank aggregation method to optimize penalized regression models. Several algorithms, including lasso, ridge, elastic net, and their counterparts within the proposed ensemble framework, were compared by using Baidu search engine queries. Most of the selected search terms captured the peaks and troughs of the time series curves of influenza cases. The predictability of the conventional penalized regression models was improved by the proposed ensemble framework. The elastic net regression model outperformed the compared models, with the minimum prediction errors. We established a Baidu search engine query-based surveillance model for monitoring influenza epidemics, and the proposed model provides a useful tool to support the public health response to influenza and other infectious diseases.
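
The abstract gives no implementation details, so the following is only a minimal sketch of the general idea of combining bagging with rank aggregation around a penalized regression. Everything in it is an assumption made for illustration: the function names (`lasso_cd`, `bagged_rank_selection`, `make_toy_data`) are hypothetical, the data are synthetic rather than Baidu query series, the lasso is fitted with a plain coordinate descent routine, and a simple mean-rank aggregation stands in for whatever rank aggregation method the authors actually used.

```python
import random

def soft_threshold(z, lam):
    """Soft-thresholding operator used in the lasso coordinate update."""
    if z > lam:
        return z - lam
    if z < -lam:
        return z + lam
    return 0.0

def lasso_cd(X, y, lam, n_iter=100):
    """Lasso by cyclic coordinate descent, minimizing
    (1/2n)*||y - Xb||^2 + lam*||b||_1.
    No intercept is fitted, so roughly centered data are assumed."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residual y - Xb for the starting point b = 0
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Covariance of column j with the residual that excludes feature j
            rho = sum(X[i][j] * (resid[i] + X[i][j] * beta[j]) for i in range(n)) / n
            new_b = soft_threshold(rho, lam) / col_sq[j]
            delta = new_b - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new_b
    return beta

def bagged_rank_selection(X, y, lam, n_boot=30, top_k=2, seed=0):
    """Fit the lasso on bootstrap resamples, rank features by |coefficient|
    in each fit, and aggregate by mean rank (an illustrative stand-in for
    the paper's rank aggregation method)."""
    rng = random.Random(seed)
    n, p = len(X), len(X[0])
    rank_sums = [0.0] * p
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]  # sample rows with replacement
        beta = lasso_cd([X[i] for i in idx], [y[i] for i in idx], lam)
        # Rank 1 = largest |coefficient|; ties are broken by column index
        order = sorted(range(p), key=lambda j: -abs(beta[j]))
        for rank, j in enumerate(order, start=1):
            rank_sums[j] += rank
    mean_rank = [s / n_boot for s in rank_sums]
    selected = sorted(range(p), key=lambda j: mean_rank[j])[:top_k]
    return selected, mean_rank

def make_toy_data(n=60, p=5, seed=0):
    """Synthetic data: only columns 0 and 2 drive the response."""
    rng = random.Random(seed)
    X = [[rng.gauss(0, 1) for _ in range(p)] for _ in range(n)]
    y = [3.0 * row[0] - 2.0 * row[2] + rng.gauss(0, 0.5) for row in X]
    return X, y

X, y = make_toy_data()
selected, mean_rank = bagged_rank_selection(X, y, lam=0.1)
print("selected columns:", selected)
print("mean ranks:", [round(r, 2) for r in mean_rank])
```

In this toy setup the columns play the role of candidate search terms and the response plays the role of the influenza case counts. Aggregating ranks across bootstrap fits is meant to make term selection less sensitive to any single sample; a final penalized model would then be refitted on the selected terms only.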

Citing Articles

Internet-based Surveillance Systems and Infectious Diseases Prediction: An Updated Review of the Last 10 Years and Lessons from the COVID-19 Pandemic.

McClymont H, Lambert S, Barr I, Vardoulakis S, Bambrick H, Hu W. J Epidemiol Glob Health. 2024; 14(3):645-657.

PMID: 39141074 PMC: 11442909. DOI: 10.1007/s44197-024-00272-y.


The prediction of influenza-like illness using national influenza surveillance data and Baidu query data.

Wei S, Lin S, Wenjing Z, Shaoxia S, Yuejie Y, Yujie H. BMC Public Health. 2024; 24(1):513.

PMID: 38369456 PMC: 10875817. DOI: 10.1186/s12889-024-17978-0.


Predicting pulmonary tuberculosis incidence in China using Baidu search index: an ARIMAX model approach.

Yang J, Zhou J, Luo T, Xie Y, Wei Y, Mai H. Environ Health Prev Med. 2023; 28:68.

PMID: 37926526 PMC: 10636285. DOI: 10.1265/ehpm.23-00141.


Analysis of COVID-19 outbreak in Hubei province based on Tencent's location big data.

Hua L, Ran R, Li T. Front Public Health. 2023; 11:1029385.

PMID: 37304123 PMC: 10251770. DOI: 10.3389/fpubh.2023.1029385.


Editorial: Infectious Disease Surveillance Using Artificial Intelligence (AI) and its Role in Epidemic and Pandemic Preparedness.

Parums D. Med Sci Monit. 2023; 29:e941209.

PMID: 37259578 PMC: 10240961. DOI: 10.12659/MSM.941209.

