» Articles » PMID: 29967755

Forecasting Influenza Epidemics by Integrating Internet Search Queries and Traditional Surveillance Data with the Support Vector Machine Regression Model in Liaoning, from 2011 to 2015

Overview
Journal PeerJ
Date 2018 Jul 4
PMID 29967755
Citations 21
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Influenza epidemics pose significant social and economic challenges in China. Internet search query data have been identified as a valuable source for the detection of emerging influenza epidemics. However, the selection of the search queries and the adoption of prediction methods are crucial challenges when it comes to improving predictions. The purpose of this study was to explore the application of the Support Vector Machine (SVM) regression model in merging search engine query data and traditional influenza data.

Methods: The official monthly reported number of influenza cases in Liaoning province in China was acquired from the China National Scientific Data Center for Public Health from January 2011 to December 2015. Based on Baidu Index, a publicly available search engine database, search queries potentially related to influenza over the corresponding period were identified. An SVM regression model was built to be used for predictions, and the choice of three parameters (, γ, ε) in the SVM regression model was determined by leave-one-out cross-validation (LOOCV) during the model construction process. The model's performance was evaluated by the evaluation metrics including Root Mean Square Error, Root Mean Square Percentage Error and Mean Absolute Percentage Error.

Results: In total, 17 search queries related to influenza were generated through the initial query selection approach and were adopted to construct the SVM regression model, including nine queries in the same month, three queries at a lag of one month, one query at a lag of two months and four queries at a lag of three months. The SVM model performed well when with the parameters ( = 2, γ = 0.005, ɛ = 0.0001), based on the ensemble data integrating the influenza surveillance data and Baidu search query data.

Conclusions: The results demonstrated the feasibility of using internet search engine query data as the complementary data source for influenza surveillance and the efficiency of SVM regression model in tracking the influenza epidemics in Liaoning.

Citing Articles

A novel graph neural network based approach for influenza-like illness nowcasting: exploring the interplay of temporal, geographical, and functional spatial features.

Luo J, Wang X, Fan X, He Y, Du X, Chen Y BMC Public Health. 2025; 25(1):408.

PMID: 39893390 PMC: 11786584. DOI: 10.1186/s12889-025-21618-6.


Mapping the Characteristics of Respiratory Infectious Disease Epidemics in China Based on the Baidu Index from November 2022 to January 2023.

Huo D, Zhang T, Han X, Yang L, Wang L, Fan Z China CDC Wkly. 2024; 6(37):939-945.

PMID: 39347451 PMC: 11427341. DOI: 10.46234/ccdcw2024.195.


A Predictive Model of the Start of Annual Influenza Epidemics.

Castro Blanco E, Dalmau Llorca M, Aguilar Martin C, Carrasco-Querol N, Goncalves A, Hernandez Rojas Z Microorganisms. 2024; 12(7).

PMID: 39065025 PMC: 11278734. DOI: 10.3390/microorganisms12071257.


Exploring the Lagged Correlation Between Baidu Index and Influenza-Like Illness - China, 2014-2019.

Han X, Yang J, Luo Y, Huo D, Yu X, Hu X China CDC Wkly. 2024; 6(26):629-634.

PMID: 38966307 PMC: 11219297. DOI: 10.46234/ccdcw2024.084.


Prediction of influenza outbreaks in Fuzhou, China: comparative analysis of forecasting models.

Chen Q, Zheng X, Shi H, Zhou Q, Hu H, Sun M BMC Public Health. 2024; 24(1):1399.

PMID: 38796443 PMC: 11127308. DOI: 10.1186/s12889-024-18583-x.


References
1.
Bouzille G, Poirier C, Campillo-Gimenez B, Aubert M, Chabot M, Chazard E . Leveraging hospital big data to monitor flu epidemics. Comput Methods Programs Biomed. 2017; 154:153-160. DOI: 10.1016/j.cmpb.2017.11.012. View

2.
Wang C, Li Y, Feng W, Liu K, Zhang S, Hu F . Epidemiological Features and Forecast Model Analysis for the Morbidity of Influenza in Ningbo, China, 2006-2014. Int J Environ Res Public Health. 2017; 14(6). PMC: 5486245. DOI: 10.3390/ijerph14060559. View

3.
Olson D, Konty K, Paladini M, Viboud C, Simonsen L . Reassessing Google Flu Trends data for detection of seasonal and pandemic influenza: a comparative epidemiological study at three geographic scales. PLoS Comput Biol. 2013; 9(10):e1003256. PMC: 3798275. DOI: 10.1371/journal.pcbi.1003256. View

4.
Woo H, Cho Y, Shim E, Lee J, Lee C, Kim S . Estimating Influenza Outbreaks Using Both Search Engine Query Data and Social Media Data in South Korea. J Med Internet Res. 2016; 18(7):e177. PMC: 4949385. DOI: 10.2196/jmir.4955. View

5.
Santillana M, Nsoesie E, Mekaru S, Scales D, Brownstein J . Using clinicians' search query data to monitor influenza epidemics. Clin Infect Dis. 2014; 59(10):1446-50. PMC: 4296132. DOI: 10.1093/cid/ciu647. View