Regional Infoveillance of COVID-19 Case Rates: Analysis of Search-Engine Query Patterns
Overview
Affiliations
Background: Timely allocation of medical resources for coronavirus disease (COVID-19) requires early detection of regional outbreaks. Internet browsing data may predict case outbreaks in local populations that are yet to be confirmed.
Objective: We investigated whether search-engine query patterns can help to predict COVID-19 case rates at the state and metropolitan area levels in the United States.
Methods: We used regional confirmed case data from the New York Times and Google Trends results from 50 states and 166 county-based designated market areas (DMA). We identified search terms whose activity precedes and correlates with confirmed case rates at the national level. We used univariate regression to construct a composite explanatory variable based on best-fitting search queries offset by temporal lags. We measured the raw and z-transformed Pearson correlation and root-mean-square error (RMSE) of the explanatory variable with out-of-sample case rate data at the state and DMA levels.
Results: Predictions were highly correlated with confirmed case rates at the state (mean r=0.69, 95% CI 0.51-0.81; median RMSE 1.27, IQR 1.48) and DMA levels (mean r=0.51, 95% CI 0.39-0.61; median RMSE 4.38, IQR 1.80), using search data available up to 10 days prior to confirmed case rates. They fit case-rate activity in 49 of 50 states and in 103 of 166 DMA at a significance level of .05.
Conclusions: Identifiable patterns in search query activity may help to predict emerging regional outbreaks of COVID-19, although they remain vulnerable to stochastic changes in search intensity.
Kaur M, Cargill T, Hui K, Vu M, Bragazzi N, Kong J JMIR Form Res. 2024; 8:e46087.
PMID: 38285495 PMC: 10862249. DOI: 10.2196/46087.
Changes to Public Health Surveillance Methods Due to the COVID-19 Pandemic: Scoping Review.
Clark E, Neumann S, Hopkins S, Kostopoulos A, Hagerman L, Dobbins M JMIR Public Health Surveill. 2024; 10:e49185.
PMID: 38241067 PMC: 10837764. DOI: 10.2196/49185.
Do you see what I see? Images of the COVID-19 pandemic through the lens of Google.
Paramita M, Orphanou K, Christoforou E, Otterbacher J, Hopfgartner F Inf Process Manag. 2022; 58(5):102654.
PMID: 36567975 PMC: 9759662. DOI: 10.1016/j.ipm.2021.102654.
Forecasting and Surveillance of COVID-19 Spread Using Google Trends: Literature Review.
Saegner T, Austys D Int J Environ Res Public Health. 2022; 19(19).
PMID: 36231693 PMC: 9566212. DOI: 10.3390/ijerph191912394.
Twitter conversations predict the daily confirmed COVID-19 cases.
Lamsal R, Harwood A, Read M Appl Soft Comput. 2022; 129:109603.
PMID: 36092470 PMC: 9444159. DOI: 10.1016/j.asoc.2022.109603.