National and Local Influenza Surveillance Through Twitter: an Analysis of the 2012-2013 Influenza Epidemic
Overview
Social media have been proposed as a data source for influenza surveillance because they can potentially offer real-time access to millions of short, geographically localized messages that contain information about personal well-being. However, the accuracy of social media surveillance systems declines with media attention, because media attention increases "chatter" (messages that are about influenza but do not pertain to an actual infection), which masks signs of true influenza prevalence. This paper summarizes our recently developed influenza infection detection algorithm, which automatically distinguishes relevant tweets from other chatter, and describes our current influenza surveillance system, which was actively deployed during the full 2012-2013 influenza season. Our objective was to analyze the performance of this system during the 2012-2013 influenza season at multiple levels of geographic granularity, unlike past studies that focused on national or regional surveillance. Our system's influenza prevalence estimates were strongly correlated with surveillance data from the Centers for Disease Control and Prevention for the United States (r = 0.93, p < 0.001) and with surveillance data from the Department of Health and Mental Hygiene of New York City (r = 0.88, p < 0.001). Our system detected the weekly change in direction (increasing or decreasing) of influenza prevalence with 85% accuracy, a nearly twofold increase over a simpler model, which demonstrates the utility of explicitly distinguishing infection tweets from other chatter.
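The two evaluation measures reported above, the Pearson correlation with reference surveillance data and the accuracy of the predicted weekly direction of change, can be illustrated with a minimal sketch. The function names and the toy weekly series below are hypothetical and are not taken from the study. The sketch also assumes that direction accuracy is the fraction of week-to-week transitions in which the estimate and the reference move the same way, which may differ from the exact definition used in the paper.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length series."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("series must have equal length of at least 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


def _sign(delta):
    """Return 1 for an increase, -1 for a decrease, 0 for no change."""
    return (delta > 0) - (delta < 0)


def direction_accuracy(estimates, reference):
    """Fraction of weekly transitions where both series move the same way.

    Assumed definition (not confirmed by the source): a transition counts
    as correct when the sign of the week-over-week change matches.
    """
    if len(estimates) != len(reference) or len(estimates) < 2:
        raise ValueError("series must have equal length of at least 2")
    hits = 0
    for i in range(1, len(estimates)):
        est_dir = _sign(estimates[i] - estimates[i - 1])
        ref_dir = _sign(reference[i] - reference[i - 1])
        if est_dir == ref_dir:
            hits += 1
    return hits / (len(estimates) - 1)


if __name__ == "__main__":
    # Hypothetical weekly values for illustration only, not study data.
    twitter_estimate = [1.0, 2.0, 3.0, 2.0]
    reference_rate = [1.0, 3.0, 2.0, 1.0]
    print("r =", pearson_r(twitter_estimate, reference_rate))
    print("direction accuracy =",
          direction_accuracy(twitter_estimate, reference_rate))
```

A correlation close to 1 indicates that the estimate tracks the overall level of the reference series, whereas direction accuracy tests only whether each weekly rise or fall is called correctly. A system can therefore score well on one measure and poorly on the other.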