
Assessing the Value of Integrating National Longitudinal Shopping Data into Respiratory Disease Forecasting Models

Overview
Journal: Nat Commun
Specialty: Biology
Date: 2023 Nov 22
PMID: 37990023
Abstract

The COVID-19 pandemic led to unparalleled pressure on healthcare services. Improved healthcare planning in relation to diseases affecting the respiratory system has consequently become a key concern. We investigated the value of integrating sales of non-prescription medications commonly bought for managing respiratory symptoms to improve forecasting of weekly registered deaths from respiratory disease at local levels across England, using over 2 billion transactions logged by a UK high street retailer from March 2016 to March 2020. We report the results from the novel AI (Artificial Intelligence) explainability variable importance tool Model Class Reliance, implemented on the PADRUS model (Prediction of Amount of Deaths by Respiratory disease Using Sales). PADRUS is a machine learning model optimised to predict registered deaths from respiratory disease in 314 local authority areas across England through the integration of shopping sales data, with a focus on purchases of non-prescription medications. We found strong evidence that models incorporating sales data significantly outperform other models that solely use variables traditionally associated with respiratory disease (e.g. sociodemographics and weather data). Accuracy gains are highest (increases in R² (coefficient of determination) of between 0.09 and 0.11) in periods of maximum risk to the general public. Results demonstrate the potential to utilise sales data to monitor population health with information at a high level of geographic granularity.
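The accuracy gains in the abstract are differences in R² between a model that uses sales data and one that does not. The sketch below shows how such a difference can be computed in general. It is not the PADRUS evaluation code: the function `r_squared`, the variable names, and every number in it are invented for illustration and do not come from the study.

```python
def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    if not observed or len(observed) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    mean_obs = sum(observed) / len(observed)
    # Residual sum of squares: error left over after the model's predictions.
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    # Total sum of squares: error of always predicting the mean.
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when observed values are constant")
    return 1.0 - ss_res / ss_tot


# Hypothetical weekly death counts for one area (made-up values).
observed = [10, 12, 15, 20, 18, 14]
# Hypothetical predictions from a model without sales features.
baseline_pred = [12, 12, 14, 16, 16, 15]
# Hypothetical predictions from a model with sales features added.
with_sales_pred = [11, 12, 15, 18, 17, 14]

r2_baseline = r_squared(observed, baseline_pred)
r2_with_sales = r_squared(observed, with_sales_pred)
# The "accuracy gain" is the difference between the two scores.
r2_gain = r2_with_sales - r2_baseline

print(f"baseline R^2:   {r2_baseline:.3f}")
print(f"with sales R^2: {r2_with_sales:.3f}")
print(f"gain in R^2:    {r2_gain:.3f}")
```

Because R² is measured against the variance of the observed values, a gain of 0.09 to 0.11 means the model with sales data explains roughly 9 to 11 percentage points more of the variance in weekly deaths than the comparison model.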

Citing Articles

Harnessing digital footprint data for population health: a discussion on collaboration, challenges and opportunities in the UK.

Burgess R, Dolan E, Poon N, Jenneson V, Pontin F, Sivill T. BMJ Health Care Inform. 2024; 31(1).

PMID: 39343444 PMC: 11448216. DOI: 10.1136/bmjhci-2024-101119.


A New Approach for Understanding International Hospital Bed Numbers and Application to Local Area Bed Demand and Capacity Planning.

Jones R. Int J Environ Res Public Health. 2024; 21(8).

PMID: 39200645 PMC: 11353596. DOI: 10.3390/ijerph21081035.


Assessing the value of integrating national longitudinal shopping data into respiratory disease forecasting models.

Dolan E, Goulding J, Marshall H, Smith G, Long G, Tata L. Nat Commun. 2023; 14(1):7258.

PMID: 37990023 PMC: 10663456. DOI: 10.1038/s41467-023-42776-4.
