PMID: 33591285

Development and Validation of Risk Scores for All-Cause Mortality for a Smartphone-Based "General Health Score" App: Prospective Cohort Study Using the UK Biobank

Abstract

Background: Given the established links between an individual's behaviors and lifestyle factors and potentially adverse health outcomes, univariate or simple multivariate health metrics and scores have been developed to quantify general health at a given point in time and to estimate the risk of negative future outcomes. However, these metrics may be difficult to deploy widely and are unlikely to capture the broader determinants of health in the general population. Hence, there is a need for a multidimensional yet widely employable and accessible way to obtain a comprehensive health metric.

Objective: The objective of the study was to develop and validate a novel, easily interpretable, points-based health score ("C-Score") derived from metrics measurable using smartphone components, together with iterations of the score that use statistical modeling and machine learning (ML) approaches.

Methods: A literature review was conducted to identify relevant predictor variables for inclusion in the first iteration of a points-based model. This was followed by a prospective cohort study in a UK Biobank population to validate the C-Score and to develop and comparatively validate variations of the score using statistical and ML models, in order to assess the trade-off between expediency and ease of interpretation on the one hand and model complexity on the other. Primary and secondary outcome measures were discrimination of a points-based score for all-cause mortality within 10 years (Harrell c-statistic) and discrimination and calibration of Cox proportional hazards models and ML models that incorporate C-Score values (or raw data inputs) and other predictors to predict the risk of all-cause mortality within 10 years.
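
As an illustration of the discrimination measure named above, the following is a minimal sketch of Harrell's c-statistic for right-censored data. It is not the study's code: the function name `harrell_c`, the toy data, and the choice to skip pairs tied on follow-up time are assumptions made for the example.

```python
from itertools import combinations


def harrell_c(times, events, risk_scores):
    """Harrell's c-statistic for right-censored survival data.

    A pair is comparable when the subject with the shorter follow-up time
    had the event. It is concordant when that subject also has the higher
    predicted risk; ties in predicted risk count as half.
    """
    concordant = 0.0
    comparable = 0
    for i, j in combinations(range(len(times)), 2):
        if times[i] == times[j]:
            continue  # simplification (assumption): skip pairs tied on time
        first, second = (i, j) if times[i] < times[j] else (j, i)
        if not events[first]:
            continue  # earlier subject was censored, so the ordering is unknown
        comparable += 1
        if risk_scores[first] > risk_scores[second]:
            concordant += 1.0
        elif risk_scores[first] == risk_scores[second]:
            concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data (hypothetical): follow-up in years, 1 = died, 0 = censored.
toy_times = [2, 4, 6, 8, 10]
toy_events = [1, 1, 0, 1, 0]
toy_risk = [0.9, 0.4, 0.5, 0.3, 0.1]
toy_c = harrell_c(toy_times, toy_events, toy_risk)
```

A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect ranking. Because a higher C-Score indicates lower mortality risk, a score of that kind would be negated before being passed as `risk_scores`.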

Results: The study cohort comprised 420,560 individuals. During a cohort follow-up of 4,526,452 person-years, there were 16,188 deaths from any cause (3.85%). The points-based model had good discrimination (c-statistic=0.66). There was a 31% relative reduction in risk of all-cause mortality per decile of increasing C-Score (hazard ratio of 0.69, 95% CI 0.663-0.675). A Cox model integrating age and C-Score had improved discrimination (8 percentage points; c-statistic=0.74) and good calibration. ML approaches did not offer improved discrimination over statistical modeling.
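
The headline figures above can be cross-checked with simple arithmetic. The sketch below uses only the numbers reported in the Results; the variable names are illustrative, and the multi-decile extrapolation assumes the per-decile hazard ratio applies multiplicatively (log-linearly) across deciles, which the abstract does not state explicitly.

```python
# Figures reported in the Results.
deaths = 16_188
cohort = 420_560
person_years = 4_526_452
hr_per_decile = 0.69

# Proportion of the cohort that died during follow-up (reported as 3.85%).
crude_mortality = deaths / cohort

# Relative reduction in hazard per decile of increasing C-Score (reported as 31%).
relative_reduction = 1 - hr_per_decile

# Derived quantities, not stated in the abstract.
mean_follow_up_years = person_years / cohort
deaths_per_1000_person_years = 1000 * deaths / person_years


def hazard_ratio_across(deciles, hr=hr_per_decile):
    """Hazard ratio for a difference of `deciles` deciles of C-Score.

    Assumption: the per-decile effect compounds multiplicatively.
    """
    return hr ** deciles
```

Under that assumption, a two-decile difference in C-Score would correspond to a hazard ratio of 0.69 squared, or roughly 0.48.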

Conclusions: The novel health metric ("C-Score") has good predictive capabilities for all-cause mortality within 10 years. Embedding the C-Score within a smartphone app may represent a useful tool for democratized, individualized health risk prediction. A simple Cox model using C-Score and age balances parsimony and accuracy of risk predictions and could be used to produce absolute risk estimations for app users.
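
To show how a Cox model with age and C-Score could yield an absolute risk estimate, here is a minimal sketch of the standard conversion from a fitted Cox model to a predicted risk at a fixed horizon: risk = 1 - S0(t) ^ exp(linear predictor). The function name is hypothetical, and the baseline survival, coefficients, and covariate means in the usage lines are placeholders, not values from the study.

```python
import math


def absolute_risk(baseline_survival, coefficients, values, means):
    """Predicted risk at a fixed horizon from a Cox proportional hazards model.

    baseline_survival: S0(t) at the horizon (e.g. 10 years) for a person
        whose covariates equal `means`.
    coefficients: log hazard ratios, one per covariate.
    values: the individual's covariate values.
    means: the centering values used when estimating S0(t).
    """
    linear_predictor = sum(
        b * (x - m) for b, x, m in zip(coefficients, values, means)
    )
    return 1.0 - baseline_survival ** math.exp(linear_predictor)


# Placeholder inputs (hypothetical, for illustration only):
# covariates are [age in years, C-Score decile].
placeholder_s0 = 0.96
placeholder_betas = [0.09, -0.30]
placeholder_means = [57.0, 5.0]

risk_at_means = absolute_risk(
    placeholder_s0, placeholder_betas, [57.0, 5.0], placeholder_means
)
risk_older_lower_score = absolute_risk(
    placeholder_s0, placeholder_betas, [67.0, 2.0], placeholder_means
)
```

With covariates equal to the centering values, the predicted risk reduces to 1 - S0(t); higher age or a lower C-Score decile raises it when the coefficients carry the signs used here.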

Citing Articles

Health literacy in relation to web-based measurement of cognitive function in the home: UK Women's Cohort Study.

Hagger-Johnson G, Reimers S, Greenwood D, Cade J, Gow A BMJ Open. 2025; 15(3):e092528.

PMID: 40054868 PMC: 11887290. DOI: 10.1136/bmjopen-2024-092528.


Correlation between allostatic load index and cumulative mortality: a register-based study of Danish municipalities.

Bruun-Rasmussen N, Napolitano G, Bojesen S, Ellervik C, Holmager T, Rasmussen K BMJ Open. 2024; 14(2):e075697.

PMID: 38346879 PMC: 10862330. DOI: 10.1136/bmjopen-2023-075697.


Development and validation of questionnaire-based machine learning models for predicting all-cause mortality in a representative population of China.

Li Z, Yang N, He L, Wang J, Ping F, Li W Front Public Health. 2023; 11:1033070.

PMID: 36778549 PMC: 9911458. DOI: 10.3389/fpubh.2023.1033070.


A Novel Score for mHealth Apps to Predict and Prevent Mortality: Further Validation and Adaptation to the US Population Using the US National Health and Nutrition Examination Survey Data Set.

Elnakib S, Vecino-Ortiz A, Gibson D, Agarwal S, Trujillo A, Zhu Y J Med Internet Res. 2022; 24(6):e36787.

PMID: 35483022 PMC: 9240932. DOI: 10.2196/36787.


Comparison of Machine Learning Techniques for Mortality Prediction in a Prospective Cohort of Older Adults.

Tedesco S, Andrulli M, Larsson M, Kelly D, Alamaki A, Timmons S Int J Environ Res Public Health. 2021; 18(23).

PMID: 34886532 PMC: 8657506. DOI: 10.3390/ijerph182312806.
