» Articles » PMID: 33606879

Perspective: Big Data and Machine Learning Could Help Advance Nutritional Epidemiology

Overview
Journal Adv Nutr
Publisher Elsevier
Date 2021 Feb 19
PMID 33606879
Citations 21
Authors
Affiliations
Soon will be listed here.
Abstract

The field of nutritional epidemiology faces challenges posed by measurement error, diet as a complex exposure, and residual confounding. The objective of this perspective article is to highlight how developments in big data and machine learning can help address these challenges. New methods of collecting 24-h dietary recalls and recording diet could enable larger samples and more repeated measures to increase statistical power and measurement precision. In addition, use of machine learning to automatically classify pictures of food could become a useful complimentary method to help improve precision and validity of dietary measurements. Diet is complex due to thousands of different foods that are consumed in varying proportions, fluctuating quantities over time, and differing combinations. Current dietary pattern methods may not integrate sufficient dietary variation, and most traditional modeling approaches have limited incorporation of interactions and nonlinearity. Machine learning could help better model diet as a complex exposure with nonadditive and nonlinear associations. Last, novel big data sources could help avoid unmeasured confounding by offering more covariates, including both omics and features derived from unstructured data with machine learning methods. These opportunities notwithstanding, application of big data and machine learning must be approached cautiously to ensure quality of dietary measurements, avoid overfitting, and confirm accurate interpretations. Greater use of machine learning and big data would also require substantial investments in training, collaborations, and computing infrastructure. Overall, we propose that judicious application of big data and machine learning in nutrition science could offer new means of dietary measurement, more tools to model the complexity of diet and its relations with diseases, and additional potential ways of addressing confounding.

Citing Articles

A comparison of development methods used to define portion sizes in food-based dietary guidelines around the world.

Salesse F, Eldridge A, Mak T, Gibney E Front Nutr. 2025; 12:1532926.

PMID: 40013165 PMC: 11860067. DOI: 10.3389/fnut.2025.1532926.


The development and validation of a prediction model for post-AKI outcomes of pediatric inpatients.

Zhang C, Liu X, Yan R, Nie X, Peng Y, Zhou N Clin Kidney J. 2025; 18(2):sfaf007.

PMID: 39991652 PMC: 11843026. DOI: 10.1093/ckj/sfaf007.


Challenges for Predictive Modeling With Neural Network Techniques Using Error-Prone Dietary Intake Data.

Spicker D, Nazemi A, Hutchinson J, Fieguth P, Kirkpatrick S, Wallace M Stat Med. 2025; 44(5):e70013.

PMID: 39921576 PMC: 11806516. DOI: 10.1002/sim.70013.


Nutritional intelligence in the food system: Combining food, health, data and AI expertise.

McCarthy D Nutr Bull. 2025; 50(1):142-150.

PMID: 39799464 PMC: 11815607. DOI: 10.1111/nbu.12729.


Tailoring the Nutritional Composition of Italian Foods to the US Nutrition5k Dataset for Food Image Recognition: Challenges and a Comparative Analysis.

Bianco R, Marinoni M, Coluccia S, Carioni G, Fiori F, Gnagnarella P Nutrients. 2024; 16(19).

PMID: 39408306 PMC: 11479105. DOI: 10.3390/nu16193339.


References
1.
Bi Q, Goodman K, Kaminsky J, Lessler J . What is Machine Learning? A Primer for the Epidemiologist. Am J Epidemiol. 2019; 188(12):2222-2239. DOI: 10.1093/aje/kwz189. View

2.
Wyss R, Ellis A, Brookhart M, Girman C, Funk M, LoCasale R . The role of prediction modeling in propensity score estimation: an evaluation of logistic regression, bCART, and the covariate-balancing propensity score. Am J Epidemiol. 2014; 180(6):645-55. PMC: 4157700. DOI: 10.1093/aje/kwu181. View

3.
Ioannidis J . Unreformed nutritional epidemiology: a lamp post in the dark forest. Eur J Epidemiol. 2019; 34(4):327-331. DOI: 10.1007/s10654-019-00487-5. View

4.
Recio-Rodriguez J, Rodriguez-Martin C, Gonzalez-Sanchez J, Rodriguez-Sanchez E, Martin-Borras C, Martinez-Vizcaino V . EVIDENT Smartphone App, a New Method for the Dietary Record: Comparison With a Food Frequency Questionnaire. JMIR Mhealth Uhealth. 2019; 7(2):e11463. PMC: 6384535. DOI: 10.2196/11463. View

5.
Naska A, Lagiou A, Lagiou P . Dietary assessment methods in epidemiological research: current state of the art and future prospects. F1000Res. 2017; 6:926. PMC: 5482335. DOI: 10.12688/f1000research.10703.1. View