» Articles » PMID: 38426232

Transforming Big Data into AI-ready Data for Nutrition and Obesity Research

Abstract

Objective: Big Data are increasingly used in obesity and nutrition research to gain new insights and derive personalized guidance; however, this data in raw form are often not usable. Substantial preprocessing, which requires machine learning (ML), human judgment, and specialized software, is required to transform Big Data into artificial intelligence (AI)- and ML-ready data. These preprocessing steps are the most complex part of the entire modeling pipeline. Understanding the complexity of these steps by the end user is critical for reducing misunderstanding, faulty interpretation, and erroneous downstream conclusions.

Methods: We reviewed three popular obesity/nutrition Big Data sources: microbiome, metabolomics, and accelerometry. The preprocessing pipelines, specialized software, challenges, and how decisions impact final AI- and ML-ready products were detailed.

Results: Opportunities for advances to improve quality control, speed of preprocessing, and intelligent end user consumption were presented.

Conclusions: Big Data have the exciting potential for identifying new modifiable factors that impact obesity research. However, to ensure accurate interpretation of conclusions arising from Big Data, the choices involved in preparing AI- and ML-ready data need to be transparent to investigators and clinicians relying on the conclusions.

Citing Articles

Succinic Acid Improves the Metabolism of High-Fat Diet-Induced Mice and Promotes White Adipose Browning.

Yang Y, Luo L, Li Y, Shi X, Li C, Chai J Nutrients. 2024; 16(22).

PMID: 39599615 PMC: 11597198. DOI: 10.3390/nu16223828.


AI-readiness for Biomedical Data: Bridge2AI Recommendations.

Clark T, Caufield H, Parker J, Al Manir S, Amorim E, Eddy J bioRxiv. 2024; .

PMID: 39484409 PMC: 11526931. DOI: 10.1101/2024.10.23.619844.

References
1.
Shen X, Kellogg R, Panyard D, Bararpour N, Castillo K, Lee-McMullen B . Multi-omics microsampling for the profiling of lifestyle-associated changes in health. Nat Biomed Eng. 2023; 8(1):11-29. PMC: 10805653. DOI: 10.1038/s41551-022-00999-8. View

2.
Berger A, Wielgus K, Young-McCaughan S, Fischer P, Farr L, Lee K . Methodological challenges when using actigraphy in research. J Pain Symptom Manage. 2008; 36(2):191-9. PMC: 2542506. DOI: 10.1016/j.jpainsymman.2007.10.008. View

3.
van Hees V, Sabia S, Anderson K, Denton S, Oliver J, Catt M . A Novel, Open Access Method to Assess Sleep Duration Using a Wrist-Worn Accelerometer. PLoS One. 2015; 10(11):e0142533. PMC: 4646630. DOI: 10.1371/journal.pone.0142533. View

4.
Aguilar-Farias N, Peeters G, Brychta R, Chen K, Brown W . Comparing ActiGraph equations for estimating energy expenditure in older adults. J Sports Sci. 2018; 37(2):188-195. PMC: 6298850. DOI: 10.1080/02640414.2018.1488437. View

5.
Cirulli E, Guo L, Swisher C, Shah N, Huang L, Napier L . Profound Perturbation of the Metabolome in Obesity Is Associated with Health Risk. Cell Metab. 2018; 29(2):488-500.e2. PMC: 6370944. DOI: 10.1016/j.cmet.2018.09.022. View