» Articles » PMID: 31924279

An Unsupervised Learning Approach to Identify Novel Signatures of Health and Disease from Multimodal Data

Abstract

Background: Modern medicine is rapidly moving towards a data-driven paradigm based on comprehensive multimodal health assessments. Integrated analysis of data from different modalities has the potential of uncovering novel biomarkers and disease signatures.

Methods: We collected 1385 data features from diverse modalities, including metabolome, microbiome, genetics, and advanced imaging, from 1253 individuals and from a longitudinal validation cohort of 1083 individuals. We utilized a combination of unsupervised machine learning methods to identify multimodal biomarker signatures of health and disease risk.

Results: Our method identified a set of cardiometabolic biomarkers that goes beyond standard clinical biomarkers. Stratification of individuals based on the signatures of these biomarkers identified distinct subsets of individuals with similar health statuses. Subset membership was a better predictor for diabetes than established clinical biomarkers such as glucose, insulin resistance, and body mass index. The novel biomarkers in the diabetes signature included 1-stearoyl-2-dihomo-linolenoyl-GPC and 1-(1-enyl-palmitoyl)-2-oleoyl-GPC. Another metabolite, cinnamoylglycine, was identified as a potential biomarker for both gut microbiome health and lean mass percentage. We identified potential early signatures for hypertension and a poor metabolic health outcome. Additionally, we found novel associations between a uremic toxin, p-cresol sulfate, and the abundance of the microbiome genera Intestinimonas and an unclassified genus in the Erysipelotrichaceae family.

Conclusions: Our methodology and results demonstrate the potential of multimodal data integration, from the identification of novel biomarker signatures to a data-driven stratification of individuals into disease subtypes and stages-an essential step towards personalized, preventative health risk assessment.

Citing Articles

Artificial intelligence in the management of metabolic disorders: a comprehensive review.

Anwar A, Rana S, Pathak P J Endocrinol Invest. 2025; .

PMID: 39969797 DOI: 10.1007/s40618-025-02548-x.


Non-targeted metabolomics analysis of fermented traditional Chinese medicine and its impact on growth performance, serum biochemistry, and intestinal microbiome of weaned lambs.

Fan J, Cui H, Mu Z, Yao C, Yang M, Jin Y Sci Rep. 2024; 14(1):20385.

PMID: 39223216 PMC: 11369253. DOI: 10.1038/s41598-024-71516-x.


Identifying diseases symptoms and general rules using supervised and unsupervised machine learning.

Sogandi F Sci Rep. 2024; 14(1):17956.

PMID: 39095606 PMC: 11297332. DOI: 10.1038/s41598-024-69029-8.


Integration of two-dimensional echocardiography: A novel risk indicator for ST-segment elevation myocardial infarction.

Gao H, Wang K, Wang X, Zeng D, Chen Z ESC Heart Fail. 2024; 11(5):3312-3321.

PMID: 38946662 PMC: 11424358. DOI: 10.1002/ehf2.14939.


Data-driven clustering approach to identify novel clusters of high cognitive impairment risk among Chinese community-dwelling elderly people with normal cognition: A national cohort study.

Ran W, Yu Q J Glob Health. 2024; 14:04088.

PMID: 38638099 PMC: 11026990. DOI: 10.7189/jogh.14.04088.


References
1.
Cobb J, Eckhart A, Perichon R, Wulff J, Mitchell M, Adam K . A novel test for IGT utilizing metabolite markers of glucose tolerance. J Diabetes Sci Technol. 2014; 9(1):69-76. PMC: 4495543. DOI: 10.1177/1932296814553622. View

2.
Wikoff W, Anfora A, Liu J, Schultz P, Lesley S, Peters E . Metabolomics analysis reveals large effects of gut microflora on mammalian blood metabolites. Proc Natl Acad Sci U S A. 2009; 106(10):3698-703. PMC: 2656143. DOI: 10.1073/pnas.0812874106. View

3.
Rossi M, Johnson D, Xu H, Carrero J, Pascoe E, French C . Dietary protein-fiber ratio associates with circulating levels of indoxyl sulfate and p-cresyl sulfate in chronic kidney disease patients. Nutr Metab Cardiovasc Dis. 2015; 25(9):860-865. DOI: 10.1016/j.numecd.2015.03.015. View

4.
Perkins B, Caskey C, Brar P, Dec E, Karow D, Kahn A . Precision medicine screening using whole-genome sequencing and advanced imaging to identify disease risk in adults. Proc Natl Acad Sci U S A. 2018; 115(14):3686-3691. PMC: 5889622. DOI: 10.1073/pnas.1706096114. View

5.
Russell W, Duncan S, Scobbie L, Duncan G, Cantlay L, Calder A . Major phenylpropanoid-derived metabolites in the human gut can arise from microbial fermentation of protein. Mol Nutr Food Res. 2013; 57(3):523-35. DOI: 10.1002/mnfr.201200594. View