From Hype to Reality: Data Science Enabling Personalized Medicine
Overview
Authors
Affiliations
Background: Personalized, precision, P4, or stratified medicine is understood as a medical approach in which patients are stratified based on their disease subtype, risk, prognosis, or treatment response using specialized diagnostic tests. The key idea is to base medical decisions on individual patient characteristics, including molecular and behavioral biomarkers, rather than on population averages. Personalized medicine is deeply connected to and dependent on data science, specifically machine learning (often named Artificial Intelligence in the mainstream media). While during recent years there has been a lot of enthusiasm about the potential of 'big data' and machine learning-based solutions, there exist only few examples that impact current clinical practice. The lack of impact on clinical practice can largely be attributed to insufficient performance of predictive models, difficulties to interpret complex model predictions, and lack of validation via prospective clinical trials that demonstrate a clear benefit compared to the standard of care. In this paper, we review the potential of state-of-the-art data science approaches for personalized medicine, discuss open challenges, and highlight directions that may help to overcome them in the future.
Conclusions: There is a need for an interdisciplinary effort, including data scientists, physicians, patient advocates, regulatory agencies, and health insurance organizations. Partially unrealistic expectations and concerns about data science-based solutions need to be better managed. In parallel, computational methods must advance more to provide direct benefit to clinical practice.
Pseudonymization tools for medical research: a systematic review.
Abu Attieh H, Muller A, Wirth F, Prasser F BMC Med Inform Decis Mak. 2025; 25(1):128.
PMID: 40075358 PMC: 11905493. DOI: 10.1186/s12911-025-02958-0.
Examining the Use of Machine Learning Algorithms to Enhance the Pediatric Triaging Approach.
Aljubran H, Aljubran M, AlAwami A, Aljubran M, Alkhalifah M, Alkhalifah M Open Access Emerg Med. 2025; 17:51-61.
PMID: 39906028 PMC: 11791337. DOI: 10.2147/OAEM.S494280.
Birkenbihl C, Cuppels M, Boyle R, Klinger H, Langford O, Coughlan G Brain Inform. 2025; 12(1):3.
PMID: 39871006 PMC: 11772644. DOI: 10.1186/s40708-024-00249-4.
Deep learning-based patient stratification for prognostic enrichment of clinical dementia trials.
Birkenbihl C, de Jong J, Yalchyk I, Frohlich H Brain Commun. 2024; 6(6):fcae445.
PMID: 39713242 PMC: 11660909. DOI: 10.1093/braincomms/fcae445.
Machine learning algorithms: why the cup occasionally appears half-empty.
Woodman R Eur J Clin Nutr. 2024; 79(2):87-89.
PMID: 39443687 PMC: 11810781. DOI: 10.1038/s41430-024-01529-2.