» Articles » PMID: 23543913

Translational Biomarker Discovery in Clinical Metabolomics: an Introductory Tutorial

Overview
Journal Metabolomics
Publisher Springer
Specialty Endocrinology
Date 2013 Apr 2
PMID 23543913
Citations 361
Authors
Affiliations
Soon will be listed here.
Abstract

Metabolomics is increasingly being applied towards the identification of biomarkers for disease diagnosis, prognosis and risk prediction. Unfortunately among the many published metabolomic studies focusing on biomarker discovery, there is very little consistency and relatively little rigor in how researchers select, assess or report their candidate biomarkers. In particular, few studies report any measure of sensitivity, specificity, or provide receiver operator characteristic (ROC) curves with associated confidence intervals. Even fewer studies explicitly describe or release the biomarker model used to generate their ROC curves. This is surprising given that for biomarker studies in most other biomedical fields, ROC curve analysis is generally considered the standard method for performance assessment. Because the ultimate goal of biomarker discovery is the translation of those biomarkers to clinical practice, it is clear that the metabolomics community needs to start "speaking the same language" in terms of biomarker analysis and reporting-especially if it wants to see metabolite markers being routinely used in the clinic. In this tutorial, we will first introduce the concept of ROC curves and describe their use in single biomarker analysis for clinical chemistry. This includes the construction of ROC curves, understanding the meaning of area under ROC curves (AUC) and partial AUC, as well as the calculation of confidence intervals. The second part of the tutorial focuses on biomarker analyses within the context of metabolomics. This section describes different statistical and machine learning strategies that can be used to create - biomarker models and explains how these models can be assessed using ROC curves. In the third part of the tutorial we discuss common issues and potential pitfalls associated with different analysis methods and provide readers with a list of nine recommendations for biomarker analysis and reporting. To help readers test, visualize and explore the concepts presented in this tutorial, we also introduce a web-based tool called ROCCET (ROC Curve Explorer & Tester, http://www.roccet.ca). ROCCET was originally developed as a teaching aid but it can also serve as a training and testing resource to assist metabolomics researchers build biomarker models and conduct a range of common ROC curve analyses for biomarker studies.

Citing Articles

Tryptophan metabolites profile predict remission with dietary therapy in pediatric Crohn's disease.

Sigall Boneh R, van der Kruk N, Wine E, Verburgt C, de Meij T, Lowenberg M Therap Adv Gastroenterol. 2025; 18:17562848251323004.

PMID: 40012837 PMC: 11863242. DOI: 10.1177/17562848251323004.


Rapid exometabolome footprinting combined with multivariate statistics: A powerful tool for bioprocess optimization.

Reiter A, Wesseling L, Wiechert W, Oldiges M Eng Life Sci. 2025; 25(2):2300222.

PMID: 39990767 PMC: 11842285. DOI: 10.1002/elsc.202300222.


Specialized Metabolite Profiling-Based Variations of Watercress Leaves ( R.Br.) from Hydroponic and Aquaponic Systems.

Buitrago-Villanueva I, Barbosa-Cornelio R, Coy-Barrera E Molecules. 2025; 30(2.

PMID: 39860275 PMC: 11767583. DOI: 10.3390/molecules30020406.


Integrating proteomics and metabolomics to elucidate the regulatory mechanisms of pimpled egg production in chickens: Multi-omics analysis of the mechanism of pimpled egg formation.

Li W, Ma X, Li X, Zhang X, Sun Y, Ning C Poult Sci. 2025; 104(2):104818.

PMID: 39827695 PMC: 11787586. DOI: 10.1016/j.psj.2025.104818.


A novel approach to identify optimal metabotypes of elongase and desaturase activities in prevention of acute coronary syndrome.

Tintle N, Newman J, Shearer G Metabolomics. 2025; 11(5):1327-1337.

PMID: 39777108 PMC: 11706515. DOI: 10.1007/s11306-015-0787-6.


References
1.
Sansone S, Rocca-Serra P, Field D, Maguire E, Taylor C, Hofmann O . Toward interoperable bioscience data. Nat Genet. 2012; 44(2):121-6. PMC: 3428019. DOI: 10.1038/ng.1054. View

2.
Jarvis R, Goodacre R . Genetic algorithm optimization for pre-processing and variable selection of spectroscopic data. Bioinformatics. 2004; 21(7):860-8. DOI: 10.1093/bioinformatics/bti102. View

3.
. Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Clin Pharmacol Ther. 2001; 69(3):89-95. DOI: 10.1067/mcp.2001.113989. View

4.
Bourgon R, Gentleman R, Huber W . Independent filtering increases detection power for high-throughput experiments. Proc Natl Acad Sci U S A. 2010; 107(21):9546-51. PMC: 2906865. DOI: 10.1073/pnas.0914005107. View

5.
Arkin C, Wachtel M . How many patients are necessary to assess test performance?. JAMA. 1990; 263(2):275-8. View