
Systems-Level Annotation of a Metabolomics Data Set Reduces 25 000 Features to Fewer Than 1000 Unique Metabolites

Overview
Journal Anal Chem
Specialty Chemistry
Date 2017 Sep 16
PMID 28914531
Citations 118
Abstract

When using liquid chromatography/mass spectrometry (LC/MS) to perform untargeted metabolomics, it is now routine to detect tens of thousands of features from biological samples. Poor understanding of the data, however, has complicated interpretation and masked the number of unique metabolites actually being measured in an experiment. Here we place an upper bound on the number of unique metabolites detected in Escherichia coli samples analyzed with one untargeted metabolomics method. We first group multiple features arising from the same analyte, which we call "degenerate features", using a context-driven annotation approach. Surprisingly, this analysis revealed thousands of previously unreported degeneracies that reduced the number of unique analytes to ∼2961. We then applied an orthogonal approach to remove nonbiological features from the data using the 13C-based credentialing technology. This further reduced the number of unique analytes to fewer than 1000. Our 90% reduction in data is 5-fold greater than that reported in previously published studies. On the basis of the results, we propose an alternative approach to untargeted metabolomics that relies on thoroughly annotated reference data sets. To this end, we introduce the creDBle database (http://creDBle.wustl.edu), which contains accurate mass, retention time, and MS/MS fragmentation data as well as annotations of all credentialed features.
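The idea of grouping "degenerate features" can be illustrated with a minimal Python sketch. This is not the context-driven annotation approach used in the study, only a simplified stand-in: it assumes that features of one analyte co-elute (similar retention time) and differ in m/z by a known amount. The function name `group_degenerate`, the two mass differences in `KNOWN_DELTAS`, the tolerances, and the example feature list are all illustrative assumptions, not values taken from the paper.

```python
from itertools import combinations

# Assumed, illustrative m/z differences between ions of the same analyte
# (positive mode). A real annotation tool would consider many more relationships.
KNOWN_DELTAS = {
    "13C isotope": 1.003355,
    "[M+Na]+ vs [M+H]+": 21.981943,
}


def group_degenerate(features, rt_tol=2.0, mz_tol=0.002):
    """Group co-eluting features linked by a known m/z difference.

    features: list of (mz, rt_seconds) tuples.
    Returns a sorted list of groups, each a list of feature indices.
    """
    parent = list(range(len(features)))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in combinations(range(len(features)), 2):
        (mz_i, rt_i), (mz_j, rt_j) = features[i], features[j]
        if abs(rt_i - rt_j) > rt_tol:
            continue  # not co-eluting, so not treated as the same analyte
        delta = abs(mz_i - mz_j)
        if any(abs(delta - d) <= mz_tol for d in KNOWN_DELTAS.values()):
            parent[find(i)] = find(j)  # merge the two groups

    groups = {}
    for i in range(len(features)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Hypothetical feature table: three ions of glucose plus two unrelated features.
features = [
    (181.0707, 120.0),  # [M+H]+
    (182.0740, 120.0),  # 13C isotope of [M+H]+
    (203.0526, 120.0),  # [M+Na]+
    (181.0707, 300.0),  # same m/z, different retention time
    (250.1000, 120.5),  # co-eluting but unrelated m/z
]
groups = group_degenerate(features)
print(len(features), "features ->", len(groups), "candidate analytes")
```

In this toy input the three glucose ions collapse into one group, so five features become three candidate analytes. The sketch compares every pair of features, which is fine for illustration but would need indexing by retention time to scale to tens of thousands of features.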

Citing Articles

A systematic analysis of in-source fragments in LC-MS metabolomics.

Chi Y, Mitchell J, Li S. bioRxiv. 2025.

PMID: 39975275 PMC: 11838597. DOI: 10.1101/2025.02.04.636472.


A Software Tool for Rapid and Automated Preprocessing of Large-Scale Serum Metabolomic Data by Multisegment Injection-Capillary Electrophoresis-Mass Spectrometry.

Helmeczi E, Kroezen Z, Shanmuganathan M, Stanciu A, Martinez V, Kurysko N. Anal Chem. 2024; 97(1):175-184.

PMID: 39729551 PMC: 11740174. DOI: 10.1021/acs.analchem.4c03513.


Annotating full-scan MS data using tandem MS libraries.

Xing S, Charron-Lamoureux V, Abiead Y, Dorrestein P. bioRxiv. 2024.

PMID: 39464143 PMC: 11507738. DOI: 10.1101/2024.10.14.618269.


Plasma metabolomics profiles and breast cancer risk.

Wu H, Lai Y, Liao Y, Deyssenroth M, Miller G, Santella R. Breast Cancer Res. 2024; 26(1):141.

PMID: 39385226 PMC: 11463119. DOI: 10.1186/s13058-024-01896-5.


Distinguishing Artifactual Fatty Acid Dimers from Fatty Acid Esters of Hydroxy Fatty Acids in Untargeted LC-MS Pipelines.

Nelson A, Queathem E, Puchalska P. Methods Mol Biol. 2024; 2855:67-84.

PMID: 39354301 DOI: 10.1007/978-1-0716-4116-3_4.

