» Articles » PMID: 31013937

CFM-ID 3.0: Significantly Improved ESI-MS/MS Prediction and Compound Identification

Overview
Journal Metabolites
Publisher MDPI
Date 2019 Apr 25
PMID 31013937
Citations 87
Authors
Affiliations
Soon will be listed here.
Abstract

Metabolite identification for untargeted metabolomics is often hampered by the lack of experimentally collected reference spectra from tandem mass spectrometry (MS/MS). To circumvent this problem, Competitive Fragmentation Modeling-ID (CFM-ID) was developed to accurately predict electrospray ionization-MS/MS (ESI-MS/MS) spectra from chemical structures and to aid in compound identification via MS/MS spectral matching. While earlier versions of CFM-ID performed very well, CFM-ID's performance for predicting the MS/MS spectra of certain classes of compounds, including many lipids, was quite poor. Furthermore, CFM-ID's compound identification capabilities were limited because it did not use experimentally available MS/MS spectra nor did it exploit metadata in its spectral matching algorithm. Here, we describe significant improvements to CFM-ID's performance and speed. These include (1) the implementation of a rule-based fragmentation approach for lipid MS/MS spectral prediction, which greatly improves the speed and accuracy of CFM-ID; (2) the inclusion of experimental MS/MS spectra and other metadata to enhance CFM-ID's compound identification abilities; (3) the development of new scoring functions that improves CFM-ID's accuracy by 21.1%; and (4) the implementation of a chemical classification algorithm that correctly classifies unknown chemicals (based on their MS/MS spectra) in >80% of the cases. This improved version called CFM-ID 3.0 is freely available as a web server. Its source code is also accessible online.

Citing Articles

Optical control of sphingolipid biosynthesis using photoswitchable sphingosines.

Kol M, Novak A, Morstein J, Schroer C, Sokoya T, Mensing S J Lipid Res. 2024; 66(1):100724.

PMID: 39672331 PMC: 11782902. DOI: 10.1016/j.jlr.2024.100724.


Optical control of sphingolipid biosynthesis using photoswitchable sphingosines.

Kol M, Novak A, Morstein J, Schroer C, Sokoya T, Mensing S bioRxiv. 2024; .

PMID: 39484495 PMC: 11527141. DOI: 10.1101/2024.10.24.619506.


Deep learning prediction of electrospray ionization tandem mass spectra of chemically derived molecules.

Chen B, Li H, Huang R, Tang Y, Li F Nat Commun. 2024; 15(1):8396.

PMID: 39333165 PMC: 11436754. DOI: 10.1038/s41467-024-52805-5.


Navigating common pitfalls in metabolite identification and metabolomics bioinformatics.

Novoa-Del-Toro E, Witting M Metabolomics. 2024; 20(5):103.

PMID: 39305388 PMC: 11416380. DOI: 10.1007/s11306-024-02167-2.


Application of a molecular networking approach using LC-HRMS combined with the MetWork webserver for clinical and forensic toxicology.

Magny R, Beauxis Y, Genta-Jouve G, Bourgogne E Heliyon. 2024; 10(17):e36735.

PMID: 39286100 PMC: 11402778. DOI: 10.1016/j.heliyon.2024.e36735.


References
1.
Heinonen M, Rantanen A, Mielikainen T, Kokkonen J, Kiuru J, Ketola R . FiD: a software for ab initio structural identification of product ions from tandem mass spectrometric data. Rapid Commun Mass Spectrom. 2008; 22(19):3043-52. DOI: 10.1002/rcm.3701. View

2.
Fahy E, Subramaniam S, Murphy R, Nishijima M, Raetz C, Shimizu T . Update of the LIPID MAPS comprehensive classification system for lipids. J Lipid Res. 2008; 50 Suppl:S9-14. PMC: 2674711. DOI: 10.1194/jlr.R800095-JLR200. View

3.
Horai H, Arita M, Kanaya S, Nihei Y, Ikeda T, Suwa K . MassBank: a public repository for sharing mass spectral data for life sciences. J Mass Spectrom. 2010; 45(7):703-14. DOI: 10.1002/jms.1777. View

4.
Sawada Y, Nakabayashi R, Yamada Y, Suzuki M, Sato M, Sakata A . RIKEN tandem mass spectral database (ReSpect) for phytochemicals: a plant-specific MS/MS-based data resource and database. Phytochemistry. 2012; 82:38-45. DOI: 10.1016/j.phytochem.2012.07.007. View

5.
Ridder L, van der Hooft J, Verhoeven S, de Vos R, van Schaik R, Vervoort J . Substructure-based annotation of high-resolution multistage MS(n) spectral trees. Rapid Commun Mass Spectrom. 2012; 26(20):2461-71. DOI: 10.1002/rcm.6364. View