» Articles » PMID: 20307295

In Silico Fragmentation for Computer Assisted Identification of Metabolite Mass Spectra

Overview
Publisher Biomed Central
Specialty Biology
Date 2010 Mar 24
PMID 20307295
Citations 219
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Mass spectrometry has become the analytical method of choice in metabolomics research. The identification of unknown compounds is the main bottleneck. In addition to the precursor mass, tandem MS spectra carry informative fragment peaks, but the coverage of spectral libraries of measured reference compounds are far from covering the complete chemical space. Compound libraries such as PubChem or KEGG describe a larger number of compounds, which can be used to compare their in silico fragmentation with spectra of unknown metabolites.

Results: We created the MetFrag suite to obtain a candidate list from compound libraries based on the precursor mass, subsequently ranked by the agreement between measured and in silico fragments. In the evaluation MetFrag was able to rank most of the correct compounds within the top 3 candidates returned by an exact mass query in KEGG. Compared to a previously published study, MetFrag obtained better results than the commercial MassFrontier software. Especially for large compound libraries, the candidates with a good score show a high structural similarity or just different stereochemistry, a subsequent clustering based on chemical distances reduces this redundancy. The in silico fragmentation requires less than a second to process a molecule, and MetFrag performs a search in KEGG or PubChem on average within 30 to 300 seconds, respectively, on an average desktop PC.

Conclusions: We presented a method that is able to identify small molecules from tandem MS measurements, even without spectral reference data or a large set of fragmentation rules. With today's massive general purpose compound libraries we obtain dozens of very similar candidates, which still allows a confident estimate of the correct compound class. Our tool MetFrag improves the identification of unknown substances from tandem MS spectra and delivers better results than comparable commercial software. MetFrag is available through a web application, web services and as java library. The web frontend allows the end-user to analyse single spectra and browse the results, whereas the web service and console application are aimed to perform batch searches and evaluation.

Citing Articles

Mining microbial and metabolic dark matter in extreme environments: a roadmap for harnessing the power of multi-omics data.

Han J, Li S, Li W, Dong L Adv Biotechnol (Singap). 2025; 2(3):26.

PMID: 39883228 PMC: 11740847. DOI: 10.1007/s44307-024-00034-8.


Introducing "Identification Probability" for Automated and Transferable Assessment of Metabolite Identification Confidence in Metabolomics and Related Studies.

Metz T, Chang C, Gautam V, Anjum A, Tian S, Wang F Anal Chem. 2024; 97(1):1-11.

PMID: 39699939 PMC: 11740175. DOI: 10.1021/acs.analchem.4c04060.


JESTR: Joint Embedding Space Technique for Ranking Candidate Molecules for the Annotation of Untargeted Metabolomics Data.

Kalia A, Krishnan D, Hassoun S ArXiv. 2024; .

PMID: 39606728 PMC: 11601792.


Impacts of Plu kaow ( Thunb.) Ethanolic Extract on Diabetes and Dyslipidemia in STZ Induced Diabetic Rats: Phytochemical Profiling, Cheminformatics Analyses, and Molecular Docking Studies.

Rahman S, Klamrak A, Nopkuesuk N, Nabnueangsap J, Janpan P, Choowongkomon K Antioxidants (Basel). 2024; 13(9).

PMID: 39334723 PMC: 11428413. DOI: 10.3390/antiox13091064.


Pollution gradients shape microbial communities associated with Ae. albopictus larval habitats in urban community gardens.

Duval P, Martin E, Vallon L, Antonelli P, Girard M, Signoret A FEMS Microbiol Ecol. 2024; 100(11).

PMID: 39327012 PMC: 11523617. DOI: 10.1093/femsec/fiae129.


References
1.
Sumner L, Amberg A, Barrett D, Beale M, Beger R, Daykin C . Proposed minimum reporting standards for chemical analysis Chemical Analysis Working Group (CAWG) Metabolomics Standards Initiative (MSI). Metabolomics. 2013; 3(3):211-221. PMC: 3772505. DOI: 10.1007/s11306-007-0082-2. View

2.
Heinonen M, Rantanen A, Mielikainen T, Kokkonen J, Kiuru J, Ketola R . FiD: a software for ab initio structural identification of product ions from tandem mass spectrometric data. Rapid Commun Mass Spectrom. 2008; 22(19):3043-52. DOI: 10.1002/rcm.3701. View

3.
Smith C, OMaille G, Want E, Qin C, Trauger S, Brandon T . METLIN: a metabolite mass spectral database. Ther Drug Monit. 2006; 27(6):747-51. DOI: 10.1097/01.ftd.0000179845.53213.39. View

4.
Kopka J, Schauer N, Krueger S, Birkemeyer C, Usadel B, Bergmuller E . GMD@CSB.DB: the Golm Metabolome Database. Bioinformatics. 2004; 21(8):1635-8. DOI: 10.1093/bioinformatics/bti236. View

5.
Stein S, Scott D . Optimization and testing of mass spectral library search algorithms for compound identification. J Am Soc Mass Spectrom. 2013; 5(9):859-66. DOI: 10.1016/1044-0305(94)87009-8. View