» Articles » PMID: 21063948

Spectra, Chromatograms, Metadata: MzML-the Standard Data Format for Mass Spectrometer Output

Overview
Specialty Molecular Biology
Date 2010 Nov 11
PMID 21063948
Citations 8
Authors
Affiliations
Soon will be listed here.
Abstract

This chapter describes Mass Spectrometry Markup Language (mzML), an XML-based and vendor-neutral standard data format for storage and exchange of mass spectrometer output like raw spectra and peak lists. It is intended to replace its two precursor data formats (mzData and mzXML), which had been developed independently a few years earlier. Hence, with the release of mzML, the problem of having two different formats for the same purposes is solved, and with it the duplicated effort of maintaining and supporting two data formats. The new format has been developed by a broad-based consortium of major instrument vendors, software vendors, and academic researchers under the aegis of the Human Proteome Organisation (HUPO), Proteomics Standards Initiative (PSI), with full participation of the main developers of the precursor formats. This comprehensive approach helped mzML to become a generally accepted standard. Furthermore, the collaborative development insured that mzML has adopted the best features of its precursor formats. In this chapter, we discuss mzML's development history, its design principles and use cases, as well as its main building components. We also present the available documentation, an example file, and validation software for mzML.

Citing Articles

OmicsSuite: a customized and pipelined suite for analysis and visualization of multi-omics big data.

Miao B, Dong W, Gu Y, Han Z, Luo X, Ke C Hortic Res. 2023; 10(11):uhad195.

PMID: 38023482 PMC: 10673651. DOI: 10.1093/hr/uhad195.


A Current Encyclopedia of Bioinformatics Tools, Data Formats and Resources for Mass Spectrometry Lipidomics.

Hoffmann N, Mayer G, Has C, Kopczynski D, Al Machot F, Schwudke D Metabolites. 2022; 12(7).

PMID: 35888710 PMC: 9319858. DOI: 10.3390/metabo12070584.


PERCEPTRON: an open-source GPU-accelerated proteoform identification pipeline for top-down proteomics.

Khalid M, Iman K, Ghafoor A, Saboor M, Ali A, Muaz U Nucleic Acids Res. 2021; 49(W1):W510-W515.

PMID: 33999207 PMC: 8262694. DOI: 10.1093/nar/gkab368.


Proteome Discoverer-A Community Enhanced Data Processing Suite for Protein Informatics.

Orsburn B Proteomes. 2021; 9(1).

PMID: 33806881 PMC: 8006021. DOI: 10.3390/proteomes9010015.


Implementing FAIR data management within the German Network for Bioinformatics Infrastructure (de.NBI) exemplified by selected use cases.

Mayer G, Muller W, Schork K, Uszkoreit J, Weidemann A, Wittig U Brief Bioinform. 2021; 22(5).

PMID: 33589928 PMC: 8425304. DOI: 10.1093/bib/bbab010.