» Articles » PMID: 24090032

Mspire-Simulator: LC-MS Shotgun Proteomic Simulator for Creating Realistic Gold Standard Data

Overview
Journal J Proteome Res
Specialty Biochemistry
Date 2013 Oct 5
PMID 24090032
Citations 9
Authors
Affiliations
Soon will be listed here.
Abstract

The most important step in any quantitative proteomic pipeline is feature detection (aka peak picking). However, generating quality hand-annotated data sets to validate the algorithms, especially for lower abundance peaks, is nearly impossible. An alternative for creating gold standard data is to simulate it with features closely mimicking real data. We present Mspire-Simulator, a free, open-source shotgun proteomic simulator that goes beyond previous simulation attempts by generating LC-MS features with realistic m/z and intensity variance along with other noise components. It also includes machine-learned models for retention time and peak intensity prediction and a genetic algorithm to custom fit model parameters for experimental data sets. We show that these methods are applicable to data from three different mass spectrometers, including two fundamentally different types, and show visually and analytically that simulated peaks are nearly indistinguishable from actual data. Researchers can use simulated data to rigorously test quantitation software, and proteomic researchers may benefit from overlaying simulated data on actual data sets.

Citing Articles

Simulation of mass spectrometry-based proteomics data with Synthedia.

Leeming M, Ang C, Nie S, Varshney S, Williamson N Bioinform Adv. 2023; 3(1):vbac096.

PMID: 36698761 PMC: 9825309. DOI: 10.1093/bioadv/vbac096.


Normalizing and Correcting Variable and Complex LC-MS Metabolomic Data with the R Package pseudoDrift.

Rodriguez J, Gomez-Cano L, Grotewold E, de Leon N Metabolites. 2022; 12(5).

PMID: 35629939 PMC: 9144304. DOI: 10.3390/metabo12050435.


SMITER-A Python Library for the Simulation of LC-MS/MS Experiments.

Kosters M, Leufken J, Leidel S Genes (Basel). 2021; 12(3).

PMID: 33799543 PMC: 8000309. DOI: 10.3390/genes12030396.


In Silico Optimization of Mass Spectrometry Fragmentation Strategies in Metabolomics.

Wandy J, Davies V, van der Hooft J, Weidt S, Daly R, Rogers S Metabolites. 2019; 9(10).

PMID: 31600991 PMC: 6836109. DOI: 10.3390/metabo9100219.


Accelerating Lipidomic Method Development through Simulation.

Hutchins P, Russell J, Coon J Anal Chem. 2019; 91(15):9698-9706.

PMID: 31298839 PMC: 6716604. DOI: 10.1021/acs.analchem.9b01234.