» Articles » PMID: 24013927

Multi-profile Bayesian Alignment Model for LC-MS Data Analysis with Integration of Internal Standards

Overview
Journal Bioinformatics
Specialty Biology
Date 2013 Sep 10
PMID 24013927
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Liquid chromatography-mass spectrometry (LC-MS) has been widely used for profiling expression levels of biomolecules in various '-omic' studies including proteomics, metabolomics and glycomics. Appropriate LC-MS data preprocessing steps are needed to detect true differences between biological groups. Retention time (RT) alignment, which is required to ensure that ion intensity measurements among multiple LC-MS runs are comparable, is one of the most important yet challenging preprocessing steps. Current alignment approaches estimate RT variability using either single chromatograms or detected peaks, but do not simultaneously take into account the complementary information embedded in the entire LC-MS data.

Results: We propose a Bayesian alignment model for LC-MS data analysis. The alignment model provides estimates of the RT variability along with uncertainty measures. The model enables integration of multiple sources of information including internal standards and clustered chromatograms in a mathematically rigorous framework. We apply the model to LC-MS metabolomic, proteomic and glycomic data. The performance of the model is evaluated based on ground-truth data, by measuring correlation of variation, RT difference across runs and peak-matching performance. We demonstrate that Bayesian alignment model improves significantly the RT alignment performance through appropriate integration of relevant information.

Availability And Implementation: MATLAB code, raw and preprocessed LC-MS data are available at http://omics.georgetown.edu/alignLCMS.html.

Contact: hwr@georgetown.edu.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

Mass Spectrometry-based Metabolomics in Translational Research.

Kim S, Song H, Lee H, Yoo H Adv Exp Med Biol. 2021; 1310:509-531.

PMID: 33834448 DOI: 10.1007/978-981-33-6064-8_19.


Gaussian Process Modeling of Protein Turnover.

Rahman M, Previs S, Kasumov T, Sadygov R J Proteome Res. 2016; 15(7):2115-22.

PMID: 27229456 PMC: 5292319. DOI: 10.1021/acs.jproteome.5b00990.


Preprocessing and Analysis of LC-MS-Based Proteomic Data.

Tsai T, Wang M, Ressom H Methods Mol Biol. 2015; 1362:63-76.

PMID: 26519169 PMC: 4838194. DOI: 10.1007/978-1-4939-3106-4_3.


Current controlled vocabularies are insufficient to uniquely map molecular entities to mass spectrometry signal.

Smith R, Taylor R, Prince J BMC Bioinformatics. 2015; 16 Suppl 7:S2.

PMID: 25952148 PMC: 4423578. DOI: 10.1186/1471-2105-16-S7-S2.


Incorporating peak grouping information for alignment of multiple liquid chromatography-mass spectrometry datasets.

Wandy J, Daly R, Breitling R, Rogers S Bioinformatics. 2015; 31(12):1999-2006.

PMID: 25649621 PMC: 4760236. DOI: 10.1093/bioinformatics/btv072.

References
1.
Zaia J . Mass spectrometry and glycomics. OMICS. 2010; 14(4):401-18. PMC: 3133787. DOI: 10.1089/omi.2009.0146. View

2.
Mann M, Kelleher N . Precision proteomics: the case for high resolution and high mass accuracy. Proc Natl Acad Sci U S A. 2008; 105(47):18132-8. PMC: 2587563. DOI: 10.1073/pnas.0800788105. View

3.
Aebersold R, Mann M . Mass spectrometry-based proteomics. Nature. 2003; 422(6928):198-207. DOI: 10.1038/nature01511. View

4.
Patti G, Yanes O, Siuzdak G . Innovation: Metabolomics: the apogee of the omics trilogy. Nat Rev Mol Cell Biol. 2012; 13(4):263-9. PMC: 3682684. DOI: 10.1038/nrm3314. View

5.
Hawkridge A, Muddiman D . Mass spectrometry-based biomarker discovery: toward a global proteome index of individuality. Annu Rev Anal Chem (Palo Alto Calif). 2010; 2:265-77. PMC: 3140421. DOI: 10.1146/annurev.anchem.1.031207.112942. View