» Articles » PMID: 26381204

MapDIA: Preprocessing and Statistical Analysis of Quantitative Proteomics Data from Data Independent Acquisition Mass Spectrometry

Overview
Journal J Proteomics
Publisher Elsevier
Specialty Biochemistry
Date 2015 Sep 19
PMID 26381204
Citations 78
Authors
Affiliations
Soon will be listed here.
Abstract

Unlabelled: Data independent acquisition (DIA) mass spectrometry is an emerging technique that offers more complete detection and quantification of peptides and proteins across multiple samples. DIA allows fragment-level quantification, which can be considered as repeated measurements of the abundance of the corresponding peptides and proteins in the downstream statistical analysis. However, few statistical approaches are available for aggregating these complex fragment-level data into peptide- or protein-level statistical summaries. In this work, we describe a software package, mapDIA, for statistical analysis of differential protein expression using DIA fragment-level intensities. The workflow consists of three major steps: intensity normalization, peptide/fragment selection, and statistical analysis. First, mapDIA offers normalization of fragment-level intensities by total intensity sums as well as a novel alternative normalization by local intensity sums in retention time space. Second, mapDIA removes outlier observations and selects peptides/fragments that preserve the major quantitative patterns across all samples for each protein. Last, using the selected fragments and peptides, mapDIA performs model-based statistical significance analysis of protein-level differential expression between specified groups of samples. Using a comprehensive set of simulation datasets, we show that mapDIA detects differentially expressed proteins with accurate control of the false discovery rates. We also describe the analysis procedure in detail using two recently published DIA datasets generated for 14-3-3β dynamic interaction network and prostate cancer glycoproteome.

Availability: The software was written in C++ language and the source code is available for free through SourceForge website http://sourceforge.net/projects/mapdia/.This article is part of a Special Issue entitled: Computational Proteomics.

Citing Articles

Cross-Run Hybrid Features Improve the Identification of Data-Independent Acquisition Proteomics.

Liu Y, Mei L, Liang C, Zhong C, Tong M, Yu R ACS Omega. 2024; 9(46):46362-46372.

PMID: 39583733 PMC: 11579728. DOI: 10.1021/acsomega.4c07398.


An Equine Protein Atlas Highlights Synovial Fluid Proteome Dynamics during Experimentally LPS-Induced Arthritis.

Bundgaard L, Arman F, Ahrman E, Walters M, Auf dem Keller U, Malmstrom J J Proteome Res. 2024; 23(11):4849-4863.

PMID: 39395021 PMC: 11536436. DOI: 10.1021/acs.jproteome.4c00125.


Myocardial ultrastructure of human heart failure with preserved ejection fraction.

Meddeb M, Koleini N, Binek A, Keykhaei M, Darehgazani R, Kwon S Nat Cardiovasc Res. 2024; 3(8):907-914.

PMID: 39196036 PMC: 11498130. DOI: 10.1038/s44161-024-00516-x.


Patient subtyping analysis of baseline multi-omic data reveals distinct pre-immune states associated with antibody response to seasonal influenza vaccination.

Sevim Bayrak C, Forst C, Jones D, Gresham D, Pushalkar S, Wu S Clin Immunol. 2024; 266:110333.

PMID: 39089348 PMC: 11340208. DOI: 10.1016/j.clim.2024.110333.


Topical and oral peroxisome proliferator-activated receptor-α agonist ameliorates diabetic corneal neuropathy.

Mansoor H, Yu Lee I, Lin M, Ang H, Xue Y, Krishaa L Sci Rep. 2024; 14(1):13435.

PMID: 38862650 PMC: 11167005. DOI: 10.1038/s41598-024-64451-4.


References
1.
MacLean B, Tomazela D, Shulman N, Chambers M, Finney G, Frewen B . Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics. 2010; 26(7):966-8. PMC: 2844992. DOI: 10.1093/bioinformatics/btq054. View

2.
Ashburner M, Ball C, Blake J, Botstein D, Butler H, Cherry J . Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 2000; 25(1):25-9. PMC: 3037419. DOI: 10.1038/75556. View

3.
Michalski A, Cox J, Mann M . More than 100,000 detectable peptide species elute in single shotgun proteomics runs but the majority is inaccessible to data-dependent LC-MS/MS. J Proteome Res. 2011; 10(4):1785-93. DOI: 10.1021/pr101060v. View

4.
Hornbeck P, Kornhauser J, Tkachev S, Zhang B, Skrzypek E, Murray B . PhosphoSitePlus: a comprehensive resource for investigating the structure and function of experimentally determined post-translational modifications in man and mouse. Nucleic Acids Res. 2011; 40(Database issue):D261-70. PMC: 3245126. DOI: 10.1093/nar/gkr1122. View

5.
Gillet L, Navarro P, Tate S, Rost H, Selevsek N, Reiter L . Targeted data extraction of the MS/MS spectra generated by data-independent acquisition: a new concept for consistent and accurate proteome analysis. Mol Cell Proteomics. 2012; 11(6):O111.016717. PMC: 3433915. DOI: 10.1074/mcp.O111.016717. View