» Articles » PMID: 21955121

Data Analysis Strategy for Maximizing High-confidence Protein Identifications in Complex Proteomes Such As Human Tumor Secretomes and Human Serum

Overview
Journal J Proteome Res
Specialty Biochemistry
Date 2011 Sep 30
PMID 21955121
Citations 9
Authors
Affiliations
Soon will be listed here.
Abstract

Detection of biologically interesting, low-abundance proteins in complex proteomes such as serum typically requires extensive fractionation and high-performance mass spectrometers. Processing of the resulting large data sets involves trade-offs between confidence of identification and depth of protein coverage; that is, higher stringency filters preferentially reduce the number of low-abundance proteins identified. In the current study, an alternative database search and results filtering strategies were evaluated using test samples ranging from purified proteins to ovarian tumor secretomes and human serum to maximize peptide and protein coverage. Full and partial tryptic searches were compared because substantial numbers of partial tryptic peptides were observed in all samples, and the proportion of partial tryptic peptides was particularly high for serum. When data filters that yielded similar false discovery rates (FDR) were used, full tryptic searches detected far fewer peptides than partial tryptic searches. In contrast to the common practice of using full tryptic specificity and a narrow precursor mass tolerance, more proteins and peptides could be confidently identified using a partial tryptic database search with a 100 ppm precursor mass tolerance followed by filtering of results using 10 ppm mass error and full tryptic boundaries.

Citing Articles

Identification and verification of plasma protein biomarkers that accurately identify an ectopic pregnancy.

Beer L, Yin X, Ding J, Senapati S, Sammel M, Barnhart K Clin Proteomics. 2023; 20(1):37.

PMID: 37715129 PMC: 10503165. DOI: 10.1186/s12014-023-09425-w.


Proteomic Analysis of Extracellular Vesicles Derived from MDA-MB-231 Cells in Microgravity.

Chen Y, Xue F, Russo A, Wan Y Protein J. 2021; 40(1):108-118.

PMID: 33387250 PMC: 8297522. DOI: 10.1007/s10930-020-09949-2.


CLIC1 and CLIC4 complement CA125 as a diagnostic biomarker panel for all subtypes of epithelial ovarian cancer.

Singha B, Harper S, Goldman A, Bitler B, Aird K, Borowsky M Sci Rep. 2018; 8(1):14725.

PMID: 30282979 PMC: 6170428. DOI: 10.1038/s41598-018-32885-2.


JUMPg: An Integrative Proteogenomics Pipeline Identifying Unannotated Proteins in Human Brain and Cancer Cells.

Li Y, Wang X, Cho J, Shaw T, Wu Z, Bai B J Proteome Res. 2016; 15(7):2309-20.

PMID: 27225868 PMC: 5033046. DOI: 10.1021/acs.jproteome.6b00344.


Comparative secretome analysis of epithelial and mesenchymal subpopulations of head and neck squamous cell carcinoma identifies S100A4 as a potential therapeutic target.

Rasanen K, Sriswasdi S, Valiga A, Tang H, Zhang G, Perego M Mol Cell Proteomics. 2013; 12(12):3778-92.

PMID: 24037664 PMC: 3861723. DOI: 10.1074/mcp.M113.029587.


References
1.
Elias J, Gygi S . Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat Methods. 2007; 4(3):207-14. DOI: 10.1038/nmeth1019. View

2.
Haas W, Faherty B, Gerber S, Elias J, Beausoleil S, Bakalarski C . Optimization and use of peptide mass measurement accuracy in shotgun proteomics. Mol Cell Proteomics. 2006; 5(7):1326-37. DOI: 10.1074/mcp.M500339-MCP200. View

3.
Hsieh E, Hoopmann M, MacLean B, MacCoss M . Comparison of database search strategies for high precursor mass accuracy MS/MS data. J Proteome Res. 2009; 9(2):1138-43. PMC: 2818881. DOI: 10.1021/pr900816a. View

4.
Moore R, Young M, Lee T . Qscore: an algorithm for evaluating SEQUEST database search results. J Am Soc Mass Spectrom. 2002; 13(4):378-86. DOI: 10.1016/S1044-0305(02)00352-5. View

5.
Rodriguez J, Gupta N, Smith R, Pevzner P . Does trypsin cut before proline?. J Proteome Res. 2007; 7(1):300-5. DOI: 10.1021/pr0705035. View