A Mass-tolerant Database Search Identifies a Large Proportion of Unassigned Spectra in Shotgun Proteomics As Modified Peptides

Overview
Journal Nat Biotechnol
Specialty Biotechnology
Date 2015 Jun 16
PMID 26076430
Citations 158
Abstract

Fewer than half of all tandem mass spectrometry (MS/MS) spectra acquired in shotgun proteomics experiments are typically matched to a peptide with high confidence. Here we determine the identity of unassigned peptides using an ultra-tolerant Sequest database search that allows peptide matching even with modifications of unknown masses up to ± 500 Da. In a proteome-wide data set on HEK293 cells (9,513 proteins and 396,736 peptides), this approach matched an additional 184,000 modified peptides, which were linked to biological and chemical modifications representing 523 distinct mass bins, including phosphorylation, glycosylation and methylation. We localized all unknown modification masses to specific regions within a peptide. Known modifications were assigned to the correct amino acids with frequencies >90%. We conclude that at least one-third of unassigned spectra arise from peptides with substoichiometric modifications.
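The core idea of a mass-tolerant search can be illustrated at the precursor level: a spectrum is allowed to match a candidate peptide even when its observed mass differs from the unmodified theoretical mass by up to ±500 Da, and the resulting mass differences are then grouped into bins. The following is a minimal sketch under stated assumptions, not the authors' Sequest workflow: the function names, the 0.01 Da bin width, and the reduced amino acid table are hypothetical choices made for illustration, and fragment-ion scoring is omitted entirely.

```python
from collections import Counter

# Monoisotopic residue masses in Da (standard values; only a subset of
# amino acids is included to keep the sketch short).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "L": 113.08406, "K": 128.09496,
    "E": 129.04259, "R": 156.10111,
}
WATER = 18.01056  # mass of H2O added to the sum of residue masses


def peptide_mass(seq):
    """Theoretical monoisotopic mass of an unmodified peptide."""
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER


def delta_mass(observed_mass, seq, tolerance=500.0):
    """Observed minus theoretical mass, or None if outside the tolerance.

    The 500 Da default mirrors the window quoted in the abstract.
    """
    delta = observed_mass - peptide_mass(seq)
    return delta if abs(delta) <= tolerance else None


def bin_deltas(deltas, width=0.01):
    """Count mass differences per bin; keys are integer bin indices.

    The bin width is an assumption made for this sketch.
    """
    return Counter(round(d / width) for d in deltas)


# Usage: a peptide carrying one phosphorylation (+79.96633 Da) is still
# accepted, and its mass difference is recovered for binning.
observed = peptide_mass("GASP") + 79.96633
shift = delta_mass(observed, "GASP")
bins = bin_deltas([shift, 79.96640, 14.01565])
```

Keying bins by an integer index rather than a floating-point mass avoids near-duplicate keys caused by rounding; a bin index can be converted back to an approximate mass by multiplying it by the bin width.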

Citing Articles

C-terminal amides mark proteins for degradation via SCF-FBXO31.

Muhar M, Farnung J, Cernakova M, Hofmann R, Henneberg L, Pfleiderer M Nature. 2025; 638(8050):519-527.

PMID: 39880951 PMC: 11821526. DOI: 10.1038/s41586-024-08475-w.


Proteomic Profiling Towards a Better Understanding of Genetic Based Muscular Diseases: The Current Picture and a Look to the Future.

Pauper M, Hentschel A, Tiburcy M, Beltran S, Ruck T, Schara-Schmidt U Biomolecules. 2025; 15(1).

PMID: 39858524 PMC: 11763865. DOI: 10.3390/biom15010130.


Selected Ion Extraction of Peptides with Heavy Isotopes and Hydrogen Loss Reduces the Type II Error in Plasma Proteomics.

Dufresne J, Chen Z, Sehajpal P, Bowden P, Ho J, Hsu C ACS Omega. 2025; 10(1):281-293.

PMID: 39829503 PMC: 11739973. DOI: 10.1021/acsomega.4c05624.


Proteomics Can Rise to the Challenge of Pseudogenes' Coding Nature.

Vasylieva V, Arefiev I, Bourassa F, Trifiro F, Brunet M J Proteome Res. 2024; 23(12):5233-5249.

PMID: 39486438 PMC: 11629383. DOI: 10.1021/acs.jproteome.4c00116.


Dear-PSM: A deep learning-based peptide search engine enables full database search for proteomics.

He Q, Li X, Zhong J, Yang G, Han J, Shuai J Smart Med. 2024; 3(3):e20240014.

PMID: 39420951 PMC: 11425048. DOI: 10.1002/SMMD.20240014.

