» Articles » PMID: 39436595

Comprehensive Peptide Mapping Is Crucial for Proteogenomics and Proteomics

Overview
Specialty Molecular Biology
Date 2024 Oct 22
PMID 39436595
Authors
Affiliations
Soon will be listed here.
Abstract

Proteogenomics enables the confirmation and refinement of gene models, the detection of new ones, and the proposition of alternative transcripts using support at the protein level. Such evidence is usually generated using mass spectrometry and subsequent result mapping to various sequence databases. This workflow entails several problems: (1) To speed up the analysis, only a small set of expected proteins is searched; (2) database search tools generally do not provide mapping to the genome; and (3) upon new releases of the sequence databases, expensive rerunning of all results would need to be performed. Therefore, fast and accurate peptide mapping is needed as part of proteogenomic pipelines. Unfortunately, some available tools have technical shortcomings. Thus, a set of test cases was developed to allow tool developers to test their implementations comprehensively. The need for comprehensive testing is exemplified by PGx and PGM, two published tools that could only solve a subset of test cases. Lelantos passed all test cases. A set of comprehensive test cases has been developed to overcome these issues. Many unpublished peptide mapping tools are part of proteogenomic workflows, and such tools would also benefit from comprehensive testing. Finally, peptide mapping may also be crucial for proteomics because sequence databases change over time. In response, peptide remapping should be performed to ensure that peptides identifying a protein are proteotypic in a larger sequence context.

References
1.
Castellana N, Bafna V . Proteogenomics to discover the full coding content of genomes: a computational perspective. J Proteomics. 2010; 73(11):2124-35. PMC: 2949459. DOI: 10.1016/j.jprot.2010.06.007. View

2.
Aebersold R . Constellations in a cellular universe. Nature. 2003; 422(6928):115-6. DOI: 10.1038/422115a. View

3.
Allmer J . Algorithms for the de novo sequencing of peptides from tandem mass spectra. Expert Rev Proteomics. 2011; 8(5):645-57. DOI: 10.1586/epr.11.54. View

4.
Geer L, Markey S, Kowalak J, Wagner L, Xu M, Maynard D . Open mass spectrometry search algorithm. J Proteome Res. 2004; 3(5):958-64. DOI: 10.1021/pr0499491. View

5.
Kim S, Pevzner P . MS-GF+ makes progress towards a universal database search tool for proteomics. Nat Commun. 2014; 5:5277. PMC: 5036525. DOI: 10.1038/ncomms6277. View