PMID: 38871251

Rescoring Peptide Spectrum Matches: Boosting Proteomics Performance by Integrating Peptide Property Predictors Into Peptide Identification

Abstract

Rescoring of peptide spectrum matches originating from database search engines enabled by peptide property predictors is exceeding the performance of peptide identification from traditional database search engines. In contrast to the peptide spectrum match scores calculated by traditional database search engines, rescoring peptide spectrum matches generates scores based on comparing observed and predicted peptide properties, such as fragment ion intensities and retention times. These newly generated scores enable a more efficient discrimination between correct and incorrect peptide spectrum matches. This approach was shown to lead to substantial improvements in the number of confidently identified peptides, facilitating the analysis of challenging datasets in various fields such as immunopeptidomics, metaproteomics, proteogenomics, and single-cell proteomics. In this review, we summarize the key elements leading up to the recent introduction of multiple data-driven rescoring pipelines. We provide an overview of relevant post-processing rescoring tools, introduce prominent data-driven rescoring pipelines for various applications, and highlight limitations, opportunities, and future perspectives of this approach and its impact on mass spectrometry-based proteomics.
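As an illustration of the idea described above, the following minimal sketch derives two rescoring features for a single peptide spectrum match by comparing observed and predicted properties: a normalized spectral angle for fragment ion intensities and an absolute retention time error. It is not taken from any of the pipelines covered by the review. The function names, the dictionary keys, and the choice of the spectral angle as the similarity measure are assumptions made for illustration, and the predicted values are assumed to come from some external predictor.

```python
import math


def spectral_angle(observed, predicted):
    """Normalized spectral angle between two aligned intensity vectors.

    Returns a value in [0, 1]; 1 means the intensity patterns are identical
    up to scaling. Both vectors are assumed to list the same fragment ions
    in the same order (hypothetical input layout).
    """
    dot = sum(o * p for o, p in zip(observed, predicted))
    norm_o = math.sqrt(sum(o * o for o in observed))
    norm_p = math.sqrt(sum(p * p for p in predicted))
    if norm_o == 0 or norm_p == 0:
        return 0.0  # no signal to compare
    # Clamp to guard against floating-point drift outside [-1, 1].
    cosine = max(-1.0, min(1.0, dot / (norm_o * norm_p)))
    return 1.0 - 2.0 * math.acos(cosine) / math.pi


def rescoring_features(psm):
    """Build a feature dictionary for one PSM (keys are illustrative)."""
    return {
        "search_engine_score": psm["search_engine_score"],
        "spectral_angle": spectral_angle(
            psm["observed_intensities"], psm["predicted_intensities"]
        ),
        "abs_rt_error": abs(psm["observed_rt"] - psm["predicted_rt"]),
    }


# Hypothetical PSM with made-up values, for demonstration only.
example_psm = {
    "search_engine_score": 42.0,
    "observed_intensities": [0.9, 0.1, 0.5, 0.0],
    "predicted_intensities": [1.0, 0.2, 0.4, 0.0],
    "observed_rt": 35.2,
    "predicted_rt": 34.0,
}
features = rescoring_features(example_psm)
print(features)
```

In a rescoring workflow, features of this kind would typically be passed, together with the original search engine score, to a semi-supervised classifier that separates target from decoy matches. That step is omitted here.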

Citing Articles

Proteome-wide non-cleavable crosslink identification with MS Annika 3.0 reveals the structure of the C. elegans Box C/D complex.

Birklbauer M, Muller F, Sivakumar Geetha S, Matzinger M, Mechtler K, Dorfer V Commun Chem. 2024; 7(1):300.

PMID: 39702463 PMC: 11659399. DOI: 10.1038/s42004-024-01386-x.


The 2024 Report on the Human Proteome from the HUPO Human Proteome Project.

Omenn G, Orchard S, Lane L, Lindskog C, Pineau C, Overall C J Proteome Res. 2024; 23(12):5296-5311.

PMID: 39514846 PMC: 11781352. DOI: 10.1021/acs.jproteome.4c00776.
