
Prediction of Missed Cleavage Sites in Tryptic Peptides Aids Protein Identification in Proteomics

Overview
Journal J Proteome Res
Specialty Biochemistry
Date 2007 Jan 6
PMID 17203985
Citations 44
Abstract

Protein identification via peptide mass fingerprinting (PMF) remains a key component of high-throughput proteomics experiments in post-genomic science. Candidate protein identifications are made using bioinformatic tools from peptide peak lists obtained via mass spectrometry (MS). These algorithms rely on several search parameters, including the number of potential uncut peptide bonds matching the primary specificity of the hydrolytic enzyme used in the experiment. Typically, up to one of these "missed cleavages" is considered by the bioinformatics search tools, usually after digestion of the in silico proteome by trypsin. Using two distinct, nonredundant datasets of peptides identified via PMF and tandem MS, a simple predictive method based on information theory is presented which is able to identify experimentally defined missed cleavages with up to 90% accuracy from amino acid sequence alone. Using this simple protocol, we are able to "mask" candidate protein databases so that confident missed cleavage sites need not be considered for in silico digestion. We show that this leads to an improvement in database searching, with two different search engines, using the PMF dataset as a test set. In addition, the improved approach is also demonstrated on an independent PMF dataset of known proteins that also has corresponding high-quality tandem MS data, validating the protein identifications. This approach has wider applicability for proteomics database searching, and the program for predicting missed cleavages and masking FASTA-formatted protein sequence databases has been made available via http://ispider.smith.man.ac.uk/MissedCleave.
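To make the search parameter described in the abstract concrete, the following is a minimal Python sketch of in silico tryptic digestion with a limit on missed cleavages. It is an illustration under stated assumptions, not the authors' program: it uses the common simplified trypsin rule (cut after K or R unless the next residue is P), the function names `cleavage_sites` and `digest` are hypothetical, and the `masked_sites` argument is only one possible reading of "masking", in which a site predicted to be missed is never cut. The information-theoretic predictor itself is not reproduced here, so the masked positions must be supplied by hand.

```python
from typing import List, Optional, Set


def cleavage_sites(seq: str) -> List[int]:
    """Return positions i where trypsin would cut between seq[i-1] and seq[i].

    Simplified rule (an assumption of this sketch): cut after K or R,
    except when the following residue is P.
    """
    return [
        i
        for i in range(1, len(seq))
        if seq[i - 1] in "KR" and seq[i] != "P"
    ]


def digest(
    seq: str,
    max_missed: int = 1,
    masked_sites: Optional[Set[int]] = None,
) -> List[str]:
    """Enumerate tryptic peptides allowing up to max_missed missed cleavages.

    masked_sites holds cut positions assumed to be confidently missed; they
    are removed from the cut list, so peptides always span them and they do
    not count toward max_missed.
    """
    masked = masked_sites or set()
    cuts = [0] + [s for s in cleavage_sites(seq) if s not in masked] + [len(seq)]
    peptides = []
    for i in range(len(cuts) - 1):
        # j - i - 1 is the number of internal cut sites left uncut.
        for j in range(i + 1, min(i + 2 + max_missed, len(cuts))):
            peptides.append(seq[cuts[i]:cuts[j]])
    return peptides


if __name__ == "__main__":
    protein = "AAKGGRCCKPDDK"  # toy sequence, not from the paper
    print(digest(protein, max_missed=0))
    print(digest(protein, max_missed=1))
    # Hypothetical mask: treat the site after the first K (position 3) as missed.
    print(digest(protein, max_missed=0, masked_sites={3}))
```

With a mask in place, the search can run with fewer allowed missed cleavages while still generating the peptides that span sites expected to stay uncut, which keeps the candidate peptide list smaller.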

Citing Articles

Development of a Proteomic Workflow for the Identification of Heparan Sulphate Proteoglycan-Binding Substrates of ADAM17.

Calligaris M, Spano D, Puccio M, Muller S, Bonelli S, Lo Pinto M. Proteomics. 2024; 24(23-24):e202400076.

PMID: 39318062 PMC: 11648071. DOI: 10.1002/pmic.202400076.


Noncanonical inheritance of phenotypic information by protein amyloids.

Eroglu M, Zocher T, McAuley J, Webster R, Xiao M, Yu B. Nat Cell Biol. 2024; 26(10):1712-1724.

PMID: 39223373 DOI: 10.1038/s41556-024-01494-9.


Optimal conditions for carrying out trypsin digestions on complex proteomes: From bulk samples to single cells.

Mansuri M, Bathla S, Lam T, Nairn A, Williams K. J Proteomics. 2024; 297:105109.

PMID: 38325732 PMC: 10939724. DOI: 10.1016/j.jprot.2024.105109.


DbyDeep: Exploration of MS-Detectable Peptides via Deep Learning.

Son J, Na S, Paek E. Anal Chem. 2023; 95(30):11193-11200.

PMID: 37459568 PMC: 10401496. DOI: 10.1021/acs.analchem.3c00460.


Detergent-Assisted Protein Digestion-On the Way to Avoid the Key Bottleneck of Shotgun Bottom-Up Proteomics.

Danko K, Lukasheva E, Zhukov V, Zgoda V, Frolov A. Int J Mol Sci. 2022; 23(22).

PMID: 36430380 PMC: 9695859. DOI: 10.3390/ijms232213903.

