Combinatorial Approach for Large-scale Identification of Linked Peptides from Tandem Mass Spectrometry Spectra
Overview
Cell Biology
Molecular Biology
Authors
Affiliations
The combination of chemical cross-linking and mass spectrometry has recently been shown to constitute a powerful tool for studying protein-protein interactions and elucidating the structure of large protein complexes. However, computational methods for interpreting the complex MS/MS spectra from linked peptides are still in their infancy, making the high-throughput application of this approach largely impractical. Because of the lack of large annotated datasets, most current approaches do not capture the specific fragmentation patterns of linked peptides and therefore are not optimal for the identification of cross-linked peptides. Here we propose a generic approach to address this problem and demonstrate it using disulfide-bridged peptide libraries to (i) efficiently generate large mass spectral reference data for linked peptides at a low cost and (ii) automatically train an algorithm that can efficiently and accurately identify linked peptides from MS/MS spectra. We show that using this approach we were able to identify thousands of MS/MS spectra from disulfide-bridged peptides through comparison with proteome-scale sequence databases and significantly improve the sensitivity of cross-linked peptide identification. This allowed us to identify 60% more direct pairwise interactions between the protein subunits in the 20S proteasome complex than existing tools on cross-linking studies of the proteasome complexes. The basic framework of this approach and the MS/MS reference dataset generated should be valuable resources for the future development of new tools for the identification of linked peptides.
Histone Interaction Landscapes Visualized by Crosslinking Mass Spectrometry in Intact Cell Nuclei.
Fasci D, Ingen H, Scheltema R, Heck A Mol Cell Proteomics. 2018; 17(10):2018-2033.
PMID: 30021884 PMC: 6166682. DOI: 10.1074/mcp.RA118.000924.
Lu S, Cao Y, Fan S, Chen Z, Fang R, He S Biophys Rep. 2018; 4(2):68-81.
PMID: 29756007 PMC: 5937861. DOI: 10.1007/s41048-018-0050-6.
Cardiovascular proteomics in the era of big data: experimental and computational advances.
Lam M, Lau E, Ng D, Wang D, Ping P Clin Proteomics. 2016; 13:23.
PMID: 27980500 PMC: 5137214. DOI: 10.1186/s12014-016-9124-y.
Protein Structural Analysis via Mass Spectrometry-Based Proteomics.
Artigues A, Nadeau O, Rimmer M, Villar M, Du X, Fenton A Adv Exp Med Biol. 2016; 919:397-431.
PMID: 27975228 PMC: 5271599. DOI: 10.1007/978-3-319-41448-5_19.
Sarpe V, Rafiei A, Hepburn M, Ostan N, Schryvers A, Schriemer D Mol Cell Proteomics. 2016; 15(9):3071-80.
PMID: 27412762 PMC: 5013318. DOI: 10.1074/mcp.O116.058685.