Probability Model for Assessing Proteins Assembled from Peptide Sequences Inferred from Tandem Mass Spectrometry Data

Overview
Journal: Anal Chem
Specialty: Chemistry
Date: 2007 Apr 20
PMID: 17441689
Citations: 23
Abstract

In shotgun proteomics, tandem mass spectrometry is used to identify peptides derived from proteins. After the peptides are detected, proteins are reassembled via a reference database of protein or gene information. Redundancy and homology between protein records in databases make it challenging to assign peptides to proteins that may or may not be in an experimental sample. Here, a probability model is introduced for determining the likelihood that peptides are correctly assigned to proteins. This model derives consistent probability estimates for assembled proteins. The probability scores make it easier to confidently identify proteins in complex samples and to accurately estimate false-positive rates. The algorithm based on this model is robust in creating protein complements from peptides from bovine protein standards, yeast, Ustilago maydis cell lysates, and Arabidopsis thaliana leaves. It also eliminates the side effects of redundancy and homology from the reference databases by employing a new concept of peptide grouping and by coherently distinguishing distinct peptides from unique records and shared peptides from homologous proteins. The software that runs the algorithm, called PANORAMICS, provides a tool to help analyze the data based on a researcher's knowledge about the sample. The software operates efficiently and quickly compared to other software platforms.
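The distinction the abstract draws between distinct peptides (matching a single record or a set of redundant records) and shared peptides (matching several homologous proteins) can be illustrated with a small sketch. This is not the PANORAMICS algorithm or its probability model, neither of which is specified here. It is a generic illustration under stated assumptions: the function name `group_and_score`, the input layout, and the scoring rule (a naive independence formula, 1 minus the product of peptide error probabilities, using distinct peptides only) are all hypothetical.

```python
from collections import defaultdict
from math import prod


def group_and_score(peptide_hits):
    """Group protein records by peptide evidence and score each group.

    peptide_hits maps a peptide sequence to a tuple
    (peptide_probability, set_of_protein_record_ids).
    Hypothetical layout, chosen only for this illustration.
    """
    # Invert the mapping: protein record -> set of peptides matching it.
    protein_peptides = defaultdict(set)
    for peptide, (_, proteins) in peptide_hits.items():
        for protein in proteins:
            protein_peptides[protein].add(peptide)

    # Records with identical peptide evidence cannot be told apart,
    # so they are merged into one group (redundant database entries).
    groups = defaultdict(list)
    for protein, peptides in protein_peptides.items():
        groups[frozenset(peptides)].append(protein)

    results = []
    for peptides, proteins in groups.items():
        members = frozenset(proteins)
        # Distinct: every record the peptide matches lies inside this group.
        distinct = {p for p in peptides if peptide_hits[p][1] <= members}
        # Shared: the peptide also matches records outside this group.
        shared = set(peptides) - distinct
        # Assumption: only distinct peptides count as evidence, and peptide
        # identifications are treated as independent.
        probability = 1.0 - prod(1.0 - peptide_hits[p][0] for p in distinct)
        results.append({
            "proteins": sorted(proteins),
            "distinct": sorted(distinct),
            "shared": sorted(shared),
            "probability": probability,
        })
    return sorted(results, key=lambda r: r["proteins"])
```

Under this rule, a group supported only by shared peptides receives a probability of zero. That is a deliberately conservative choice for the sketch; a real model would typically apportion shared-peptide evidence among the candidate proteins.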

Citing Articles

Blistering1 Modulates Virulence Via Vesicle-mediated Protein Secretion.

Jurick 2nd W, Peng H, Beard H, Garrett W, Lichtner F, Luciano-Rosario D. Mol Cell Proteomics. 2019; 19(2):344-361.

PMID: 31871254 PMC: 7000123. DOI: 10.1074/mcp.RA119.001831.


Bayesian Hierarchical Model for Protein Identifications.

Mitra R, Gill R, Sikdar S, Datta S. J Appl Stat. 2019; 46(1):30-46.

PMID: 31105371 PMC: 6519717. DOI: 10.1080/02664763.2018.1454893.


Different Cellular Origins and Functions of Extracellular Proteins from Escherichia coli O157:H7 and O104:H4 as Determined by Comparative Proteomic Analysis.

Islam N, Nagy A, Garrett W, Shelton D, Cooper B, Nou X. Appl Environ Microbiol. 2016; 82(14):4371-4378.

PMID: 27208096 PMC: 4959213. DOI: 10.1128/AEM.00977-16.


Identification of Microorganisms by High Resolution Tandem Mass Spectrometry with Accurate Statistical Significance.

Alves G, Wang G, Ogurtsov A, Drake S, Gucek M, Suffredini A. J Am Soc Mass Spectrom. 2015; 27(2):194-210.

PMID: 26510657 PMC: 4723618. DOI: 10.1007/s13361-015-1271-2.


ProteinInferencer: Confident protein identification and multiple experiment comparison for large scale proteomics projects.

Zhang Y, Xu T, Shan B, Hart J, Aslanian A, Han X. J Proteomics. 2015; 129:25-32.

PMID: 26196237 PMC: 4630118. DOI: 10.1016/j.jprot.2015.07.006.