» Articles » PMID: 28960077

Enhanced Missing Proteins Detection in NCI60 Cell Lines Using an Integrative Search Engine Approach

Abstract

The Human Proteome Project (HPP) aims deciphering the complete map of the human proteome. In the past few years, significant efforts of the HPP teams have been dedicated to the experimental detection of the missing proteins, which lack reliable mass spectrometry evidence of their existence. In this endeavor, an in depth analysis of shotgun experiments might represent a valuable resource to select a biological matrix in design validation experiments. In this work, we used all the proteomic experiments from the NCI60 cell lines and applied an integrative approach based on the results obtained from Comet, Mascot, OMSSA, and X!Tandem. This workflow benefits from the complementarity of these search engines to increase the proteome coverage. Five missing proteins C-HPP guidelines compliant were identified, although further validation is needed. Moreover, 165 missing proteins were detected with only one unique peptide, and their functional analysis supported their participation in cellular pathways as was also proposed in other studies. Finally, we performed a combined analysis of the gene expression levels and the proteomic identifications from the common cell lines between the NCI60 and the CCLE project to suggest alternatives for further validation of missing protein observations.

Citing Articles

DeepPD: A Deep Learning Method for Predicting Peptide Detectability Based on Multi-feature Representation and Information Bottleneck.

Li F, Bin Y, Zhao J, Zheng C Interdiscip Sci. 2024; 17(1):200-214.

PMID: 39661307 DOI: 10.1007/s12539-024-00665-4.


PowerNovo: de novo peptide sequencing via tandem mass spectrometry using an ensemble of transformer and BERT models.

Petrovskiy D, Nikolsky K, Kulikova L, Rudnev V, Butkova T, Malsagova K Sci Rep. 2024; 14(1):15000.

PMID: 38951578 PMC: 11217302. DOI: 10.1038/s41598-024-65861-0.


DbyDeep: Exploration of MS-Detectable Peptides via Deep Learning.

Son J, Na S, Paek E Anal Chem. 2023; 95(30):11193-11200.

PMID: 37459568 PMC: 10401496. DOI: 10.1021/acs.analchem.3c00460.


PD-BertEDL: An Ensemble Deep Learning Method Using BERT and Multivariate Representation to Predict Peptide Detectability.

Wang H, Wang J, Feng Z, Li Y, Zhao H Int J Mol Sci. 2022; 23(20).

PMID: 36293242 PMC: 9604182. DOI: 10.3390/ijms232012385.


Prediction of Peptide Detectability Based on CapsNet and Convolutional Block Attention Module.

Yu M, Duan Y, Li Z, Zhang Y Int J Mol Sci. 2021; 22(21).

PMID: 34769509 PMC: 8584443. DOI: 10.3390/ijms222112080.

References
1.
Schaeffer M, Gateau A, Teixeira D, Michel P, Zahn-Zabal M, Lane L . The neXtProt peptide uniqueness checker: a tool for the proteomics community. Bioinformatics. 2017; 33(21):3471-3472. PMC: 5860159. DOI: 10.1093/bioinformatics/btx318. View

2.
Geer L, Markey S, Kowalak J, Wagner L, Xu M, Maynard D . Open mass spectrometry search algorithm. J Proteome Res. 2004; 3(5):958-64. DOI: 10.1021/pr0499491. View

3.
Audain E, Uszkoreit J, Sachsenberg T, Pfeuffer J, Liang X, Hermjakob H . In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics. J Proteomics. 2016; 150:170-182. DOI: 10.1016/j.jprot.2016.08.002. View

4.
Janssen J, Vaandrager J, Heuser T, Jauch A, Kluin P, Geelen E . Concurrent activation of a novel putative transforming gene, myeov, and cyclin D1 in a subset of multiple myeloma cell lines with t(11;14)(q13;q32). Blood. 2001; 95(8):2691-8. View

5.
Paik Y, Hancock W . Uniting ENCODE with genome-wide proteomics. Nat Biotechnol. 2012; 30(11):1065-7. DOI: 10.1038/nbt.2416. View