MUDE: a New Approach for Optimizing Sensitivity in the Target-decoy Search Strategy for Large-scale Peptide/protein Identification
Overview
Affiliations
The target-decoy search strategy has been successfully applied in shotgun proteomics for validating peptide and protein identifications. If, on one hand, this method has proven to be very efficient for error estimation, on the other hand, little attention has been paid to the resulting sensitivity. Only two scores are normally used and thresholds are explored in a very simplistic way. In this work, a multivariate decoy analysis is described, where many quality parameters are considered. This analysis is treated in our approach as an optimization problem for sensitivity maximization. Furthermore, an efficient heuristic is proposed to solve this problem. Experiments comparing our method, termed MUDE (multivariate decoy database analysis), with traditional bivariate decoy analysis and with Peptide/ProteinProphet showed that our procedure significantly enhances the retrieved number of identifications when comparing the same false discovery rates. Particularly for phosphopeptide/protein identifications, we could demonstrate more than a two-fold increase in sensitivity compared with the Trans-Proteomic Pipeline tools.
Cerqueira F, Vasconcelos A Database (Oxford). 2020; 2020.
PMID: 33206960 PMC: 7673341. DOI: 10.1093/database/baaa067.
Controlling the FDR in imperfect matches to an incomplete database.
Keich U, Noble W J Am Stat Assoc. 2018; 113(523):973-982.
PMID: 30546175 PMC: 6287756. DOI: 10.1080/01621459.2017.1375931.
Keich U, Noble W Res Comput Mol Biol. 2018; 10229:99-116.
PMID: 29326989 PMC: 5758044. DOI: 10.1007/978-3-319-56970-3_7.
Ribeiro Cerqueira F, Ricardo A, de Paiva Oliveira A, Graber A, Baumgartner C BMC Bioinformatics. 2017; 17(Suppl 18):472.
PMID: 28105913 PMC: 5249030. DOI: 10.1186/s12859-016-1341-x.
MUMAL: multivariate analysis in shotgun proteomics using machine learning techniques.
Cerqueira F, Ferreira R, Oliveira A, Gomes A, Ramos H, Graber A BMC Genomics. 2012; 13 Suppl 5:S4.
PMID: 23095859 PMC: 3477001. DOI: 10.1186/1471-2164-13-S5-S4.