
Unsupervised Mining of HLA-I Peptidomes Reveals New Binding Motifs and Potential False Positives in the Community Database

Overview
Journal Front Immunol
Date 2022 Apr 7
PMID 35386688
Abstract

Modern vaccine designs and studies of human leukocyte antigen (HLA)-mediated immune responses rely heavily on knowledge of HLA allele-specific binding motifs and on computational prediction of HLA-peptide binding affinity. Breakthroughs in HLA peptidomics have considerably expanded the databases of natural HLA ligands and enabled detailed characterization of HLA-peptide binding specificity. However, caution is needed when analyzing HLA peptidomics data, because identified peptides may be mass spectrometry contaminants or may bind only weakly to the HLA molecules. Here, a hybrid peptide sequencing approach was applied to large-scale mono-allelic HLA peptidomics datasets to uncover new ligands and refine current knowledge of HLA binding motifs. Up to 12-40% of the peptidomics data were peptides with low binding affinity and an arginine or a lysine at the C-terminus, which are likely tryptic peptide contaminants. Thousands of these peptides have been reported in a community database as legitimate ligands and might be erroneously used to train prediction models. Furthermore, unsupervised clustering of the identified ligands revealed additional binding motifs for several HLA class I alleles and effectively isolated outliers that were experimentally confirmed to be false positives. Overall, our findings expand the knowledge of HLA binding specificity and call for more rigorous interpretation of HLA peptidomics data to ensure the validity of community HLA ligandome databases.
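The contaminant criterion described in the abstract (low binding affinity combined with an arginine or lysine at the C-terminus) can be illustrated with a minimal Python sketch. This is not the authors' pipeline. The function name, the example peptides and their affinity values are hypothetical, and the 500 nM cut-off is an assumed, conventional weak-binder threshold rather than a value taken from the article. The predicted affinities are assumed to come from an external binding predictor.

```python
from typing import Dict, List

# Trypsin cleaves after lysine (K) or arginine (R), so tryptic peptides end in K/R.
TRYPTIC_C_TERMINI = {"K", "R"}


def flag_possible_tryptic_contaminants(
    predicted_affinity_nm: Dict[str, float],
    weak_binding_cutoff_nm: float = 500.0,  # assumed cut-off, not from the article
) -> List[str]:
    """Return peptides that end in K/R AND have weak predicted binding.

    predicted_affinity_nm maps peptide sequence -> predicted affinity in nM
    (higher value = weaker binding). Both conditions must hold for a flag.
    """
    flagged = []
    for peptide, affinity in predicted_affinity_nm.items():
        ends_tryptic = peptide[-1:].upper() in TRYPTIC_C_TERMINI
        weak_binder = affinity > weak_binding_cutoff_nm
        if ends_tryptic and weak_binder:
            flagged.append(peptide)
    return flagged


# Hypothetical peptides and affinities, for illustration only.
example = {
    "SLYNTVATL": 25.0,      # strong binder, no K/R at the C-terminus
    "AVDGSGTPK": 12000.0,   # weak binder ending in K: flagged
    "GLDVLTAKR": 8000.0,    # weak binder ending in R: flagged
    "KVAELVHFK": 40.0,      # ends in K but predicted to bind strongly: kept
}
flagged = flag_possible_tryptic_contaminants(example)
print(flagged)
```

Requiring both conditions matters because some HLA class I alleles (for example HLA-A*03:01 and HLA-A*11:01) genuinely prefer a basic C-terminal residue, so a K/R ending alone is not evidence of contamination.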

Citing Articles

MHCSeqNet2-improved peptide-class I MHC binding prediction for alleles with low data.

Wongklaew P, Sriswasdi S, Chuangsuwanich E Bioinformatics. 2023; 40(1).

PMID: 38152987 PMC: 10783953. DOI: 10.1093/bioinformatics/btad780.


A microfluidics-enabled automated workflow of sample preparation for MS-based immunopeptidomics.

Li X, Pak H, Huber F, Michaux J, Taillandier-Coindard M, Altimiras E Cell Rep Methods. 2023; 3(6):100479.

PMID: 37426762 PMC: 10326370. DOI: 10.1016/j.crmeth.2023.100479.


Improved predictions of antigen presentation and TCR recognition with MixMHCpred2.2 and PRIME2.0 reveal potent SARS-CoV-2 CD8 T-cell epitopes.

Gfeller D, Schmidt J, Croce G, Guillaume P, Bobisse S, Genolet R Cell Syst. 2023; 14(1):72-83.e5.

PMID: 36603583 DOI: 10.1016/j.cels.2022.12.002.
