Derivation of an Amino Acid Similarity Matrix for Peptide: MHC Binding and Its Application As a Bayesian Prior
Overview
Affiliations
Background: Experts in peptide:MHC binding studies are often able to estimate the impact of a single residue substitution based on a heuristic understanding of amino acid similarity in an experimental context. Our aim is to quantify this measure of similarity to improve peptide:MHC binding prediction methods. This should help compensate for holes and bias in the sequence space coverage of existing peptide binding datasets.
Results: Here, a novel amino acid similarity matrix (PMBEC) is directly derived from the binding affinity data of combinatorial peptide mixtures. Like BLOSUM62, this matrix captures well-known physicochemical properties of amino acid residues. However, PMBEC differs markedly from existing matrices in cases where residue substitution involves a reversal of electrostatic charge. To demonstrate its usefulness, we have developed a new peptide:MHC class I binding prediction method, using the matrix as a Bayesian prior. We show that the new method can compensate for missing information on specific residues in the training data. We also carried out a large-scale benchmark, and its results indicate that prediction performance of the new method is comparable to that of the best neural network based approaches for peptide:MHC class I binding.
Conclusion: A novel amino acid similarity matrix has been derived for peptide:MHC binding interactions. One prominent feature of the matrix is that it disfavors substitution of residues with opposite charges. Given that the matrix was derived from experimentally determined peptide:MHC binding affinity measurements, this feature is likely shared by all peptide:protein interactions. In addition, we have demonstrated the usefulness of the matrix as a Bayesian prior in an improved scoring-matrix based peptide:MHC class I prediction method. A software implementation of the method is available at: http://www.mhc-pathway.net/smmpmbec.
Rahman M, Masum M, Parvin R, Das S, Talukder A PLoS One. 2025; 20(1):e0313559.
PMID: 39761277 PMC: 11703113. DOI: 10.1371/journal.pone.0313559.
Vargas-Montes M, Valencia-Jaramillo M, Valencia-Hernandez J, Gomez-Marin J, Arenas A, Cardona N Med Microbiol Immunol. 2024; 214(1):5.
PMID: 39738923 PMC: 11688256. DOI: 10.1007/s00430-024-00815-x.
Jiang D, Ma Z, Zhang J, Sun Y, Bai T, Liu R Vaccines (Basel). 2024; 12(11).
PMID: 39591116 PMC: 11598499. DOI: 10.3390/vaccines12111214.
Postovskaya A, Vercauteren K, Meysman P, Laukens K Brief Bioinform. 2024; 26(1).
PMID: 39576224 PMC: 11583439. DOI: 10.1093/bib/bbae602.
IMGT/RobustpMHC: robust training for class-I MHC peptide binding prediction.
Kushwaha A, Duroux P, Giudicelli V, Todorov K, Kossida S Brief Bioinform. 2024; 25(6).
PMID: 39504482 PMC: 11540059. DOI: 10.1093/bib/bbae552.