» Articles » PMID: 30014462

Development of a Novel Clustering Tool for Linear Peptide Sequences

Overview
Journal Immunology
Date 2018 Jul 18
PMID 30014462
Citations 49
Authors
Affiliations
Soon will be listed here.
Abstract

Epitopes identified in large-scale screens of overlapping peptides often share significant levels of sequence identity, complicating the analysis of epitope-related data. Clustering algorithms are often used to facilitate these analyses, but available methods are generally insufficient in their capacity to define biologically meaningful epitope clusters in the context of the immune response. To fulfil this need we developed an algorithm that generates epitope clusters based on representative or consensus sequences. This tool allows the user to cluster peptide sequences on the basis of a specified level of identity by selecting among three different method options. These include the 'clique method', in which all members of the cluster must share the same minimal level of identity with each other, and the 'connected graph method', in which all members of a cluster must share a defined level of identity with at least one other member of the cluster. In cases where it is not possible to define a clear consensus sequence with the connected graph method, a third option provides a novel 'cluster-breaking algorithm' for consensus sequence driven sub-clustering. Herein we demonstrate the tool's clustering performance and applicability using (i) a selection of dengue virus epitopes for the 'clique method', (ii) sets of allergen-derived peptides from related species for the 'connected graph method' and (iii) large data sets of eluted ligand, major histocompatibility complex binding and T-cell recognition data captured within the Immune Epitope Database (IEDB) with the newly developed 'cluster-breaking algorithm'. This novel clustering tool is accessible at http://tools.iedb.org/cluster2/.

Citing Articles

Computational modelling of a multiepitope vaccine targeting glycoprotein-D for herpes simplex virus 2 (HSV-2): an immunoinformatic analysis.

Khan M, Shakya M, Verma C Mol Divers. 2025; .

PMID: 40057939 DOI: 10.1007/s11030-025-11148-z.


Design of multi-epitope-based therapeutic vaccine candidates from HBc and HBx proteins of hepatitis B virus using reverse vaccinology and immunoinformatics approaches.

Naully P, Tan M, Nugrahapraja H, Artarini A, Aditama R, Giri-Rachman E PLoS One. 2024; 19(12):e0313269.

PMID: 39642099 PMC: 11623480. DOI: 10.1371/journal.pone.0313269.


Peptide clustering enhances large-scale analyses and reveals proteolytic signatures in mass spectrometry data.

Hartman E, Forsberg F, Kjellstrom S, Petrlova J, Luo C, Scott A Nat Commun. 2024; 15(1):7128.

PMID: 39164298 PMC: 11336174. DOI: 10.1038/s41467-024-51589-y.


Predicting immune response targets in orthoflaviviruses through sequence homology and computational analysis.

Are V, Roy R, Dhanda S, Neema S, Sahu N, Adithya N J Mol Model. 2024; 30(8):295.

PMID: 39083139 DOI: 10.1007/s00894-024-06088-8.


Comparative membrane proteomic analysis of Tritrichomonas foetus isolates.

Rivero M, Alonso A, Abdala M, Luque M, Carranza P, Coceres V Sci Rep. 2024; 14(1):17033.

PMID: 39043862 PMC: 11266394. DOI: 10.1038/s41598-024-67827-8.


References
1.
Agrawal P, Bhalla S, Usmani S, Singh S, Chaudhary K, Raghava G . CPPsite 2.0: a repository of experimentally validated cell-penetrating peptides. Nucleic Acids Res. 2015; 44(D1):D1098-103. PMC: 4702894. DOI: 10.1093/nar/gkv1266. View

2.
Sievers F, Wilm A, Dineen D, Gibson T, Karplus K, Li W . Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol. 2011; 7:539. PMC: 3261699. DOI: 10.1038/msb.2011.75. View

3.
Carrasco Pro S, Sidney J, Paul S, Arlehamn C, Weiskopf D, Peters B . Automatic Generation of Validated Specific Epitope Sets. J Immunol Res. 2015; 2015:763461. PMC: 4629045. DOI: 10.1155/2015/763461. View

4.
Fikes J, Sette A . Design of multi-epitope, analogue-based cancer vaccines. Expert Opin Biol Ther. 2003; 3(6):985-93. DOI: 10.1517/14712598.3.6.985. View

5.
Paradela A, Alvarez I, Sesma L, Ramos M, Vazquez J, de Castro J . Limited diversity of peptides related to an alloreactive T cell epitope in the HLA-B27-bound peptide repertoire results from restrictions at multiple steps along the processing-loading pathway. J Immunol. 1999; 164(1):329-37. DOI: 10.4049/jimmunol.164.1.329. View