» Articles » PMID: 36742312

Prediction of Celiac Disease Associated Epitopes and Motifs in a Protein

Overview
Journal Front Immunol
Date 2023 Feb 6
PMID 36742312
Authors
Affiliations
Soon will be listed here.
Abstract

Introduction: Celiac disease (CD) is an autoimmune gastrointestinal disorder causes immune-mediated enteropathy against gluten. Gluten immunogenic peptides have the potential to trigger immune responses which leads to damage the small intestine. HLA-DQ2/DQ8 are major alleles that bind to epitope/antigenic region of gluten and induce celiac disease. There is a need to identify CD associated epitopes in protein-based foods and therapeutics.

Methods: In this study, computational tools have been developed to predict CD associated epitopes and motifs. Dataset used for training, testing and evaluation contain experimentally validated CD associated and non-CD associate peptides. We perform positional analysis to identify the most significant position of an amino acid residue in the peptide and checked the frequency of HLA alleles. We also compute amino acid composition to develop machine learning based models. We also developed ensemble method that combines motif-based approach and machine learning based models.

Results And Discussion: Our analysis support existing hypothesis that proline (P) and glutamine (Q) are highly abundant in CD associated peptides. A model based on density of P&Q in peptides has been developed for predicting CD associated peptides which achieve maximum AUROC 0.98 on independent data. We discovered motifs (e.g., QPF, QPQ, PYP) which occurs specifically in CD associated peptides. We also developed machine learning based models using peptide composition and achieved maximum AUROC 0.99. Finally, we developed ensemble method that combines motif-based approach and machine learning based models. The ensemble model-predict CD associated motifs with 100% accuracy on an independent dataset, not used for training. Finally, the best models and motifs has been integrated in a web server and standalone software package "CDpred". We hope this server anticipate the scientific community for the prediction, designing and scanning of CD associated peptides as well as CD associated motifs in a protein/peptide sequence (https://webs.iiitd.edu.in/raghava/cdpred/).

Citing Articles

The Role of Gluten in the Development of Autoimmune Thyroid Diseases: A Narrative Review.

Esfahani K, Asri N, Mahmoudi Ghehsareh M, Rezaei-Tavirani M, Jahani-Sherafat S, Rostami-Nejad M Int J Endocrinol Metab. 2025; 22(3):e153730.

PMID: 40065831 PMC: 11892518. DOI: 10.5812/ijem-153730.


Advancements in Computer-Aided Diagnosis of Celiac Disease: A Systematic Review.

Hartmann Tolic I, Habijan M, Galic I, Nyarko E Biomimetics (Basel). 2024; 9(8).

PMID: 39194472 PMC: 11351869. DOI: 10.3390/biomimetics9080493.


Classification of bioactive peptides: A systematic benchmark of models and encodings.

Bizzotto E, Zampieri G, Treu L, Filannino P, Cagno R, Campanaro S Comput Struct Biotechnol J. 2024; 23:2442-2452.

PMID: 38867723 PMC: 11168199. DOI: 10.1016/j.csbj.2024.05.040.


Application of artificial intelligence approaches to predict the metabolism of xenobiotic molecules by human gut microbiome.

Malwe A, Sharma V Front Microbiol. 2023; 14:1254073.

PMID: 38116528 PMC: 10728657. DOI: 10.3389/fmicb.2023.1254073.


Advances in Understanding the Human Gut Microbiota and Its Implication in Pediatric Celiac Disease-A Narrative Review.

Lupu V, Trandafir L, Adam Raileanu A, Mihai C, Morariu I, Starcea I Nutrients. 2023; 15(11).

PMID: 37299462 PMC: 10255898. DOI: 10.3390/nu15112499.

References
1.
Caio G, Volta U, Sapone A, Leffler D, De Giorgio R, Catassi C . Celiac disease: a comprehensive current review. BMC Med. 2019; 17(1):142. PMC: 6647104. DOI: 10.1186/s12916-019-1380-z. View

2.
Ciccocioppo R, di Sabatino A, Corazza G . The immune recognition of gluten in coeliac disease. Clin Exp Immunol. 2005; 140(3):408-16. PMC: 1809391. DOI: 10.1111/j.1365-2249.2005.02783.x. View

3.
Kumar J, Kumar M, Pandey R, Chauhan N . Physiopathology and Management of Gluten-Induced Celiac Disease. J Food Sci. 2017; 82(2):270-277. DOI: 10.1111/1750-3841.13612. View

4.
Shewry P, Halford N . Cereal seed storage proteins: structures, properties and role in grain utilization. J Exp Bot. 2002; 53(370):947-58. DOI: 10.1093/jexbot/53.370.947. View

5.
Patiyal S, Dhall A, Raghava G . A deep learning-based method for the prediction of DNA interacting residues in a protein. Brief Bioinform. 2022; 23(5). DOI: 10.1093/bib/bbac322. View