ANuPP: A Versatile Tool to Predict Aggregation Nucleating Regions in Peptides and Proteins

Overview

Journal J Mol Biol

Publisher Elsevier

Specialties Microbiology, Molecular Biology

Date 2021 May 11

PMID 33972019

Citations 21

Authors

R Prabakaran

Puneet Rawat

Sandeep Kumar

M Michael Gromiha

Abstract

Short aggregation-prone sequence motifs can trigger aggregation in peptide and protein sequences. Most algorithms developed so far to identify potential aggregation-prone regions (APRs) use amino acid residue composition and/or sequence pattern features. In this work, we have investigated the importance of atomic-level rather than residue-level characteristics for understanding the initiation of aggregation in proteins and peptides. Using atomic-level features, an ensemble classifier, ANuPP, has been developed to predict the aggregation-nucleating regions in peptides and proteins. On a dataset of 1279 hexapeptides, ANuPP achieved an area under the curve (AUC) of 0.831 with 77% accuracy on 10-fold cross-validation, and an AUC of 0.883 with 83% accuracy on a blind test dataset of 142 hexapeptides. Further, it showed an average segment overlap (SOV) score of 48.7% in identifying APRs in 37 proteins. The performance of ANuPP is better than that of other methods reported in the literature on both amyloidogenic hexapeptide prediction and APR identification. We have developed a web server for ANuPP, available at https://web.iitm.ac.in/bioinfo2/ANuPP/. Insights gained from this work demonstrate the importance of atomic and functional-group characteristics in understanding the diverse atomic-level origins and mechanisms of protein aggregation.
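
For readers unfamiliar with the two headline metrics in the abstract, the following is a minimal sketch of how an AUC and an accuracy value can be computed from classifier scores. It is not ANuPP's code: the function name `auc_and_accuracy`, the 0.5 decision threshold, and the toy labels and scores are all assumptions made for illustration, and none of the numbers come from the paper.

```python
def auc_and_accuracy(labels, scores, threshold=0.5):
    """Return (AUC, accuracy) for binary labels (1 = amyloidogenic) and scores.

    Hypothetical helper for illustration; the 0.5 threshold is an assumption.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")

    # AUC as the probability that a randomly chosen positive outscores a
    # randomly chosen negative; ties count as half (Mann-Whitney formulation).
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    auc = wins / (len(pos) * len(neg))

    # Accuracy: fraction of examples whose thresholded score matches the label.
    correct = sum((s >= threshold) == (y == 1) for y, s in zip(labels, scores))
    return auc, correct / len(labels)


# Made-up labels and scores, not data from the ANuPP study.
toy_labels = [1, 1, 1, 0, 0, 0]
toy_scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.1]
auc, acc = auc_and_accuracy(toy_labels, toy_scores)
print(f"AUC={auc:.3f} accuracy={acc:.3f}")
```

AUC is independent of any decision threshold, whereas accuracy depends on the cutoff chosen, which is why the two are usually reported together.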

Citing Articles

Proteolysis-Based Biomarker Repertoire of the Neurofilament Proteome.

Petzold A. J Neurochem. 2025; 169(3):e70023.

PMID: 40066701 PMC: 11894590. DOI: 10.1111/jnc.70023.

Predicting amyloid proteins using attention-based long short-term memory.

Li Z. PeerJ Comput Sci. 2025; 11:e2660.

PMID: 40062260 PMC: 11888867. DOI: 10.7717/peerj-cs.2660.

AggNet: Advancing protein aggregation analysis through deep learning and protein language model.

He W, Xu X, Li H, Zhou J, Gao X. Protein Sci. 2025; 34(2):e70031.

PMID: 39840791 PMC: 11751882. DOI: 10.1002/pro.70031.

Prediction and Evaluation of Protein Aggregation with Computational Methods.

Hassan M, Shahzadi S, Li M, Kloczkowski A. Methods Mol Biol. 2024; 2867:299-314.

PMID: 39576588 DOI: 10.1007/978-1-0716-4196-5_17.

iAmyP: A Multi-view Learning for Amyloidogenic Hexapeptides Identification Based on Sequence Least Squares Programming.

Cai J, Zhao J, Bin Y, Xia J, Zheng C. Interdiscip Sci. 2024.

PMID: 39546159 DOI: 10.1007/s12539-024-00666-3.