
Predicting Amyloid Proteins Using Attention-based Long Short-term Memory

Overview
Date 2025 Mar 10
PMID 40062260
Abstract

Alzheimer's disease (AD) is a genetically inherited neurodegenerative disorder that mostly occurs in old age. In its late stage it is characterized by severe memory impairment that affects cognitive function and daily living. Reliable evidence links the worsening of AD symptoms to the accumulation of amyloid proteins. Densely accumulated amyloid proteins form insoluble fibrillar structures that cause significant pathological effects in various tissues. Understanding the mechanisms of amyloid proteins and identifying these proteins at an early stage are essential for treating AD as well as other prevalent amyloid-related diseases. Although several recently proposed machine learning methods for amyloid protein identification have shown promising results, most have not yet fully exploited the sequence information of amyloid proteins. In this study, we develop a computational model for identifying amyloid proteins using bidirectional long short-term memory in combination with an attention mechanism. In the testing phase, the model developed with our proposed method outperformed those developed with state-of-the-art methods, achieving an area under the receiver operating characteristic curve of 0.9126.
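The abstract does not state how the attention mechanism is formulated, so the following is only a minimal sketch of one common choice: dot-product attention pooling over per-residue hidden states. The toy vectors stand in for the outputs of a bidirectional LSTM, and the names `attention_pool`, `hidden`, and `query` are hypothetical, introduced here for illustration rather than taken from the study.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(hidden_states, query):
    """Collapse per-residue hidden states into one fixed-length vector.

    hidden_states: list of T vectors (each a list of d floats); an assumed
                   stand-in for concatenated forward/backward LSTM outputs.
    query:         hypothetical learned context vector of length d.
    Returns (weights, context); the weights sum to 1 over the T positions.
    """
    # Score each position by its dot product with the query vector.
    scores = [sum(h_j * q_j for h_j, q_j in zip(h, query)) for h in hidden_states]
    weights = softmax(scores)
    # The context vector is the attention-weighted sum of the hidden states.
    dim = len(query)
    context = [
        sum(w * h[j] for w, h in zip(weights, hidden_states)) for j in range(dim)
    ]
    return weights, context


# Toy example: three sequence positions, two hidden dimensions.
hidden = [[0.1, 0.0], [2.0, 1.0], [0.3, -0.2]]
query = [1.0, 1.0]
weights, context = attention_pool(hidden, query)
print("attention weights:", weights)
print("context vector:", context)
```

In a full model, the context vector would typically feed a small classifier that outputs the probability that a sequence is an amyloid protein. The weights also indicate which positions contributed most to that prediction.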
