» Articles » PMID: 35462533

SNARER: New Molecular Descriptors for SNARE Proteins Classification

Overview
Publisher Biomed Central
Specialty Biology
Date 2022 Apr 25
PMID 35462533
Authors
Affiliations
Soon will be listed here.
Abstract

Background: SNARE proteins play an important role in different biological functions. This study aims to investigate the contribution of a new class of molecular descriptors (called SNARER) related to the chemical-physical properties of proteins in order to evaluate the performance of binary classifiers for SNARE proteins.

Results: We constructed a SNARE proteins balanced dataset, D128, and an unbalanced one, DUNI, on which we tested and compared the performance of the new descriptors presented here in combination with the feature sets (GAAC, CTDT, CKSAAP and 188D) already present in the literature. The machine learning algorithms used were Random Forest, k-Nearest Neighbors and AdaBoost and oversampling and subsampling techniques were applied to the unbalanced dataset. The addition of the SNARER descriptors increases the precision for all considered ML algorithms. In particular, on the unbalanced DUNI dataset the accuracy increases in parallel with the increase in sensitivity while on the balanced dataset D128 the accuracy increases compared to the counterpart without the addition of SNARER descriptors, with a strong improvement in specificity. Our best result is the combination of our descriptors SNARER with CKSAAP feature on the dataset D128 with 92.3% of accuracy, 90.1% for sensitivity and 95% for specificity with the RF algorithm.

Conclusions: The performed analysis has shown how the introduction of molecular descriptors linked to the chemical-physical and structural characteristics of the proteins can improve the classification performance. Additionally, it was pointed out that performance can change based on using a balanced or unbalanced dataset. The balanced nature of training can significantly improve forecast accuracy.

Citing Articles

A Deep Learning and PSSM Profile Approach for Accurate SNARE Protein Prediction.

Kha Q, Nguyen H, Le N Methods Mol Biol. 2025; 2887():79-89.

PMID: 39806147 DOI: 10.1007/978-1-0716-4314-3_5.


Towards generative digital twins in biomedical research.

Wu J, Koelzer V Comput Struct Biotechnol J. 2024; 23:3481-3488.

PMID: 39435342 PMC: 11491725. DOI: 10.1016/j.csbj.2024.09.030.


Refactoring and performance analysis of the main CNN architectures: using false negative rate minimization to solve the clinical images melanoma detection problem.

Di Biasi L, De Marco F, Auriemma Citarella A, Castrillon-Santana M, Barra P, Tortora G BMC Bioinformatics. 2023; 24(1):386.

PMID: 37821815 PMC: 10568761. DOI: 10.1186/s12859-023-05516-5.


Machine Learning Approaches in Diagnosis, Prognosis and Treatment Selection of Cardiac Amyloidosis.

Allegra A, Mirabile G, Tonacci A, Genovese S, Pioggia G, Gangemi S Int J Mol Sci. 2023; 24(6).

PMID: 36982754 PMC: 10051237. DOI: 10.3390/ijms24065680.


ENTAIL: yEt aNoTher amyloid fIbrils cLassifier.

Auriemma Citarella A, Di Biasi L, De Marco F, Tortora G BMC Bioinformatics. 2022; 23(1):517.

PMID: 36456900 PMC: 9714056. DOI: 10.1186/s12859-022-05070-6.

References
1.
Reuter J, Spacek D, Snyder M . High-throughput sequencing technologies. Mol Cell. 2015; 58(4):586-97. PMC: 4494749. DOI: 10.1016/j.molcel.2015.05.004. View

2.
Liu X, Gong X, Yu H, Xu J . A Model Stacking Framework for Identifying DNA Binding Proteins by Orchestrating Multi-View Features and Classifiers. Genes (Basel). 2018; 9(8). PMC: 6116045. DOI: 10.3390/genes9080394. View

3.
Yang X, Kaeser-Woo Y, Pang Z, Xu W, Sudhof T . Complexin clamps asynchronous release by blocking a secondary Ca(2+) sensor via its accessory α helix. Neuron. 2010; 68(5):907-20. PMC: 3050570. DOI: 10.1016/j.neuron.2010.11.001. View

4.
Chen K, Kurgan L, Ruan J . Prediction of flexible/rigid regions from protein sequences using k-spaced amino acid pairs. BMC Struct Biol. 2007; 7:25. PMC: 1863424. DOI: 10.1186/1472-6807-7-25. View

5.
Chen F, Chen H, Chen Y, Wei W, Sun Y, Zhang L . Dysfunction of the SNARE complex in neurological and psychiatric disorders. Pharmacol Res. 2021; 165:105469. DOI: 10.1016/j.phrs.2021.105469. View