» Articles » PMID: 31319963

Identification of Clathrin Proteins by Incorporating Hyperparameter Optimization in Deep Learning and PSSM Profiles

Overview
Date 2019 Jul 20
PMID 31319963
Citations 24
Authors
Affiliations
Soon will be listed here.
Abstract

Background And Objectives: Clathrin is an adaptor protein that serves as the principal element of the vesicle-coating complex and is important for the membrane cleavage to dispense the invaginated vesicle from the plasma membrane. The functional loss of clathrins has been tied to a lot of human diseases, i.e., neurodegenerative disorders, cancer, Alzheimer's diseases, and so on. Therefore, creating a precise model to identify its functions is a crucial step towards understanding human diseases and designing drug targets.

Methods: We present a deep learning model using a two-dimensional convolutional neural network (CNN) and position-specific scoring matrix (PSSM) profiles to identify clathrin proteins from high throughput sequences. Traditionally, the 2D CNNs take images as an input so we treated the PSSM profile with a 20 × 20 matrix as an image of 20 × 20 pixels. The input PSSM profile was then connected to our 2D CNN in which we set a variety of parameters to improve the performance of the model. Based on the 10-fold cross-validation results, hyper-parameter optimization process was employed to find the best model for our dataset. Finally, an independent dataset was used to assess the predictive ability of the current model.

Results: Our model could identify clathrin proteins with sensitivity of 92.2%, specificity of 91.2%, accuracy of 91.8%, and MCC of 0.83 in the independent dataset. Compared to state-of-the-art traditional neural networks, our method achieved a significant improvement in all typical measurement metrics.

Conclusions: Throughout the proposed study, we provide an effective tool for investigating clathrin proteins and our achievement could promote the use of deep learning in biomedical research. We also provide source codes and dataset freely at https://www.github.com/khanhlee/deep-clathrin/.

Citing Articles

TargetCLP: clathrin proteins prediction combining transformed and evolutionary scale modeling-based multi-view features via weighted feature integration approach.

Ullah M, Akbar S, Raza A, Khan K, Zou Q Brief Bioinform. 2025; 26(1.

PMID: 39844339 PMC: 11753890. DOI: 10.1093/bib/bbaf026.


A BERT-based approach for identifying anti-inflammatory peptides using sequence information.

Xu T, Wang Q, Yang Z, Ying J Heliyon. 2024; 10(12):e32951.

PMID: 38988537 PMC: 11234020. DOI: 10.1016/j.heliyon.2024.e32951.


Pre-trained protein language model sheds new light on the prediction of Arabidopsis protein-protein interactions.

Zhou K, Lei C, Zheng J, Huang Y, Zhang Z Plant Methods. 2023; 19(1):141.

PMID: 38062445 PMC: 10704805. DOI: 10.1186/s13007-023-01119-6.


Optimizing Hyperparameter Tuning in Machine Learning to Improve the Predictive Performance of Cross-Species N6-Methyladenosine Sites.

Le N, Xu L ACS Omega. 2023; 8(42):39420-39426.

PMID: 37901522 PMC: 10600906. DOI: 10.1021/acsomega.3c05074.


AbAgIntPre: A deep learning method for predicting antibody-antigen interactions based on sequence information.

Huang Y, Zhang Z, Zhou Y Front Immunol. 2023; 13:1053617.

PMID: 36618397 PMC: 9813736. DOI: 10.3389/fimmu.2022.1053617.