» Articles » PMID: 31492094

PTPD: Predicting Therapeutic Peptides by Deep Learning and Word2vec

Overview
Publisher Biomed Central
Specialty Biology
Date 2019 Sep 8
PMID 31492094
Citations 32
Authors
Affiliations
Soon will be listed here.
Abstract

*: Background In the search for therapeutic peptides for disease treatments, many efforts have been made to identify various functional peptides from large numbers of peptide sequence databases. In this paper, we propose an effective computational model that uses deep learning and word2vec to predict therapeutic peptides (PTPD). *: Results Representation vectors of all k-mers were obtained through word2vec based on k-mer co-existence information. The original peptide sequences were then divided into k-mers using the windowing method. The peptide sequences were mapped to the input layer by the embedding vector obtained by word2vec. Three types of filters in the convolutional layers, as well as dropout and max-pooling operations, were applied to construct feature maps. These feature maps were concatenated into a fully connected dense layer, and rectified linear units (ReLU) and dropout operations were included to avoid over-fitting of PTPD. The classification probabilities were generated by a sigmoid function. PTPD was then validated using two datasets: an independent anticancer peptide dataset and a virulent protein dataset, on which it achieved accuracies of 96% and 94%, respectively. *: Conclusions PTPD identified novel therapeutic peptides efficiently, and it is suitable for application as a useful tool in therapeutic peptide design.

Citing Articles

Unraveling diversity by isolating peptide sequences specific to distinct taxonomic groups.

Bochalis E, Patsakis M, Chantzi N, Mouratidis I, Chartoumpekis D, Georgakopoulos-Soares I bioRxiv. 2025; .

PMID: 39975352 PMC: 11839104. DOI: 10.1101/2025.02.05.636664.


pACP-HybDeep: predicting anticancer peptides using binary tree growth based transformer and structural feature encoding with deep-hybrid learning.

Shahid , Hayat M, Alghamdi W, Akbar S, Raza A, Kadir R Sci Rep. 2025; 15(1):565.

PMID: 39747941 PMC: 11695694. DOI: 10.1038/s41598-024-84146-0.


AttentionEP: Predicting essential proteins via fusion of multiscale features by attention mechanisms.

Wu C, Lin B, Zhang J, Gao R, Song R, Liu Z Comput Struct Biotechnol J. 2024; 23:4315-4323.

PMID: 39697678 PMC: 11652892. DOI: 10.1016/j.csbj.2024.11.039.


Contrastive learning for enhancing feature extraction in anticancer peptides.

Lee B, Shin D Brief Bioinform. 2024; 25(3).

PMID: 38725157 PMC: 11082072. DOI: 10.1093/bib/bbae220.


MA-PEP: A novel anticancer peptide prediction framework with multimodal feature fusion based on attention mechanism.

Liang X, Zhao H, Wang J Protein Sci. 2024; 33(4):e4966.

PMID: 38532681 PMC: 10966354. DOI: 10.1002/pro.4966.


References
1.
Kalyanaraman B, Joseph J, Kalivendi S, Wang S, Konorev E, Kotamraju S . Doxorubicin-induced apoptosis: implications in cardiotoxicity. Mol Cell Biochem. 2002; 234-235(1-2):119-24. View

2.
Garg A, Gupta D . VirulentPred: a SVM based prediction method for virulent proteins in bacterial pathogens. BMC Bioinformatics. 2008; 9:62. PMC: 2254373. DOI: 10.1186/1471-2105-9-62. View

3.
Saha S, Raghava G . Prediction of neurotoxins based on their function and source. In Silico Biol. 2008; 7(4-5):369-87. View

4.
Nanni L, Lumini A, Gupta D, Garg A . Identifying bacterial virulent proteins by fusing a set of classifiers based on variants of Chou's pseudo amino acid composition and on evolutionary information. IEEE/ACM Trans Comput Biol Bioinform. 2011; 9(2):467-75. DOI: 10.1109/TCBB.2011.117. View

5.
Compeau P, Pevzner P, Tesler G . How to apply de Bruijn graphs to genome assembly. Nat Biotechnol. 2011; 29(11):987-91. PMC: 5531759. DOI: 10.1038/nbt.2023. View