Accurate Prediction of Bacterial Type IV Secreted Effectors Using Amino Acid Composition and PSSM Profiles
Overview
Motivation: Various human pathogens secrete effector proteins into host cells via the type IV secretion system (T4SS). These proteins play important roles in the interaction between bacteria and hosts. Computational methods for T4SS effector prediction have been developed for screening experimental targets in several isolated bacterial species; however, widely applicable prediction approaches are still unavailable.
Results: In this work, four types of distinctive features, namely amino acid composition, dipeptide composition, position-specific scoring matrix (PSSM) composition and auto covariance transformation of the PSSM, were calculated from primary sequences. A classifier, T4EffPred, was developed using a support vector machine with these features and their different combinations for effector prediction. Various theoretical tests were performed on a newly established dataset, and the results were measured with four indexes. We demonstrated that T4EffPred can discriminate IVA and IVB effectors in benchmark datasets with positive rates of 76.7% and 89.7%, respectively. The overall accuracy of 95.9% shows that the present method is accurate for distinguishing T4SS effectors in unidentified sequences. A classifier ensemble was designed to synthesize all single classifiers. Notable performance improvement was observed using this ensemble system in benchmark tests. To demonstrate the model's application, a genome-scale prediction of effectors was performed in Bartonella henselae, an important zoonotic pathogen. A number of putative candidates were distinguished.
Availability: A web server implementing the prediction method and the source code are both available at http://bioinfo.tmmu.edu.cn/T4EffPred.
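Illustration: The abstract names the sequence-derived features but does not give their formulas. The following is a minimal sketch of three of them using their commonly used definitions, which may differ in detail from what T4EffPred implements. The function names, the treatment of non-standard residues, and the `max_lag` parameter are assumptions made for illustration; the PSSM is assumed to be supplied as a list of per-residue rows (for example, parsed from PSI-BLAST output), and PSSM generation itself is not shown.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues
DIPEPTIDES = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]  # 400 pairs


def amino_acid_composition(seq):
    """Return 20 residue frequencies (assumed definition: count / length)."""
    residues = [c for c in seq.upper() if c in AMINO_ACIDS]
    if not residues:
        raise ValueError("sequence contains no standard residues")
    total = len(residues)
    return [residues.count(a) / total for a in AMINO_ACIDS]


def dipeptide_composition(seq):
    """Return 400 adjacent-pair frequencies; pairs with non-standard residues are skipped."""
    seq = seq.upper()
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    for a, b in zip(seq, seq[1:]):
        pair = a + b
        if pair in counts:
            counts[pair] += 1
            total += 1
    if total == 0:
        raise ValueError("sequence contains no standard dipeptides")
    return [counts[d] / total for d in DIPEPTIDES]


def pssm_auto_covariance(pssm, max_lag):
    """Auto covariance of each PSSM column for lags 1..max_lag.

    Assumed definition: for column j and lag g,
    AC(j, g) = sum_i (P[i][j] - mean_j) * (P[i+g][j] - mean_j) / (L - g).
    """
    length = len(pssm)
    if length == 0 or max_lag >= length:
        raise ValueError("max_lag must be smaller than the number of PSSM rows")
    n_cols = len(pssm[0])
    means = [sum(row[j] for row in pssm) / length for j in range(n_cols)]
    features = []
    for lag in range(1, max_lag + 1):
        for j in range(n_cols):
            total = sum(
                (pssm[i][j] - means[j]) * (pssm[i + lag][j] - means[j])
                for i in range(length - lag)
            )
            features.append(total / (length - lag))
    return features
```

Each function returns a fixed-length vector regardless of protein length (20, 400, and `max_lag` times the number of PSSM columns, respectively), so the vectors can be concatenated and passed to a classifier such as a support vector machine.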