An Account of in Silico Identification Tools of Secreted Effector Proteins in Bacteria and Future Challenges
Overview
Authors
Affiliations
Bacterial pathogens secrete numerous effector proteins via six secretion systems, type I to type VI secretion systems, to adapt to new environments or to promote virulence by bacterium-host interactions. Many computational approaches have been used in the identification of effector proteins before the subsequent experimental verification because they tolerate laborious biological procedures and are genome scale, automated and highly efficient. Prevalent examples include machine learning methods and statistical techniques. In this article, we summarize the computational progress toward predicting secreted effector proteins in bacteria, with an opening of an introduction of features that are used to discriminate effectors from non-effectors. The mechanism, contribution and deficiency of previous developed detection tools are presented, which are further benchmarked based on a curated testing data set. According to the results of benchmarking, potential improvements of the prediction performance are discussed, which include (1) more informative features for discriminating the effectors from non-effectors; (2) the construction of comprehensive training data set of the machine learning algorithms; (3) the advancement of reliable prediction methods and (4) a better interpretation of the mechanisms behind the molecular processes. The future of in silico identification of bacterial secreted effectors includes both opportunities and challenges.
Peng Y, Wu J, Sun Y, Zhang Y, Wang Q, Shao S Nat Commun. 2025; 16(1):1299.
PMID: 39900608 PMC: 11791096. DOI: 10.1038/s41467-025-56526-1.
A deep learning method to predict bacterial ADP-ribosyltransferase toxins.
Zheng D, Zhou S, Chen L, Pang G, Yang J Bioinformatics. 2024; 40(7).
PMID: 38885365 PMC: 11219481. DOI: 10.1093/bioinformatics/btae378.
Use of Bastion for the Identification of Secreted Substrates.
Wang J, Li J, Stubenrauch C Methods Mol Biol. 2023; 2715:519-531.
PMID: 37930548 DOI: 10.1007/978-1-0716-3445-5_31.
Jimenez-Guerrero I, Lopez-Baena F, Medina C Plants (Basel). 2023; 12(11).
PMID: 37299112 PMC: 10255152. DOI: 10.3390/plants12112133.
Computational prediction of secreted proteins in gram-negative bacteria.
Hui X, Chen Z, Zhang J, Lu M, Cai X, Deng Y Comput Struct Biotechnol J. 2021; 19:1806-1828.
PMID: 33897982 PMC: 8047123. DOI: 10.1016/j.csbj.2021.03.019.