» Articles » PMID: 35211101

T1SEstacker: A Tri-Layer Stacking Model Effectively Predicts Bacterial Type 1 Secreted Proteins Based on C-Terminal Non-repeats-in-Toxin-Motif Sequence Features

Overview
Journal Front Microbiol
Specialty Microbiology
Date 2022 Feb 25
PMID 35211101
Authors
Affiliations
Soon will be listed here.
Abstract

Type 1 secretion systems play important roles in pathogenicity of Gram-negative bacteria. However, the substrate secretion mechanism remains largely unknown. In this research, we observed the sequence features of repeats-in-toxin (RTX) proteins, a major class of type 1 secreted effectors (T1SEs). We found striking non-RTX-motif amino acid composition patterns at the C termini, most typically exemplified by the enriched "[FLI][VAI]" at the most C-terminal two positions. Machine-learning models, including deep-learning ones, were trained using these sequence-based non-RTX-motif features and further combined into a tri-layer stacking model, T1SEstacker, which predicted the RTX proteins accurately, with a fivefold cross-validated sensitivity of ∼0.89 at the specificity of ∼0.94. Besides substrates with RTX motifs, T1SEstacker can also well distinguish non-RTX-motif T1SEs, further suggesting their potential existence of common secretion signals. T1SEstacker was applied to predict T1SEs from the genomes of representative strains, and we found that both the number and composition of T1SEs varied among strains. The number of T1SEs is estimated to reach 100 or more in each strain, much larger than what we expected. In summary, we made comprehensive sequence analysis on the type 1 secreted RTX proteins, identified common sequence-based features at the C termini, and developed a stacking model that can predict type 1 secreted proteins accurately.

Citing Articles

Protein Sorting Prediction.

Nielsen H Methods Mol Biol. 2023; 2715:27-63.

PMID: 37930519 DOI: 10.1007/978-1-0716-3445-5_2.


DeepSecE: A Deep-Learning-Based Framework for Multiclass Prediction of Secreted Proteins in Gram-Negative Bacteria.

Zhang Y, Guan J, Li C, Wang Z, Deng Z, Gasser R Research (Wash D C). 2023; 6:0258.

PMID: 37886621 PMC: 10599158. DOI: 10.34133/research.0258.


Comprehensive Genomic Analysis Reveals Extensive Diversity of Type I and Type IV Secretion Systems in Klebsiella pneumoniae.

Yang M, Zhou X, Bao Y, Zhang Y, Liu B, Gan L Curr Microbiol. 2023; 80(8):270.

PMID: 37402963 DOI: 10.1007/s00284-023-03362-5.

References
1.
Wang Y, Wei X, Bao H, Liu S . Prediction of bacterial type IV secreted effectors by C-terminal features. BMC Genomics. 2014; 15:50. PMC: 3915618. DOI: 10.1186/1471-2164-15-50. View

2.
Huang B, Troese M, Howe D, Ye S, Sims J, Heinzen R . Anaplasma phagocytophilum APH_0032 is expressed late during infection and localizes to the pathogen-occupied vacuolar membrane. Microb Pathog. 2010; 49(5):273-84. PMC: 2919654. DOI: 10.1016/j.micpath.2010.06.009. View

3.
Hui X, Chen Z, Lin M, Zhang J, Hu Y, Zeng Y . T3SEpp: an Integrated Prediction Pipeline for Bacterial Type III Secreted Effectors. mSystems. 2020; 5(4). PMC: 7406222. DOI: 10.1128/mSystems.00288-20. View

4.
Ryu J, Lee U, Park J, Yoo D, Ahn J . A vector system for ABC transporter-mediated secretion and purification of recombinant proteins in Pseudomonas species. Appl Environ Microbiol. 2014; 81(5):1744-53. PMC: 4325144. DOI: 10.1128/AEM.03514-14. View

5.
Welch R, Hull R, Falkow S . Molecular cloning and physical characterization of a chromosomal hemolysin from Escherichia coli. Infect Immun. 1983; 42(1):178-86. PMC: 264540. DOI: 10.1128/iai.42.1.178-186.1983. View