A Combined Transmembrane Topology and Signal Peptide Prediction Method
Overview
Molecular Biology
Affiliations
An inherent problem in transmembrane protein topology prediction and signal peptide prediction is the high similarity between the hydrophobic regions of a transmembrane helix and that of a signal peptide, leading to cross-reaction between the two types of predictions. To improve predictions further, it is therefore important to make a predictor that aims to discriminate between the two classes. In addition, topology information can be gained when successfully predicting a signal peptide leading a transmembrane protein since it dictates that the N terminus of the mature protein must be on the non-cytoplasmic side of the membrane. Here, we present Phobius, a combined transmembrane protein topology and signal peptide predictor. The predictor is based on a hidden Markov model (HMM) that models the different sequence regions of a signal peptide and the different regions of a transmembrane protein in a series of interconnected states. Training was done on a newly assembled and curated dataset. Compared to TMHMM and SignalP, errors coming from cross-prediction between transmembrane segments and signal peptides were reduced substantially by Phobius. False classifications of signal peptides were reduced from 26.1% to 3.9% and false classifications of transmembrane helices were reduced from 19.0% to 7.7%. Phobius was applied to the proteomes of Homo sapiens and Escherichia coli. Here we also noted a drastic reduction of false classifications compared to TMHMM/SignalP, suggesting that Phobius is well suited for whole-genome annotation of signal peptides and transmembrane regions. The method is available at as well as at
Hill R, Grey M, Fedi M, Smith D, Canning G, Ward S BMC Genomics. 2025; 26(1):239.
PMID: 40075289 PMC: 11905480. DOI: 10.1186/s12864-025-11432-0.
Schwarz E, Noon J, Chicca J, Garceau C, Li H, Antoshechkin I bioRxiv. 2025; .
PMID: 39975173 PMC: 11838427. DOI: 10.1101/2025.02.01.636063.
Dispensable genome and segmental duplications drive the genome plasticity in .
Navasca A, Singh J, Rivera-Varas V, Gill U, Secor G, Baldwin T Front Fungal Biol. 2025; 6:1432339.
PMID: 39974207 PMC: 11835900. DOI: 10.3389/ffunb.2025.1432339.
Meitil I, de O G Silva C, Pedersen A, Agger J iScience. 2025; 28(2):111771.
PMID: 39925425 PMC: 11803259. DOI: 10.1016/j.isci.2025.111771.
Vasistha P, Singh P, Srivastava D, Johny L, Shukla S BMC Genomics. 2025; 26(1):42.
PMID: 39819563 PMC: 11736945. DOI: 10.1186/s12864-024-10918-7.