NLStradamus: a Simple Hidden Markov Model for Nuclear Localization Signal Prediction
Background: Nuclear localization signals (NLSs) are stretches of residues within a protein that are important for its regulated nuclear import. Of the many import pathways that exist in yeast, the best characterized is the 'classical' NLS pathway. The classical NLS contains specific patterns of basic residues, and computational methods have been designed to predict the location of these motifs in proteins. The consensus sequences, or patterns, for the other import pathways are less well understood.
Results: In this paper, we present an analysis of characterized NLSs in yeast and find that, despite the large number of nuclear import pathways, NLSs seem to show similar patterns of amino acid residues. We test current prediction methods and observe a low true positive rate. We therefore suggest an approach using hidden Markov models (HMMs) to predict novel NLSs in proteins. We show that our method consistently finds 37% of the NLSs with a low false positive rate, and that it retains its true positive rate outside of the yeast data set used to train its parameters.
Conclusion: Our implementation of this model, NLStradamus, is available at http://www.moseslab.csb.utoronto.ca/NLStradamus/.
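To make the HMM idea in the abstract concrete, the following is a minimal sketch of a two-state HMM (background and NLS) decoded with the Viterbi algorithm. It is not the NLStradamus implementation: the state names, the transition and start probabilities, and the emission model (lysine and arginine up-weighted in the NLS state) are illustrative assumptions chosen by hand, not parameters trained on the yeast data set, and the helper names `make_emissions`, `viterbi`, and `nls_segments` are hypothetical.

```python
import math

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues only


def make_emissions(enriched, boost):
    """Build a distribution over residues, up-weighting those in `enriched`."""
    weights = {aa: (boost if aa in enriched else 1.0) for aa in AMINO_ACIDS}
    total = sum(weights.values())
    return {aa: w / total for aa, w in weights.items()}


# Hypothetical, hand-picked parameters (NOT trained values from the paper).
STATES = ("background", "nls")
START = {"background": 0.9, "nls": 0.1}
TRANS = {
    "background": {"background": 0.95, "nls": 0.05},
    "nls": {"background": 0.10, "nls": 0.90},
}
EMIT = {
    "background": make_emissions("", 1.0),  # uniform over residues
    "nls": make_emissions("KR", 8.0),       # assumed enrichment in basic residues
}


def viterbi(seq):
    """Return the most probable state path for `seq`, computed in log space.

    Raises KeyError for residues outside the 20 standard letters.
    """
    if not seq:
        return []
    scores = {s: math.log(START[s]) + math.log(EMIT[s][seq[0]]) for s in STATES}
    backpointers = []
    for aa in seq[1:]:
        new_scores, pointers = {}, {}
        for s in STATES:
            # Best predecessor state for landing in state s at this position.
            best = max(STATES, key=lambda p: scores[p] + math.log(TRANS[p][s]))
            new_scores[s] = (
                scores[best] + math.log(TRANS[best][s]) + math.log(EMIT[s][aa])
            )
            pointers[s] = best
        scores = new_scores
        backpointers.append(pointers)
    # Trace back from the best final state.
    state = max(STATES, key=lambda s: scores[s])
    path = [state]
    for pointers in reversed(backpointers):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


def nls_segments(seq):
    """Return half-open (start, end) index pairs of residues decoded as NLS."""
    segments, start = [], None
    for i, state in enumerate(viterbi(seq)):
        if state == "nls" and start is None:
            start = i
        elif state != "nls" and start is not None:
            segments.append((start, i))
            start = None
    if start is not None:
        segments.append((start, len(seq)))
    return segments
```

Calling `nls_segments` on a protein sequence string returns the index ranges that this toy model labels as NLS. A real predictor would estimate the transition and emission probabilities from characterized NLSs rather than fixing them by hand, and may use a different decoding or scoring scheme than plain Viterbi.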