» Articles » PMID: 15096640

Predicting Subcellular Localization of Proteins for Gram-negative Bacteria by Support Vector Machines Based on N-peptide Compositions

Overview
Journal Protein Sci
Specialty Biochemistry
Date 2004 Apr 21
PMID 15096640
Citations 381
Authors
Affiliations
Soon will be listed here.
Abstract

Gram-negative bacteria have five major subcellular localization sites: the cytoplasm, the periplasm, the inner membrane, the outer membrane, and the extracellular space. The subcellular location of a protein can provide valuable information about its function. With the rapid increase of sequenced genomic data, the need for an automated and accurate tool to predict subcellular localization becomes increasingly important. We present an approach to predict subcellular localization for Gram-negative bacteria. This method uses the support vector machines trained by multiple feature vectors based on n-peptide compositions. For a standard data set comprising 1443 proteins, the overall prediction accuracy reaches 89%, which, to the best of our knowledge, is the highest prediction rate ever reported. Our prediction is 14% higher than that of the recently developed multimodular PSORT-B. Because of its simplicity, this approach can be easily extended to other organisms and should be a useful tool for the high-throughput and large-scale analysis of proteomic and genomic data.

Citing Articles

Proteome-wide identification of druggable targets and inhibitors for multidrug-resistant using an integrative subtractive proteomics and virtual screening approach.

Vemula D, Bhandari V Heliyon. 2025; 11(4):e42584.

PMID: 40066032 PMC: 11891712. DOI: 10.1016/j.heliyon.2025.e42584.


Identification of the xyloglucan endotransglycosylase/hydrolase genes and the role of in drought resistance in poplar.

Yuan W, Yao F, Liu Y, Xiao H, Sun S, Jiang C For Res (Fayettev). 2025; 4:e039.

PMID: 40027451 PMC: 11870306. DOI: 10.48130/forres-0024-0036.


Designing a broad-spectrum multi-epitope subunit vaccine against leptospirosis using immunoinformatics and structural approaches.

Sethi G, Kim Y, Han S, Hwang J Front Immunol. 2025; 15:1503853.

PMID: 39936152 PMC: 11811080. DOI: 10.3389/fimmu.2024.1503853.


Transcriptome analysis provides insights into the role of TLP16 in Musa acuminata Resistance to Fusarium oxysporum f. sp. cubense wilt.

Huo Y, Liu S, Huang H, Li Z, Ahmad M, Zhuo M BMC Plant Biol. 2025; 25(1):90.

PMID: 39844040 PMC: 11752679. DOI: 10.1186/s12870-024-06032-1.


Prospective identification of extracellular triacylglycerol hydrolase with conserved amino acids in 's high G+C genomic dataset.

Sraphet S, Javadi B Biotechnol Rep (Amst). 2025; 45:e00869.

PMID: 39758972 PMC: 11697127. DOI: 10.1016/j.btre.2024.e00869.


References
1.
Bairoch A, Apweiler R . The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 1999; 28(1):45-8. PMC: 102476. DOI: 10.1093/nar/28.1.45. View

2.
Yuan Z . Prediction of protein subcellular locations using Markov chain models. FEBS Lett. 1999; 451(1):23-6. DOI: 10.1016/s0014-5793(99)00506-2. View

3.
Emanuelsson O, Nielsen H, Brunak S, von Heijne G . Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. J Mol Biol. 2000; 300(4):1005-16. DOI: 10.1006/jmbi.2000.3903. View

4.
Chou K . Prediction of protein cellular attributes using pseudo-amino acid composition. Proteins. 2001; 43(3):246-55. DOI: 10.1002/prot.1035. View

5.
Hua S, Sun Z . Support vector machine approach for protein subcellular localization prediction. Bioinformatics. 2001; 17(8):721-8. DOI: 10.1093/bioinformatics/17.8.721. View