» Articles » PMID: 28658305

Predicting Human Protein Subcellular Localization by Heterogeneous and Comprehensive Approaches

Overview
Journal PLoS One
Date 2017 Jun 29
PMID 28658305
Citations 3
Authors
Affiliations
Soon will be listed here.
Abstract

Drug development and investigation of protein function both require an understanding of protein subcellular localization. We developed a system, REALoc, that can predict the subcellular localization of singleplex and multiplex proteins in humans. This system, based on comprehensive strategy, consists of two heterogeneous systematic frameworks that integrate one-to-one and many-to-many machine learning methods and use sequence-based features, including amino acid composition, surface accessibility, weighted sign aa index, and sequence similarity profile, as well as gene ontology function-based features. REALoc can be used to predict localization to six subcellular compartments (cell membrane, cytoplasm, endoplasmic reticulum/Golgi, mitochondrion, nucleus, and extracellular). REALoc yielded a 75.3% absolute true success rate during five-fold cross-validation and a 57.1% absolute true success rate in an independent database test, which was >10% higher than six other prediction systems. Lastly, we analyzed the effects of Vote and GANN models on singleplex and multiplex localization prediction efficacy. REALoc is freely available at http://predictor.nchu.edu.tw/REALoc.

Citing Articles

Ensemble of Multiple Classifiers for Multilabel Classification of Plant Protein Subcellular Localization.

Wattanapornprom W, Thammarongtham C, Hongsthong A, Lertampaiporn S Life (Basel). 2021; 11(4).

PMID: 33808227 PMC: 8066735. DOI: 10.3390/life11040293.


Plant-mSubP: a computational framework for the prediction of single- and multi-target protein subcellular localization using integrated machine-learning approaches.

Sahu S, Loaiza C, Kaundal R AoB Plants. 2020; 12(3):plz068.

PMID: 32528639 PMC: 7274489. DOI: 10.1093/aobpla/plz068.


Prediction of bacterial E3 ubiquitin ligase effectors using reduced amino acid peptide fingerprinting.

McDermott J, Cort J, Nakayasu E, Pruneda J, Overall C, Adkins J PeerJ. 2019; 7:e7055.

PMID: 31211016 PMC: 6557245. DOI: 10.7717/peerj.7055.

References
1.
Yu C, Chen Y, Lu C, Hwang J . Prediction of protein subcellular localization. Proteins. 2006; 64(3):643-51. DOI: 10.1002/prot.21018. View

2.
Shen H, Chou K . A top-down approach to enhance the power of predicting human protein subcellular localization: Hum-mPLoc 2.0. Anal Biochem. 2009; 394(2):269-74. DOI: 10.1016/j.ab.2009.07.046. View

3.
Kawashima S, Ogata H, Kanehisa M . AAindex: Amino Acid Index Database. Nucleic Acids Res. 1998; 27(1):368-9. PMC: 148186. DOI: 10.1093/nar/27.1.368. View

4.
Blum T, Briesemeister S, Kohlbacher O . MultiLoc2: integrating phylogeny and Gene Ontology terms improves subcellular protein localization prediction. BMC Bioinformatics. 2009; 10:274. PMC: 2745392. DOI: 10.1186/1471-2105-10-274. View

5.
Guo X, Liu F, Ju Y, Wang Z, Wang C . Human Protein Subcellular Localization with Integrated Source and Multi-label Ensemble Classifier. Sci Rep. 2016; 6:28087. PMC: 4914962. DOI: 10.1038/srep28087. View