Multi-kernel Transfer Learning Based on Chou's PseAAC Formulation for Protein Submitochondria Localization
Overview
Authors
Affiliations
Protein sub-organelle localization, e.g. submitochondria, seems more challenging than general protein subcellular localization, because the determination of protein's micro-level localization within organelle by fluorescent imaging technique would face up with more difficulties. Up to present, there are far few computational methods for protein submitochondria localization, and the existing sequence-based predictive models demonstrate moderate or unsatisfactory performance. Recent researches have demonstrated that gene ontology (GO) is a convincingly effective protein feature for protein subcellular localization. However, the GO information may not be available for novel proteins or sparsely annotated protein subfamilies. In allusion to the problem, we transfer the homology's GO information to the target protein and propose a multi-kernel transfer learning model for protein submitochondria localization (MK-TLM), which substantially extends our previously published work (gene ontology based transfer learning model for protein subcellular localization, GO-TLM). To reduce the risk of performance overestimation, we conduct a more comprehensive survey of the model performance in optimistic case, moderate case and pessimistic case according to the abundance of target protein's GO information. The experiments on submitochondria benchmark datasets show that MK-TLM significantly outperforms the baseline models, and demonstrates excellent performance for novel mitochondria proteins and those mitochondria proteins that belong to the subfamily we know little about.
Ensemble Learning-Based Feature Selection for Phage Protein Prediction.
Liu S, Cui C, Chen H, Liu T Front Microbiol. 2022; 13:932661.
PMID: 35910662 PMC: 9335128. DOI: 10.3389/fmicb.2022.932661.
Cong H, Liu H, Chen Y, Cao Y Med Biol Eng Comput. 2020; 58(12):3017-3038.
PMID: 33078303 DOI: 10.1007/s11517-020-02275-w.
Some illuminating remarks on molecular genetics and genomics as well as drug development.
Chou K Mol Genet Genomics. 2020; 295(2):261-274.
PMID: 31894399 DOI: 10.1007/s00438-019-01634-z.
Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks.
Haghighat Hoseini A, Mirzarezaee M Iran J Biotechnol. 2019; 16(3):e1933.
PMID: 31457027 PMC: 6697825. DOI: 10.15171/ijb.1933.
Savojardo C, Bruciaferri N, Tartari G, Martelli P, Casadio R Bioinformatics. 2019; 36(1):56-64.
PMID: 31218353 PMC: 6956790. DOI: 10.1093/bioinformatics/btz512.