» Articles » PMID: 31874615

Accurate Classification of Membrane Protein Types Based on Sequence and Evolutionary Information Using Deep Learning

Overview
Publisher Biomed Central
Specialty Biology
Date 2019 Dec 26
PMID 31874615
Citations 12
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Membrane proteins play an important role in the life activities of organisms. Knowing membrane protein types provides clues for understanding the structure and function of proteins. Though various computational methods for predicting membrane protein types have been developed, the results still do not meet the expectations of researchers.

Results: We propose two deep learning models to process sequence information and evolutionary information, respectively. Both models obtained better results than traditional machine learning models. Furthermore, to improve the performance of the sequence information model, we also provide a new vector representation method to replace the one-hot encoding, whose overall success rate improved by 3.81% and 6.55% on two datasets. Finally, a more effective model is obtained by fusing the above two models, whose overall success rate reached 95.68% and 92.98% on two datasets.

Conclusion: The final experimental results show that our method is more effective than existing methods for predicting membrane protein types, which can help laboratory researchers to identify the type of novel membrane proteins.

Citing Articles

Protein engineering in the deep learning era.

Zhou B, Tan Y, Hu Y, Zheng L, Zhong B, Hong L mLife. 2025; 3(4):477-491.

PMID: 39744096 PMC: 11685842. DOI: 10.1002/mlf2.12157.


Hybrid framework for membrane protein type prediction based on the PSSM.

Ruan X, Xia S, Li S, Su Z, Yang J Sci Rep. 2024; 14(1):17156.

PMID: 39060345 PMC: 11282086. DOI: 10.1038/s41598-024-68163-7.


CAT-CBAM-Net: An Automatic Scoring Method for Sow Body Condition Based on CNN and Transformer.

Xue H, Sun Y, Chen J, Tian H, Liu Z, Shen M Sensors (Basel). 2023; 23(18).

PMID: 37765975 PMC: 10535612. DOI: 10.3390/s23187919.


Artificial intelligence-based HDX (AI-HDX) prediction reveals fundamental characteristics to protein dynamics: Mechanisms on SARS-CoV-2 immune escape.

Yu J, Uzuner U, Long B, Wang Z, Yuan J, Dai S iScience. 2023; 26(4):106282.

PMID: 36910327 PMC: 9968663. DOI: 10.1016/j.isci.2023.106282.


Is a Long Non-Coding RNA Necessary for CTCL Cell Growth.

Rassek K, Izykowska K, Zurawek M, Pieniawska M, Nowicka K, Zhao X Int J Mol Sci. 2023; 24(4).

PMID: 36834942 PMC: 9963807. DOI: 10.3390/ijms24043531.


References
1.
Shen Z, Bao W, Huang D . Recurrent Neural Network for Predicting Transcription Factor Binding Sites. Sci Rep. 2018; 8(1):15270. PMC: 6189047. DOI: 10.1038/s41598-018-33321-1. View

2.
Deng S, Huang D . SFAPS: an R package for structure/function analysis of protein sequences based on informational spectrum method. Methods. 2014; 69(3):207-12. DOI: 10.1016/j.ymeth.2014.08.004. View

3.
Wan S, Mak M, Kung S . Mem-ADSVM: A two-layer multi-label predictor for identifying multi-functional types of membrane proteins. J Theor Biol. 2016; 398:32-42. DOI: 10.1016/j.jtbi.2016.03.013. View

4.
Huang D, Zhang L, Han K, Deng S, Yang K, Zhang H . Prediction of protein-protein interactions based on protein-protein correlation using least squares regression. Curr Protein Pept Sci. 2014; 15(6):553-60. DOI: 10.2174/1389203715666140724084019. View

5.
Zou Q, Xing P, Wei L, Liu B . Gene2vec: gene subsequence embedding for prediction of mammalian -methyladenosine sites from mRNA. RNA. 2018; 25(2):205-218. PMC: 6348985. DOI: 10.1261/rna.069112.118. View