» Articles » PMID: 25445293

Prediction of Protein Structural Classes for Low-similarity Sequences Using Reduced PSSM and Position-based Secondary Structural Features

Overview
Journal Gene
Specialty Molecular Biology
Date 2014 Dec 3
PMID 25445293
Citations 6
Authors
Affiliations
Soon will be listed here.
Abstract

Many efficient methods have been proposed to advance protein structural class prediction, but there are still some challenges where additional insight or technology is needed for low-similarity sequences. In this work, we schemed out a new prediction method for low-similarity datasets using reduced PSSM and position-based secondary structural features. We evaluated the proposed method with four experiments and compared it with the available competing prediction methods. The results indicate that the proposed method achieved the best performance among the evaluated methods, with overall accuracy 3-5% higher than the existing best-performing method. This paper also found that the reduced alphabets with size 13 simplify PSSM structures efficiently while reserving its maximal information. This understanding can be used to design more powerful prediction methods for protein structural class.

Citing Articles

Comparative Study on Feature Selection in Protein Structure and Function Prediction.

Yi W, Sun A, Liu M, Liu X, Zhang W, Dai Q Comput Math Methods Med. 2022; 2022:1650693.

PMID: 36267316 PMC: 9578875. DOI: 10.1155/2022/1650693.


Using Recursive Feature Selection with Random Forest to Improve Protein Structural Class Prediction for Low-Similarity Sequences.

Wang Y, Xu Y, Yang Z, Liu X, Dai Q Comput Math Methods Med. 2021; 2021:5529389.

PMID: 34055035 PMC: 8123985. DOI: 10.1155/2021/5529389.


Prediction of protein structural classes by different feature expressions based on 2-D wavelet denoising and fusion.

Wang S, Wang X BMC Bioinformatics. 2019; 20(Suppl 25):701.

PMID: 31874617 PMC: 6929547. DOI: 10.1186/s12859-019-3276-5.


Prediction of protein self-interactions using stacked long short-term memory from protein sequences information.

Wang Y, You Z, Li X, Jiang T, Cheng L, Chen Z BMC Syst Biol. 2018; 12(Suppl 8):129.

PMID: 30577794 PMC: 6302371. DOI: 10.1186/s12918-018-0647-x.


Computational analysis and prediction of lysine malonylation sites by exploiting informative features in an integrative machine-learning framework.

Zhang Y, Xie R, Wang J, Leier A, Marquez-Lago T, Akutsu T Brief Bioinform. 2018; 20(6):2185-2199.

PMID: 30351377 PMC: 6954445. DOI: 10.1093/bib/bby079.