Prediction of Protein Structural Classes for Low-similarity Sequences Using Reduced PSSM and Position-based Secondary Structural Features
Overview
Many efficient methods have been proposed for protein structural class prediction, but low-similarity sequences remain a challenge that calls for additional insight or technology. In this work, we designed a new prediction method for low-similarity datasets using reduced PSSM and position-based secondary structural features. We evaluated the proposed method in four experiments and compared it with the available competing prediction methods. The results indicate that the proposed method achieved the best performance among the evaluated methods, with an overall accuracy 3-5% higher than that of the existing best-performing method. We also found that a reduced amino acid alphabet of size 13 simplifies the PSSM structure efficiently while preserving as much of its information as possible. This understanding can be used to design more powerful prediction methods for protein structural class.
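To make the idea of a reduced PSSM concrete, the following is a minimal sketch of how the 20 amino acid columns of a PSSM might be collapsed into 13 columns using a reduced alphabet. The abstract does not give the actual size-13 grouping or the aggregation rule, so the grouping `HYPOTHETICAL_GROUPS`, the function name `reduce_pssm`, and the choice to average the scores within each group are all assumptions made for illustration only. The column order is the one used in PSI-BLAST ASCII PSSM output.

```python
# Column order of a PSI-BLAST ASCII PSSM.
AA_ORDER = "ARNDCQEGHILKMFPSTWYV"

# HYPOTHETICAL 13-group reduced alphabet (7 pairs + 6 singletons).
# This is NOT the grouping from the paper; it only illustrates the mechanics.
HYPOTHETICAL_GROUPS = [
    "A", "C", "G", "H", "P", "W",
    "IV", "LM", "FY", "ST", "DE", "KR", "NQ",
]


def reduce_pssm(pssm, groups=HYPOTHETICAL_GROUPS, aa_order=AA_ORDER):
    """Collapse an L x 20 PSSM into an L x len(groups) matrix.

    Each reduced column is the mean of the original columns whose amino
    acids belong to the same group (averaging is an assumption here).
    """
    # The groups must partition the full alphabet exactly once.
    if sorted("".join(groups)) != sorted(aa_order):
        raise ValueError("groups must cover each amino acid exactly once")

    index = {aa: i for i, aa in enumerate(aa_order)}
    reduced = []
    for row in pssm:
        if len(row) != len(aa_order):
            raise ValueError("each PSSM row must have one score per amino acid")
        reduced.append(
            [sum(row[index[aa]] for aa in group) / len(group) for group in groups]
        )
    return reduced


# Toy one-residue "PSSM" whose scores equal the column indices 0..19.
toy_pssm = [[float(i) for i in range(20)]]
toy_reduced = reduce_pssm(toy_pssm)
```

With this kind of reduction, features computed from the matrix operate on 13 columns instead of 20, which is the sense in which a reduced alphabet simplifies the PSSM. Whether scores are averaged, summed, or combined in some other way is a design choice that the abstract does not specify.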