Single-stranded and Double-stranded DNA-binding Protein Prediction Using HMM Profiles
Overview
Affiliations
Background: DNA-binding proteins perform important roles in cellular processes and are involved in many biological activities. These proteins include crucial protein-DNA binding domains and can interact with single-stranded or double-stranded DNA, and accordingly classified as single-stranded DNA-binding proteins (SSBs) or double-stranded DNA-binding proteins (DSBs). Computational prediction of SSBs and DSBs helps in annotating protein functions and understanding of protein-binding domains.
Results: Performance is reported using the DNA-binding protein dataset that was recently introduced by Wang et al., [1]. The proposed method achieved a sensitivity of 0.600, specificity of 0.792, AUC of 0.758, MCC of 0.369, accuracy of 0.744, and F-measure of 0.536, on the independent test set.
Conclusion: The proposed method with the hidden Markov model (HMM) profiles for feature extraction, outperformed the benchmark method in the literature and achieved an overall improvement of approximately 3%. The source code and supplementary information of the proposed method is available at https://github.com/roneshsharma/Predict-DNA-binding-proteins/wiki.
Accurate prediction of nucleic acid binding proteins using protein language model.
Wu S, Xu J, Guo J Bioinform Adv. 2025; 5(1):vbaf008.
PMID: 39990254 PMC: 11845279. DOI: 10.1093/bioadv/vbaf008.
Improved prediction of DNA and RNA binding proteins with deep learning models.
Wu S, Guo J Brief Bioinform. 2024; 25(4).
PMID: 38856168 PMC: 11163377. DOI: 10.1093/bib/bbae285.
Guo J, Malik F Biomolecules. 2022; 12(9).
PMID: 36139026 PMC: 9496475. DOI: 10.3390/biom12091187.
DeepFeature: feature selection in nonimage data using convolutional neural network.
Sharma A, Lysenko A, Boroevich K, Vans E, Tsunoda T Brief Bioinform. 2021; 22(6).
PMID: 34368836 PMC: 8575039. DOI: 10.1093/bib/bbab297.