» Articles » PMID: 28132027

DRNApred, Fast Sequence-based Method That Accurately Predicts and Discriminates DNA- and RNA-binding Residues

Overview
Specialty Biochemistry
Date 2017 Jan 30
PMID 28132027
Citations 78
Authors
Affiliations
Soon will be listed here.
Abstract

Protein-DNA and protein-RNA interactions are part of many diverse and essential cellular functions and yet most of them remain to be discovered and characterized. Recent research shows that sequence-based predictors of DNA-binding residues accurately find these residues but also cross-predict many RNA-binding residues as DNA-binding, and vice versa. Most of these methods are also relatively slow, prohibiting applications on the whole-genome scale. We describe a novel sequence-based method, DRNApred, which accurately and in high-throughput predicts and discriminates between DNA- and RNA-binding residues. DRNApred was designed using a new dataset with both DNA- and RNA-binding proteins, regression that penalizes cross-predictions, and a novel two-layered architecture. DRNApred outperforms state-of-the-art predictors of DNA- or RNA-binding residues on a benchmark test dataset by substantially reducing the cross predictions and predicting arguably higher quality false positives that are located nearby the native binding residues. Moreover, it also more accurately predicts the DNA- and RNA-binding proteins. Application on the human proteome confirms that DRNApred reduces the cross predictions among the native nucleic acid binders. Also, novel putative DNA/RNA-binding proteins that it predicts share similar subcellular locations and residue charge profiles with the known native binding proteins. Webserver of DRNApred is freely available at http://biomine.cs.vcu.edu/servers/DRNApred/.

Citing Articles

Twenty years of advances in prediction of nucleic acid-binding residues in protein sequences.

Basu S, Yu J, Kihara D, Kurgan L Brief Bioinform. 2025; 26(1).

PMID: 39833102 PMC: 11745544. DOI: 10.1093/bib/bbaf016.


Centromeric localization of αKNL2 and CENP-C proteins in plants depends on their centromere-targeting domain and DNA-binding regions.

Yalagapati S, Ahmadli U, Sinha A, Kalidass M, Dabravolski S, Zuo S Nucleic Acids Res. 2024; 53(4).

PMID: 39718987 PMC: 11879092. DOI: 10.1093/nar/gkae1242.


Benchmarking recent computational tools for DNA-binding protein identification.

Luo X, Chi A, Lin A, Ong T, Wong L, Rahman C Brief Bioinform. 2024; 26(1).

PMID: 39657630 PMC: 11630855. DOI: 10.1093/bib/bbae634.


RNA-protein interaction prediction without high-throughput data: An overview and benchmark of tools.

Krautwurst S, Lamkiewicz K Comput Struct Biotechnol J. 2024; 23:4036-4046.

PMID: 39610906 PMC: 11603007. DOI: 10.1016/j.csbj.2024.11.015.


Accurate Prediction of Protein-Binding Residues in Protein Sequences Using SCRIBER.

Zhang J, Zhou F, Liang X, Kurgan L Methods Mol Biol. 2024; 2867:247-260.

PMID: 39576586 DOI: 10.1007/978-1-0716-4196-5_15.


References
1.
Ahmad S, Gromiha M, Sarai A . Analysis and prediction of DNA-binding proteins and their binding residues based on composition, sequence and structural information. Bioinformatics. 2004; 20(4):477-86. DOI: 10.1093/bioinformatics/btg432. View

2.
Terribilini M, Sander J, Lee J, Zaback P, Jernigan R, Honavar V . RNABindR: a server for analyzing and predicting RNA-binding sites in proteins. Nucleic Acids Res. 2007; 35(Web Server issue):W578-84. PMC: 1933119. DOI: 10.1093/nar/gkm294. View

3.
Kuznetsov I, Gou Z, Li R, Hwang S . Using evolutionary and structural information to predict DNA-binding sites on DNA-binding proteins. Proteins. 2006; 64(1):19-27. DOI: 10.1002/prot.20977. View

4.
. Activities at the Universal Protein Resource (UniProt). Nucleic Acids Res. 2013; 42(Database issue):D191-8. PMC: 3965022. DOI: 10.1093/nar/gkt1140. View

5.
Rost B, Sander C . Conservation and prediction of solvent accessibility in protein families. Proteins. 1994; 20(3):216-26. DOI: 10.1002/prot.340200303. View