» Articles » PMID: 16551468

Efficient Prediction of Nucleic Acid Binding Function from Low-resolution Protein Structures

Overview
Journal J Mol Biol
Publisher Elsevier
Date 2006 Mar 23
PMID 16551468
Citations 64
Authors
Affiliations
Soon will be listed here.
Abstract

Structural genomics projects as well as ab initio protein structure prediction methods provide structures of proteins with no sequence or fold similarity to proteins with known functions. These are often low-resolution structures that may only include the positions of C alpha atoms. We present a fast and efficient method to predict DNA-binding proteins from just the amino acid sequences and low-resolution, C alpha-only protein models. The method uses the relative proportions of certain amino acids in the protein sequence, the asymmetry of the spatial distribution of certain other amino acids as well as the dipole moment of the molecule. These quantities are used in a linear formula, with coefficients derived from logistic regression performed on a training set, and DNA-binding is predicted based on whether the result is above a certain threshold. We show that the method is insensitive to errors in the atomic coordinates and provides correct predictions even on inaccurate protein models. We demonstrate that the method is capable of predicting proteins with novel binding site motifs and structures solved in an unbound state. The accuracy of our method is close to another, published method that uses all-atom structures, time-consuming calculations and information on conserved residues.

Citing Articles

New components of the community-based DNA-repair mechanism in Sulfolobales.

Recalde A, Wagner A, Sivabalasarma S, Yurmashava A, Fehr N, Thurm R Microlife. 2025; 6:uqaf002.

PMID: 39949789 PMC: 11823120. DOI: 10.1093/femsml/uqaf002.


Expanding the Diversity of : A Novel Genus from .

Washington J, Basta H, De Jesus A, Bendele M, Cresawn S, Ginser E Viruses. 2025; 17(1).

PMID: 39861902 PMC: 11768872. DOI: 10.3390/v17010113.


Benchmarking recent computational tools for DNA-binding protein identification.

Luo X, Chi A, Lin A, Ong T, Wong L, Rahman C Brief Bioinform. 2024; 26(1).

PMID: 39657630 PMC: 11630855. DOI: 10.1093/bib/bbae634.


ProkDBP: Toward more precise identification of prokaryotic DNA binding proteins.

Pradhan U, Meher P, Naha S, Das R, Gupta A, Parsad R Protein Sci. 2024; 33(6):e5015.

PMID: 38747369 PMC: 11094783. DOI: 10.1002/pro.5015.


StackDPP: a stacking ensemble based DNA-binding protein prediction model.

Ahmed S, Bose D, Khandoker R, Rahman M BMC Bioinformatics. 2024; 25(1):111.

PMID: 38486135 PMC: 10941422. DOI: 10.1186/s12859-024-05714-9.