
CRHunter: Integrating Multifaceted Information to Predict Catalytic Residues in Enzymes

Overview
Journal Sci Rep
Specialty Science
Date 2016 Sep 27
PMID 27665935
Citations 7
Abstract

A variety of algorithms have been developed for catalytic residue prediction, using either feature-based or template-based methodology. However, no study has systematically compared these two strategies or examined whether combining them improves prediction performance. Herein, we developed an integrative algorithm named CRHunter that exploits the complementarity between feature- and template-based methodologies and between structural and sequence information. Several novel structural features were generated by the Delaunay triangulation and Laplacian transformation of enzyme structures. Combining these features with traditional descriptors, we built two support vector machine feature predictors based on structural and sequence information. Furthermore, we established two template predictors using structure and profile alignments. Evaluated on datasets with different levels of homology, our feature predictors achieve relatively stable performance, whereas our template predictors yield poor results when the homology relationships become weak. Nevertheless, the hybrid algorithm CRHunter consistently achieves the best performance among all our predictors. We also show that our methodology can be applied to predicted enzyme structures. Compared with state-of-the-art methods, CRHunter yields comparable or better performance on various datasets. Finally, applying this algorithm to structural genomics targets sheds light on solved protein structures of unknown function.
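
The abstract mentions structural features derived from a Laplacian transformation of enzyme structures but does not spell out the construction. The following is only a minimal sketch of the general idea, not the authors' implementation: it assumes a simple distance-cutoff contact graph as a stand-in for the Delaunay triangulation, and applies the graph Laplacian L = D - A to a per-residue property. The function names, the 8 Å cutoff, the toy coordinates, and the conservation scores are all hypothetical.

```python
import math

def contact_graph(coords, cutoff=8.0):
    """Link residues whose representative atoms lie within `cutoff` angstroms.

    Assumption: a distance cutoff stands in for the Delaunay triangulation
    named in the abstract; it is not the paper's graph construction.
    """
    n = len(coords)
    neighbors = [set() for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(coords[i], coords[j]) <= cutoff:
                neighbors[i].add(j)
                neighbors[j].add(i)
    return neighbors

def laplacian_transform(neighbors, values):
    """Apply the graph Laplacian L = D - A to a per-residue property vector.

    Entry i is degree(i) * values[i] minus the sum over i's neighbors, so it
    measures how much residue i deviates from its spatial environment.
    """
    return [
        len(nbrs) * values[i] - sum(values[j] for j in nbrs)
        for i, nbrs in enumerate(neighbors)
    ]

# Toy C-alpha coordinates and scores (hypothetical, not from a real enzyme).
coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (30.0, 0.0, 0.0)]
conservation = [0.9, 0.2, 0.4, 0.5]

graph = contact_graph(coords)
features = laplacian_transform(graph, conservation)
print(features)
```

In a sketch like this, a large positive value flags a residue whose property stands out from its neighbors, which is the kind of local-environment contrast that could then be passed to a classifier such as a support vector machine alongside other descriptors.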

Citing Articles

Precise prediction of phase-separation key residues by machine learning.

Sun J, Qu J, Zhao C, Zhang X, Liu X, Wang J. Nat Commun. 2024; 15(1):2662.

PMID: 38531854 PMC: 10965946. DOI: 10.1038/s41467-024-46901-9.


Enzyme function and evolution through the lens of bioinformatics.

Ribeiro A, Riziotis I, Borkakoti N, Thornton J. Biochem J. 2023; 480(22):1845-1863.

PMID: 37991346 PMC: 10754289. DOI: 10.1042/BCJ20220405.


Dissecting and predicting different types of binding sites in nucleic acids based on structural information.

Jiang Z, Xiao S, Liu R. Brief Bioinform. 2021; 23(1).

PMID: 34624074 PMC: 8769709. DOI: 10.1093/bib/bbab411.


Machine learning differentiates enzymatic and non-enzymatic metals in proteins.

Feehan R, Franklin M, Slusky J. Nat Commun. 2021; 12(1):3712.

PMID: 34140507 PMC: 8211803. DOI: 10.1038/s41467-021-24070-3.


CATH functional families predict functional sites in proteins.

Das S, Scholes H, Sen N, Orengo C. Bioinformatics. 2020; 37(8):1099-1106.

PMID: 33135053 PMC: 8150129. DOI: 10.1093/bioinformatics/btaa937.

