» Articles » PMID: 19731372

The Other 90% of the Protein: Assessment Beyond the Calphas for CASP8 Template-based and High-accuracy Models

Overview
Journal Proteins
Date 2009 Sep 5
PMID 19731372
Citations 51
Authors
Affiliations
Soon will be listed here.
Abstract

For template-based modeling in the CASP8 Critical Assessment of Techniques for Protein Structure Prediction, this work develops and applies six new full-model metrics. They are designed to complement and add value to the traditional template-based assessment by the global distance test (GDT) and related scores (based on multiple superpositions of Calpha atoms between target structure and predictions labeled "Model 1"). The new metrics evaluate each predictor group on each target, using all atoms of their best model with above-average GDT. Two metrics evaluate how "protein-like" the predicted model is: the MolProbity score used for validating experimental structures, and a mainchain reality score using all-atom steric clashes, bond length and angle outliers, and backbone dihedrals. Four other new metrics evaluate match of model to target for mainchain and sidechain hydrogen bonds, sidechain end positioning, and sidechain rotamers. Group-average Z-score across the six full-model measures is averaged with group-average GDT Z-score to produce the overall ranking for full-model, high-accuracy performance. Separate assessments are reported for specific aspects of predictor-group performance, such as robustness of approximately correct template or fold identification, and self-scoring ability at identifying the best of their models. Fold identification is distinct from but correlated with group-average GDT Z-score if target difficulty is taken into account, whereas self-scoring is done best by servers and is uncorrelated with GDT performance. Outstanding individual models on specific targets are identified and discussed. Predictor groups excelled at different aspects, highlighting the diversity of current methodologies. However, good full-model scores correlate robustly with high Calpha accuracy.

Citing Articles

Clustering Protein Binding Pockets and Identifying Potential Drug Interactions: A Novel Ligand-Based Featurization Method.

Stevenson G, Kirshner D, Bennion B, Yang Y, Zhang X, Zemla A J Chem Inf Model. 2023; 63(21):6655-6666.

PMID: 37847557 PMC: 10647021. DOI: 10.1021/acs.jcim.3c00722.


The transformative power of transformers in protein structure prediction.

Moussad B, Roche R, Bhattacharya D Proc Natl Acad Sci U S A. 2023; 120(32):e2303499120.

PMID: 37523536 PMC: 10410766. DOI: 10.1073/pnas.2303499120.


A Computational Pipeline to Identify and Characterize Binding Sites and Interacting Chemotypes in SARS-CoV-2.

Sandholtz S, Drocco J, Zemla A, Torres M, Silva M, Allen J ACS Omega. 2023; 8(24):21871-21884.

PMID: 37309388 PMC: 10254058. DOI: 10.1021/acsomega.3c01621.


PDBspheres: a method for finding 3D similarities in local regions in proteins.

Zemla A, Allen J, Kirshner D, Lightstone F NAR Genom Bioinform. 2022; 4(4):lqac078.

PMID: 36225529 PMC: 9549786. DOI: 10.1093/nargab/lqac078.


Topology evaluation of models for difficult targets in the 14th round of the critical assessment of protein structure prediction (CASP14).

Kinch L, Pei J, Kryshtafovych A, Schaeffer R, Grishin N Proteins. 2021; 89(12):1673-1686.

PMID: 34240477 PMC: 8616777. DOI: 10.1002/prot.26172.


References
1.
Word J, Lovell S, Richardson J, Richardson D . Asparagine and glutamine: using hydrogen atom contacts in the choice of side-chain amide orientation. J Mol Biol. 1999; 285(4):1735-47. DOI: 10.1006/jmbi.1998.2401. View

2.
Krieger E, Joo K, Lee J, Lee J, Raman S, Thompson J . Improving physical realism, stereochemistry, and side-chain accuracy in homology modeling: Four approaches that performed well in CASP8. Proteins. 2009; 77 Suppl 9:114-22. PMC: 2922016. DOI: 10.1002/prot.22570. View

3.
Arendall 3rd W, Tempel W, Richardson J, Zhou W, Wang S, Davis I . A test of enhancing model accuracy in high-throughput crystallography. J Struct Funct Genomics. 2005; 6(1):1-11. DOI: 10.1007/s10969-005-3138-4. View

4.
Kopp J, Bordoli L, Battey J, Kiefer F, Schwede T . Assessment of CASP7 predictions for template-based modeling targets. Proteins. 2007; 69 Suppl 8:38-56. DOI: 10.1002/prot.21753. View

5.
Read R, Chavali G . Assessment of CASP7 predictions in the high accuracy template-based modeling category. Proteins. 2007; 69 Suppl 8:27-37. DOI: 10.1002/prot.21662. View