
Designing and Evaluating the MULTICOM Protein Local and Global Model Quality Prediction Methods in the CASP10 Experiment

Overview
Journal BMC Struct Biol
Publisher Biomed Central
Date 2014 Apr 16
PMID 24731387
Citations 26
Abstract

Background: Protein model quality assessment is an essential component of generating and using protein structural models. During the Tenth Critical Assessment of Techniques for Protein Structure Prediction (CASP10), we developed and tested four automated methods (MULTICOM-REFINE, MULTICOM-CLUSTER, MULTICOM-NOVEL, and MULTICOM-CONSTRUCT) that predicted both local and global quality of protein structural models.

Results: MULTICOM-REFINE was a clustering approach that used the average pairwise structural similarity between models to measure global quality, and the average Euclidean distance between a model and several top-ranked models to measure local quality. MULTICOM-CLUSTER and MULTICOM-NOVEL were two new support vector machine-based methods for predicting both the local and global quality of a single protein model. MULTICOM-CONSTRUCT was a new weighted pairwise model comparison (clustering) method that used the weighted average similarity between models in a pool to measure global model quality. Our experiments showed that the pairwise model assessment methods worked better when a large portion of the models in the pool were of good quality, whereas single-model quality assessment methods performed better on some hard targets when only a small portion of the models in the pool were of reasonable quality.
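
As a rough illustration of the clustering idea described for MULTICOM-REFINE, the sketch below scores each model by its average similarity to every other model in the pool, and estimates per-residue error as the average Euclidean distance to the corresponding residues in a few reference models. It is a minimal sketch, not the authors' implementation: the function names, the precomputed similarity matrix (values assumed to lie in [0, 1]), and the assumption that all models are already superimposed and share the same residue indexing are ours.

```python
import math

def consensus_global_scores(sim):
    """Global quality of each model as its mean similarity to all other models.

    sim is a symmetric n x n matrix (list of lists); sim[i][j] is an assumed,
    precomputed structural similarity between models i and j.
    """
    n = len(sim)
    if n < 2:
        raise ValueError("need at least two models for pairwise comparison")
    return [
        sum(sim[i][j] for j in range(n) if j != i) / (n - 1)
        for i in range(n)
    ]

def local_distance_profile(model, references):
    """Per-residue mean Euclidean distance between a model and reference models.

    model is a list of (x, y, z) coordinates, one per residue; references is a
    list of such coordinate lists. All structures are assumed to be already
    superimposed onto a common frame (an assumption of this sketch).
    """
    if not references:
        raise ValueError("need at least one reference model")
    return [
        sum(math.dist(model[k], ref[k]) for ref in references) / len(references)
        for k in range(len(model))
    ]

# Toy pool of three models: models 0 and 1 agree with each other, model 2 is an outlier.
sim = [
    [1.0, 0.8, 0.2],
    [0.8, 1.0, 0.3],
    [0.2, 0.3, 1.0],
]
global_scores = consensus_global_scores(sim)

# Toy two-residue model compared against two reference models.
model = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
references = [
    [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)],
    [(3.0, 4.0, 0.0), (1.0, 0.0, 0.0)],
]
local_profile = local_distance_profile(model, references)
```

In the toy pool, the outlier model receives the lowest global score because nothing else in the pool resembles it. This also shows why such consensus scores depend on most of the pool being of good quality.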

Conclusions: Since identifying a few good models in a large pool of low-quality models is a major challenge in protein structure prediction, single-model quality assessment methods appear poised to make important contributions to protein structure modeling. Another interesting finding was that single-model quality assessment scores could be used to weight the models in the consensus pairwise model comparison method to improve its accuracy.
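
One way to picture the weighting idea in the conclusions is a weighted average in which each model's contribution to the consensus is scaled by its single-model quality score. The sketch below is only an illustration under that assumption. The abstract does not give the exact weighting scheme, and the function name, the similarity matrix, and the weights are hypothetical.

```python
def weighted_consensus_scores(sim, weights):
    """Weighted pairwise consensus score for each model in a pool.

    sim is a symmetric n x n similarity matrix (list of lists).
    weights[j] is an assumed single-model quality score for model j, used
    here as the weight of model j when scoring every other model.
    """
    n = len(sim)
    if n < 2 or len(weights) != n:
        raise ValueError("need n >= 2 models and one weight per model")
    scores = []
    for i in range(n):
        total_weight = sum(weights[j] for j in range(n) if j != i)
        if total_weight == 0:
            # No other model carries any weight; fall back to 0.0 in this sketch.
            scores.append(0.0)
            continue
        weighted_sum = sum(weights[j] * sim[i][j] for j in range(n) if j != i)
        scores.append(weighted_sum / total_weight)
    return scores

# Toy pool: the single-model scores give model 2 zero weight, so it no longer
# influences the consensus scores of models 0 and 1.
sim = [
    [1.0, 0.8, 0.2],
    [0.8, 1.0, 0.3],
    [0.2, 0.3, 1.0],
]
weights = [1.0, 1.0, 0.0]
weighted_scores = weighted_consensus_scores(sim, weights)
```

With uniform weights this reduces to the plain average pairwise similarity. Down-weighting models that a single-model method rates poorly keeps a large group of bad models from dominating the consensus.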

Citing Articles

Computational Approaches for Identification of Potential Plant Bioactives as Novel G6PD Inhibitors Using Advanced Tools and Databases.

Aldossari R, Ali A, Rehman M, Rashid S, Ahmad S Molecules. 2023; 28(7).

PMID: 37049781 PMC: 10096328. DOI: 10.3390/molecules28073018.


MULTICOM2 open-source protein structure prediction system powered by deep learning and distance prediction.

Wu T, Liu J, Guo Z, Hou J, Cheng J Sci Rep. 2021; 11(1):13155.

PMID: 34162922 PMC: 8222248. DOI: 10.1038/s41598-021-92395-6.


QMEANDisCo-distance constraints applied on model quality estimation.

Studer G, Rempfer C, Waterhouse A, Gumienny R, Haas J, Schwede T Bioinformatics. 2019; 36(6):1765-1771.

PMID: 31697312 PMC: 7075525. DOI: 10.1093/bioinformatics/btz828.


4mCpred-EL: An Ensemble Learning Framework for Identification of DNA N4-methylcytosine Sites in the Mouse Genome.

Manavalan B, Basith S, Shin T, Lee D, Wei L, Lee G Cells. 2019; 8(11).

PMID: 31661923 PMC: 6912380. DOI: 10.3390/cells8111332.


Protein model accuracy estimation based on local structure quality assessment using 3D convolutional neural network.

Sato R, Ishida T PLoS One. 2019; 14(9):e0221347.

PMID: 31487288 PMC: 6728020. DOI: 10.1371/journal.pone.0221347.

