» Articles » PMID: 24808625

NEW MDS AND CLUSTERING BASED ALGORITHMS FOR PROTEIN MODEL QUALITY ASSESSMENT AND SELECTION

Overview
Publisher World Scientific
Date 2014 May 9
PMID 24808625
Citations 1
Authors
Affiliations
Soon will be listed here.
Abstract

In protein tertiary structure prediction, assessing the quality of predicted models is an essential task. Over the past years, many methods have been proposed for the protein model quality assessment (QA) and selection problem. Despite significant advances, the discerning power of current methods is still unsatisfactory. In this paper, we propose two new algorithms, CC-Select and MDS-QA, based on multidimensional scaling and -means clustering. For the model selection problem, CC-Select combines consensus with clustering techniques to select the best models from a given pool. Given a set of predicted models, CC-Select first calculates a consensus score for each structure based on its average pairwise structural similarity to other models. Then, similar structures are grouped into clusters using multidimensional scaling and clustering algorithms. In each cluster, the one with the highest consensus score is selected as a candidate model. For the QA problem, MDS-QA combines single-model scoring functions with consensus to determine more accurate assessment score for every model in a given pool. Using extensive benchmark sets of a large collection of predicted models, we compare the two algorithms with existing state-of-the-art quality assessment methods and show significant improvement.

Citing Articles

MQAPRank: improved global protein model quality assessment by learning-to-rank.

Jing X, Dong Q BMC Bioinformatics. 2017; 18(1):275.

PMID: 28545390 PMC: 5445322. DOI: 10.1186/s12859-017-1691-z.

References
1.
Kalman M, Ben-Tal N . Quality assessment of protein model-structures using evolutionary conservation. Bioinformatics. 2010; 26(10):1299-307. PMC: 2865859. DOI: 10.1093/bioinformatics/btq114. View

2.
Zemla A . LGA: A method for finding 3D similarities in protein structures. Nucleic Acids Res. 2003; 31(13):3370-4. PMC: 168977. DOI: 10.1093/nar/gkg571. View

3.
Zhang J, Wang Q, Barz B, He Z, Kosztin I, Shang Y . MUFOLD: A new solution for protein 3D structure prediction. Proteins. 2009; 78(5):1137-52. PMC: 2885889. DOI: 10.1002/prot.22634. View

4.
Zhang J, Zhang Y . A novel side-chain orientation dependent potential derived from random-walk reference state for protein fold selection and structure prediction. PLoS One. 2010; 5(10):e15386. PMC: 2965178. DOI: 10.1371/journal.pone.0015386. View

5.
Zhou H, Zhou Y . Distance-scaled, finite ideal-gas reference state improves structure-derived potentials of mean force for structure selection and stability prediction. Protein Sci. 2002; 11(11):2714-26. PMC: 2373736. DOI: 10.1110/ps.0217002. View