» Articles » PMID: 27161489

A Stochastic Point Cloud Sampling Method for Multi-Template Protein Comparative Modeling

Overview
Journal Sci Rep
Specialty Science
Date 2016 May 11
PMID 27161489
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

Generating tertiary structural models for a target protein from the known structure of its homologous template proteins and their pairwise sequence alignment is a key step in protein comparative modeling. Here, we developed a new stochastic point cloud sampling method, called MTMG, for multi-template protein model generation. The method first superposes the backbones of template structures, and the Cα atoms of the superposed templates form a point cloud for each position of a target protein, which are represented by a three-dimensional multivariate normal distribution. MTMG stochastically resamples the positions for Cα atoms of the residues whose positions are uncertain from the distribution, and accepts or rejects new position according to a simulated annealing protocol, which effectively removes atomic clashes commonly encountered in multi-template comparative modeling. We benchmarked MTMG on 1,033 sequence alignments generated for CASP9, CASP10 and CASP11 targets, respectively. Using multiple templates with MTMG improves the GDT-TS score and TM-score of structural models by 2.96-6.37% and 2.42-5.19% on the three datasets over using single templates. MTMG's performance was comparable to Modeller in terms of GDT-TS score, TM-score, and GDT-HA score, while the average RMSD was improved by a new sampling approach. The MTMG software is freely available at: http://sysbio.rnet.missouri.edu/multicom_toolbox/mtmg.html.

Citing Articles

Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14.

Liu J, Wu T, Guo Z, Hou J, Cheng J Proteins. 2021; 90(1):58-72.

PMID: 34291486 PMC: 8671168. DOI: 10.1002/prot.26186.


DeepQA: improving the estimation of single protein model quality with deep belief networks.

Cao R, Bhattacharya D, Hou J, Cheng J BMC Bioinformatics. 2016; 17(1):495.

PMID: 27919220 PMC: 5139030. DOI: 10.1186/s12859-016-1405-y.

References
1.
Welch B . The generalisation of student's problems when several different population variances are involved. Biometrika. 2010; 34(1-2):28-35. DOI: 10.1093/biomet/34.1-2.28. View

2.
Xu D, Zhang Y . Improving the physical realism and structural accuracy of protein models by a two-step atomic-level energy minimization. Biophys J. 2011; 101(10):2525-34. PMC: 3218324. DOI: 10.1016/j.bpj.2011.10.024. View

3.
Biegert A, Soding J . Sequence context-specific profiles for homology searching. Proc Natl Acad Sci U S A. 2009; 106(10):3770-5. PMC: 2645910. DOI: 10.1073/pnas.0810767106. View

4.
Zhang Y, Skolnick J . Scoring function for automated assessment of protein structure template quality. Proteins. 2004; 57(4):702-10. DOI: 10.1002/prot.20264. View

5.
Chen V, Arendall 3rd W, Headd J, Keedy D, Immormino R, Kapral G . MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr D Biol Crystallogr. 2010; 66(Pt 1):12-21. PMC: 2803126. DOI: 10.1107/S0907444909042073. View