A Stochastic Point Cloud Sampling Method for Multi-Template Protein Comparative Modeling
Authors
Affiliations
Generating tertiary structural models for a target protein from the known structure of its homologous template proteins and their pairwise sequence alignment is a key step in protein comparative modeling. Here, we developed a new stochastic point cloud sampling method, called MTMG, for multi-template protein model generation. The method first superposes the backbones of template structures, and the Cα atoms of the superposed templates form a point cloud for each position of a target protein, which are represented by a three-dimensional multivariate normal distribution. MTMG stochastically resamples the positions for Cα atoms of the residues whose positions are uncertain from the distribution, and accepts or rejects new position according to a simulated annealing protocol, which effectively removes atomic clashes commonly encountered in multi-template comparative modeling. We benchmarked MTMG on 1,033 sequence alignments generated for CASP9, CASP10 and CASP11 targets, respectively. Using multiple templates with MTMG improves the GDT-TS score and TM-score of structural models by 2.96-6.37% and 2.42-5.19% on the three datasets over using single templates. MTMG's performance was comparable to Modeller in terms of GDT-TS score, TM-score, and GDT-HA score, while the average RMSD was improved by a new sampling approach. The MTMG software is freely available at: http://sysbio.rnet.missouri.edu/multicom_toolbox/mtmg.html.
Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14.
Liu J, Wu T, Guo Z, Hou J, Cheng J Proteins. 2021; 90(1):58-72.
PMID: 34291486 PMC: 8671168. DOI: 10.1002/prot.26186.
DeepQA: improving the estimation of single protein model quality with deep belief networks.
Cao R, Bhattacharya D, Hou J, Cheng J BMC Bioinformatics. 2016; 17(1):495.
PMID: 27919220 PMC: 5139030. DOI: 10.1186/s12859-016-1405-y.