» Articles » PMID: 12885659

TOUCHSTONE II: a New Approach to Ab Initio Protein Structure Prediction

Overview
Journal Biophys J
Publisher Cell Press
Specialty Biophysics
Date 2003 Jul 30
PMID 12885659
Citations 123
Authors
Affiliations
Soon will be listed here.
Abstract

We have developed a new combined approach for ab initio protein structure prediction. The protein conformation is described as a lattice chain connecting C(alpha) atoms, with attached C(beta) atoms and side-chain centers of mass. The model force field includes various short-range and long-range knowledge-based potentials derived from a statistical analysis of the regularities of protein structures. The combination of these energy terms is optimized through the maximization of correlation for 30 x 60,000 decoys between the root mean square deviation (RMSD) to native and energies, as well as the energy gap between native and the decoy ensemble. To accelerate the conformational search, a newly developed parallel hyperbolic sampling algorithm with a composite movement set is used in the Monte Carlo simulation processes. We exploit this strategy to successfully fold 41/100 small proteins (36 approximately 120 residues) with predicted structures having a RMSD from native below 6.5 A in the top five cluster centroids. To fold larger-size proteins as well as to improve the folding yield of small proteins, we incorporate into the basic force field side-chain contact predictions from our threading program PROSPECTOR where homologous proteins were excluded from the data base. With these threading-based restraints, the program can fold 83/125 test proteins (36 approximately 174 residues) with structures having a RMSD to native below 6.5 A in the top five cluster centroids. This shows the significant improvement of folding by using predicted tertiary restraints, especially when the accuracy of side-chain contact prediction is >20%. For native fold selection, we introduce quantities dependent on the cluster density and the combination of energy and free energy, which show a higher discriminative power to select the native structure than the previously used cluster energy or cluster size, and which can be used in native structure identification in blind simulations. These procedures are readily automated and are being implemented on a genomic scale.

Citing Articles

Homology modeling of Forkhead box protein C2: identification of potential inhibitors using ligand and structure-based virtual screening.

Tarek Ibrahim M, Lee J, Tao P Mol Divers. 2022; 27(4):1661-1674.

PMID: 36048303 PMC: 9975119. DOI: 10.1007/s11030-022-10519-0.


Improving fragment-based ab initio protein structure assembly using low-accuracy contact-map predictions.

Mortuza S, Zheng W, Zhang C, Li Y, Pearce R, Zhang Y Nat Commun. 2021; 12(1):5011.

PMID: 34408149 PMC: 8373938. DOI: 10.1038/s41467-021-25316-w.


Folding non-homologous proteins by coupling deep-learning contact maps with I-TASSER assembly simulations.

Zheng W, Zhang C, Li Y, Pearce R, Bell E, Zhang Y Cell Rep Methods. 2021; 1(3).

PMID: 34355210 PMC: 8336924. DOI: 10.1016/j.crmeth.2021.100014.


Toward the solution of the protein structure prediction problem.

Pearce R, Zhang Y J Biol Chem. 2021; 297(1):100870.

PMID: 34119522 PMC: 8254035. DOI: 10.1016/j.jbc.2021.100870.


Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.

Li Y, Zhang C, Bell E, Zheng W, Zhou X, Yu D PLoS Comput Biol. 2021; 17(3):e1008865.

PMID: 33770072 PMC: 8026059. DOI: 10.1371/journal.pcbi.1008865.


References
1.
Berman H, Westbrook J, Feng Z, Gilliland G, Bhat T, Weissig H . The Protein Data Bank. Nucleic Acids Res. 1999; 28(1):235-42. PMC: 102472. DOI: 10.1093/nar/28.1.235. View

2.
Vendruscolo M, Najmanovich R, Domany E . Can a pairwise contact potential stabilize native protein folds against decoys obtained by threading?. Proteins. 2000; 38(2):134-48. DOI: 10.1002/(sici)1097-0134(20000201)38:2<134::aid-prot3>3.0.co;2-a. View

3.
Panchenko A, BRYANT S . Combination of threading potentials and sequence profiles improves fold recognition. J Mol Biol. 2000; 296(5):1319-31. DOI: 10.1006/jmbi.2000.3541. View

4.
Baker D . A surprising simplicity to protein folding. Nature. 2000; 405(6782):39-42. DOI: 10.1038/35011000. View

5.
Tobi D, Elber R . Distance-dependent, pair potential for protein folding: results from linear optimization. Proteins. 2000; 41(1):40-6. View