» Articles » PMID: 11835484

CASP2 Knowledge-based Approach to Distant Homology Recognition and Fold Prediction in CASP4

Overview
Journal Proteins
Date 2002 Feb 9
PMID 11835484
Citations 8
Authors
Affiliations
Soon will be listed here.
Abstract

In 1996, in CASP2, we presented a semimanual approach to the prediction of protein structure that was aimed at the recognition of probable distant homology, where it existed, between a given target protein and a protein of known structure (Murzin and Bateman, Proteins 1997; Suppl 1:105-112). Central to our method was the knowledge of all known structural and probable evolutionary relationships among proteins of known structure classified in the SCOP database (Murzin et al., J Mol Biol 1995;247:536-540). It was demonstrated that a knowledge-based approach could compete successfully with the best computational methods of the time in the correct recognition of the target protein fold. Four years later, in CASP4, we have applied essentially the same knowledge-based approach to distant homology recognition, concentrating our effort on the improvement of the completeness and alignment accuracy of our models. The manifold increase of available sequence and structure data was to our advantage, as well as was the experience and expertise obtained through the classification of these data. In particular, we were able to model most of our predictions from several distantly related structures rather than from a single parent structure, and we could use more superfamily characteristic features for the refinement of our alignments. Our predictions for each of the attempted distant homology recognition targets ranked among the few top predictions for each of these targets, with the predictions for the hypothetical protein HI0065 (T0104) and the C-terminal domain of the ABC transporter MalK (T0121C) being particularly successful. We also have attempted the prediction of protein folds of some of the targets tentatively assigned to new superfamilies. The average quality of our fold predictions was far less than the quality of our distant homology recognition models, but for the two targets, chorismate lyase (T0086) and Appr>p cyclic phosphodiesterase (T0094), our predictions achieved the top ranking.

Citing Articles

OMPcontact: An Outer Membrane Protein Inter-Barrel Residue Contact Prediction Method.

Zhang L, Wang H, Yan L, Su L, Xu D J Comput Biol. 2016; 24(3):217-228.

PMID: 27513917 PMC: 5346958. DOI: 10.1089/cmb.2015.0236.


Transmembrane protein alignment and fold recognition based on predicted topology.

Wang H, He Z, Zhang C, Zhang L, Xu D PLoS One. 2013; 8(7):e69744.

PMID: 23894534 PMC: 3716705. DOI: 10.1371/journal.pone.0069744.


I-TASSER: fully automated protein structure prediction in CASP8.

Zhang Y Proteins. 2009; 77 Suppl 9:100-13.

PMID: 19768687 PMC: 2782770. DOI: 10.1002/prot.22588.


Fishing with (Proto)Net-a principled approach to protein target selection.

Linial M Comp Funct Genomics. 2008; 4(5):542-8.

PMID: 18629007 PMC: 2447289. DOI: 10.1002/cfg.328.


I-TASSER server for protein 3D structure prediction.

Zhang Y BMC Bioinformatics. 2008; 9:40.

PMID: 18215316 PMC: 2245901. DOI: 10.1186/1471-2105-9-40.