» Articles » PMID: 17914232

Protein Local Conformations Arise from a Mixture of Gaussian Distributions

Overview
Journal J Biosci
Specialties Biochemistry
Biology
Date 2007 Oct 5
PMID 17914232
Citations 1
Authors
Affiliations
Soon will be listed here.
Abstract

The classical approaches for protein structure prediction rely either on homology of the protein sequence with a template structure or on ab initio calculations for energy minimization. These methods suffer from disadvantages such as the lack of availability of homologous template structures or intractably large conformational search space, respectively. The recently proposed fragment library based approaches first predict the local structures,which can be used in conjunction with the classical approaches of protein structure prediction. The accuracy of the predictions is dependent on the quality of the fragment library. In this work, we have constructed a library of local conformation classes purely based on geometric similarity. The local conformations are represented using Geometric Invariants, properties that remain unchanged under transformations such as translation and rotation, followed by dimension reduction via principal component analysis. The local conformations are then modeled as a mixture of Gaussian probability distribution functions (PDF). Each one of the Gaussian PDF's corresponds to a conformational class with the centroid representing the average structure of that class. We find 46 classes when we use an octapeptide as a unit of local conformation. The protein 3-D structure can now be described as a sequence of local conformational classes. Further, it was of interest to see whether the local conformations can be predicted from the amino acid sequences. To that end,we have analyzed the correlation between sequence features and the conformational classes.

Citing Articles

Protein sequence and structure alignments within one framework.

Schenk G, Margraf T, Torda A Algorithms Mol Biol. 2008; 3:4.

PMID: 18380904 PMC: 2390564. DOI: 10.1186/1748-7188-3-4.

References
1.
Brenner S, Koehl P, Levitt M . The ASTRAL compendium for protein structure and sequence analysis. Nucleic Acids Res. 1999; 28(1):254-6. PMC: 102434. DOI: 10.1093/nar/28.1.254. View

2.
Tendulkar A, Sohoni M, Ogunnaike B, Wangikar P . A geometric invariant-based framework for the analysis of protein conformational space. Bioinformatics. 2005; 21(18):3622-8. DOI: 10.1093/bioinformatics/bti621. View

3.
Terstappen G, Reggiani A . In silico research in drug discovery. Trends Pharmacol Sci. 2001; 22(1):23-6. DOI: 10.1016/s0165-6147(00)01584-4. View

4.
Oldfield T, Hubbard R . Analysis of C alpha geometry in protein structures. Proteins. 1994; 18(4):324-37. DOI: 10.1002/prot.340180404. View

5.
Richardson J . The anatomy and taxonomy of protein structure. Adv Protein Chem. 1981; 34:167-339. DOI: 10.1016/s0065-3233(08)60520-3. View