» Articles » PMID: 22174733

Hyperdimensional Analysis of Amino Acid Pair Distributions in Proteins

Overview
Journal PLoS One
Date 2011 Dec 17
PMID 22174733
Citations 1
Authors
Affiliations
Soon will be listed here.
Abstract

Our manuscript presents a novel approach to protein structure analyses. We have organized an 8-dimensional data cube with protein 3D-structural information from 8706 high-resolution non-redundant protein-chains with the aim of identifying packing rules at the amino acid pair level. The cube contains information about amino acid type, solvent accessibility, spatial and sequence distance, secondary structure and sequence length. We are able to pose structural queries to the data cube using program ProPack. The response is a 1, 2 or 3D graph. Whereas the response is of a statistical nature, the user can obtain an instant list of all PDB-structures where such pair is found. The user may select a particular structure, which is displayed highlighting the pair in question. The user may pose millions of different queries and for each one he will receive the answer in a few seconds. In order to demonstrate the capabilities of the data cube as well as the programs, we have selected well known structural features, disulphide bridges and salt bridges, where we illustrate how the queries are posed, and how answers are given. Motifs involving cysteines such as disulphide bridges, zinc-fingers and iron-sulfur clusters are clearly identified and differentiated. ProPack also reveals that whereas pairs of Lys residues virtually never appear in close spatial proximity, pairs of Arg are abundant and appear at close spatial distance, contrasting the belief that electrostatic repulsion would prevent this juxtaposition and that Arg-Lys is perceived as a conservative mutation. The presented programs can find and visualize novel packing preferences in proteins structures allowing the user to unravel correlations between pairs of amino acids. The new tools allow the user to view statistical information and visualize instantly the structures that underpin the statistical information, which is far from trivial with most other SW tools for protein structure analysis.

Citing Articles

Scale-free behaviour of amino acid pair interactions in folded proteins.

Petersen S, Neves-Petersen M, Henriksen S, Mortensen R, Geertz-Hansen H PLoS One. 2012; 7(7):e41322.

PMID: 22848462 PMC: 3406053. DOI: 10.1371/journal.pone.0041322.

References
1.
Lenffer J, Lai P, El Mejaber W, Khan A, Koh J, Tan P . CysView: protein classification based on cysteine pairing patterns. Nucleic Acids Res. 2004; 32(Web Server issue):W350-5. PMC: 441613. DOI: 10.1093/nar/gkh475. View

2.
Kass I, Horovitz A . Mapping pathways of allosteric communication in GroEL by analysis of correlated mutations. Proteins. 2002; 48(4):611-7. DOI: 10.1002/prot.10180. View

3.
Halperin I, Wolfson H, Nussinov R . Correlated mutations: advances and limitations. A study on fusion proteins and on the Cohesin-Dockerin families. Proteins. 2006; 63(4):832-45. DOI: 10.1002/prot.20933. View

4.
Richardson J . The anatomy and taxonomy of protein structure. Adv Protein Chem. 1981; 34:167-339. DOI: 10.1016/s0065-3233(08)60520-3. View

5.
Sander C, Schneider R . Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins. 1991; 9(1):56-68. DOI: 10.1002/prot.340090107. View