» Articles » PMID: 15883880

Phylogenetic Differences in Content and Intensity of Periodic Proteins

Overview
Journal J Mol Evol
Specialty Biochemistry
Date 2005 May 11
PMID 15883880
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

Many proteins exhibit sequence periodicity, often correlated with a visible structural periodicity. The statistical significance of such periodicity can be assessed by means of a chi-squared-based test, with significance thresholds being calculated from shuffled sequences. Comparison of the complete proteomes of 45 species reveals striking differences in the proportion of periodic proteins and the intensity of the most significant periodicities. Eukaryotes tend to have a higher proportion of periodic proteins than eubacteria, which in turn tend to have more than archaea. The intensity of periodicity in the most periodic proteins is also greatest in eukaryotes. By contrast, the relatively small group of periodic proteins in archaea also tend to be weakly periodic compared to those of eukaryotes and eubacteria. Exceptions to this general rule are found in those prokaryotes with multicellular life-cycle phases, e.g., Methanosarcina sp., or Anabaena sp., which have more periodicities than prokaryotes in general, and in unicellular eukaryotes, which have fewer than multicellular eukaryotes. The distribution of significantly periodic proteins in eukaryotes is over a wide range of period lengths, whereas prokaryotic proteins typically have a more limited set of period lengths. This is further investigated by repeating the analysis on the NRL-3D database of proteins of solved structure. Some short-range periodicities are explicable in terms of basic secondary structure, e.g., alpha helices, while middle-range periodicities are frequently found to consist of known short Pfam domains, e.g., leucine-rich repeats, tetratricopeptides or armadillo domains. However, not all can be explained in this way.

Citing Articles

Expansion of tandem repeats in sea anemone Nematostella vectensis proteome: A source for gene novelty?.

Naamati G, Fromer M, Linial M BMC Genomics. 2009; 10:593.

PMID: 20003297 PMC: 2805694. DOI: 10.1186/1471-2164-10-593.


XSTREAM: a practical algorithm for identification and architecture modeling of tandem repeats in protein sequences.

Newman A, Cooper J BMC Bioinformatics. 2007; 8:382.

PMID: 17931424 PMC: 2233649. DOI: 10.1186/1471-2105-8-382.

References
1.
Ohno S . Repeats of base oligomers as the primordial coding sequences of the primeval earth and their vestiges in modern genes. J Mol Evol. 1984; 20(3-4):313-21. DOI: 10.1007/BF02104737. View

2.
Trifonov E . Segmented structure of protein sequences and early evolution of genome by combinatorial fusion of DNA elements. J Mol Evol. 1995; 40(3):337-42. DOI: 10.1007/BF00163239. View

3.
Coward E, Drablos F . Detecting periodic patterns in biological sequences. Bioinformatics. 1998; 14(6):498-507. DOI: 10.1093/bioinformatics/14.6.498. View

4.
Eisenberg D, Weiss R, Terwilliger T . The hydrophobic moment detects periodicity in protein hydrophobicity. Proc Natl Acad Sci U S A. 1984; 81(1):140-4. PMC: 344626. DOI: 10.1073/pnas.81.1.140. View

5.
White S, Jacobs R . The evolution of proteins from random amino acid sequences. I. Evidence from the lengthwise distribution of amino acids in modern protein sequences. J Mol Evol. 1993; 36(1):79-95. DOI: 10.1007/BF02407307. View