» Articles » PMID: 14562959

Analysis of Sequence Periodicity in E. Coli Proteins: Empirical Investigation of the "duplication and Divergence" Theory of Protein Evolution

Overview
Journal J Mol Evol
Specialty Biochemistry
Date 2003 Oct 18
PMID 14562959
Citations 3
Authors
Affiliations
Soon will be listed here.
Abstract

Periodicity was quantified in 4289 Escherichia coli K12 confirmed and putative protein sequences, using a simple chi-square technique previously shown to reveal triplet period periodicity in coding DNA. Periodicities were calculated from period n = 2 to period n = 50 in nine different alphabetic representations of the proteins. By comparison with a randomly generated proteome of the same compositional content, the E. coli proteome does not contain a significant excess of periodic proteins. However, 60 proteins do appear to be significantly periodic in at least one alphabetic representation, after Bonferroni correction, at p < 0.01, and 30 at p < 0.001. These are compared with significantly periodic proteins of solved three-dimensional structure, detected by an identical analysis of the sequences from a protein structure database. It is concluded that there is no evidence for the presence of a proteome-wide quasi-periodicity as predicted by the "duplication and divergence" model of protein evolution and that the major periodicity detected is a consequence of the repetitive tendencies within alpha-helices. However, it is not possible to explain all sequence periodicities in terms of observable secondary structure, as in cases where sequence periodicity can be compared to solved structure, there is often no structural regularity that would provide an obvious explanation in terms of natural selection on protein function.

Citing Articles

Profile-statistical periodicity of DNA coding regions.

Chaley M, Kutyrkin V DNA Res. 2011; 18(5):353-62.

PMID: 21788253 PMC: 3190956. DOI: 10.1093/dnares/dsr023.


Proteome-wide prediction of novel DNA/RNA-binding proteins using amino acid composition and periodicity in the hyperthermophilic archaeon Pyrococcus furiosus.

Fujishima K, Komasa M, Kitamura S, Suzuki H, Tomita M, Kanai A DNA Res. 2007; 14(3):91-102.

PMID: 17573465 PMC: 2779898. DOI: 10.1093/dnares/dsm011.


Phylogenetic differences in content and intensity of periodic proteins.

Gatherer D, McEwan N J Mol Evol. 2005; 60(4):447-61.

PMID: 15883880 DOI: 10.1007/s00239-004-0189-2.

References
1.
Ohno S . Repeats of base oligomers as the primordial coding sequences of the primeval earth and their vestiges in modern genes. J Mol Evol. 1984; 20(3-4):313-21. DOI: 10.1007/BF02104737. View

2.
Eisenberg D, Weiss R, Terwilliger T . The hydrophobic moment detects periodicity in protein hydrophobicity. Proc Natl Acad Sci U S A. 1984; 81(1):140-4. PMC: 344626. DOI: 10.1073/pnas.81.1.140. View

3.
Stanfel L . A new approach to clustering the amino acids. J Theor Biol. 1996; 183(2):195-205. DOI: 10.1006/jtbi.1996.0213. View

4.
Berman H, Westbrook J, Feng Z, Gilliland G, Bhat T, Weissig H . The Protein Data Bank. Nucleic Acids Res. 1999; 28(1):235-42. PMC: 102472. DOI: 10.1093/nar/28.1.235. View

5.
Zhurkin V . Periodicity in DNA primary structure is defined by secondary structure of the coded protein. Nucleic Acids Res. 1981; 9(8):1963-71. PMC: 326816. DOI: 10.1093/nar/9.8.1963. View