Dipeptide Frequencies in Proteins and the CpG Deficiency in Vertebrate DNA
Overview
Affiliations
Analysis of vertebrate protein sequences totalling 4040 residues shows that amino acids with a high proportion of codons ending in C occur with significantly reduced frequency before amino acids whose codons start with G. This effect is not shown by "control" bacterial protein sequences. The consequent implication of shortage of XXC. GXX codon pairs in vertebrate messenger RNA is discussed in relation to the extreme rarity of the base doublet CpG in vertebrate DNA.
Blaisdell B J Mol Evol. 1983; 19(3-4):226-36.
PMID: 6887265 DOI: 10.1007/BF02099970.
CpG frequency in large DNA segments.
Lennon G, Fraser N J Mol Evol. 1983; 19(3-4):286-8.
PMID: 6577204 DOI: 10.1007/BF02099976.
The distribution of the dinucleotide CpG and cytosine methylation in the vitellogenin gene family.
Cooper D, Nardelli D, Schubiger J, Wahli W J Mol Evol. 1987; 25(2):107-15.
PMID: 3116270 DOI: 10.1007/BF02101752.