» Articles » PMID: 24084645

A Conserved Extraordinarily Long Serine Homopolymer in Dictyostelid Amoebae

Overview
Specialty Genetics
Date 2013 Oct 3
PMID 24084645
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

Eukaryotic protein sequences often contain amino-acid homopolymers that consist of a single amino acid repeated from several to dozens of times. Some of these are functional but others may persist largely because of high expansion rates due to DNA slippage. However, very long homopolymers with over a hundred repeats are very rare. We report an extraordinarily long homopolymer consisting of 306 tandem serine repeats from the single-celled eukaryote Dictyostelium discoideum, which also has a multicellular stage. The gene has a paralog with 132 repeats and orthologs, also with high serine repeat numbers, in various other Dictyostelid species. The conserved gene structure and protein sequences suggest that the homopolymer is functional. The high codon diversity and very poor alignment of serine codons in this gene between species similarly indicate functionality. This is because the serine homopolymer is conserved despite much DNA sequence change. A survey of other very long amino-acid homopolymers in eukaryotes shows that high codon diversity is the rule, suggesting that these too may be functional.

Citing Articles

Mutation and selection processes regulating short tandem repeats give rise to genetic and phenotypic diversity across species.

Verbiest M, Maksimov M, Jin Y, Anisimova M, Gymrek M, Bilgin Sonay T J Evol Biol. 2022; 36(2):321-336.

PMID: 36289560 PMC: 9990875. DOI: 10.1111/jeb.14106.


The intrinsic disorder alphabet. III. Dual personality of serine.

Uversky V Intrinsically Disord Proteins. 2017; 3(1):e1027032.

PMID: 28232888 PMC: 5314895. DOI: 10.1080/21690707.2015.1027032.

References
1.
Parikh A, Miranda E, Katoh-Kurasawa M, Fuller D, Rot G, Zagar L . Conserved developmental transcriptomes in evolutionarily divergent species. Genome Biol. 2010; 11(3):R35. PMC: 2864575. DOI: 10.1186/gb-2010-11-3-r35. View

2.
Remm M, Storm C, Sonnhammer E . Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. J Mol Biol. 2001; 314(5):1041-52. DOI: 10.1006/jmbi.2000.5197. View

3.
Blanco E, Parra G, Guigo R . Using geneid to identify genes. Curr Protoc Bioinformatics. 2008; Chapter 4:Unit 4.3. DOI: 10.1002/0471250953.bi0403s18. View

4.
Fondon 3rd J, Hammock E, Hannan A, King D . Simple sequence repeats: genetic modulators of brain function and behavior. Trends Neurosci. 2008; 31(7):328-34. DOI: 10.1016/j.tins.2008.03.006. View

5.
Blom N, Gammeltoft S, Brunak S . Sequence and structure-based prediction of eukaryotic protein phosphorylation sites. J Mol Biol. 1999; 294(5):1351-62. DOI: 10.1006/jmbi.1999.3310. View