» Articles » PMID: 37141209

The Effect of the Genomic GC Content Bias of Prokaryotic Organisms on the Secondary Structures of Their Proteins

Overview
Journal PLoS One
Date 2023 May 4
PMID 37141209
Authors
Affiliations
Soon will be listed here.
Abstract

One of the main characteristics of prokaryotic genomes is the ratio in which guanine-cytosine bases are used in their DNA sequences. This is known as the genomic GC content and varies widely, from values below 20% to values greater than 74%. It has been demonstrated that the genomic GC content varies in accordance with the phylogenetic distribution of organisms and influences the amino acid composition of their corresponding proteomes. This bias is particularly important for amino acids that are coded by GC content-rich codons such as alanine, glycine, and proline, as well as amino acids that are coded by AT-rich codons, such as lysine, asparagine, and isoleucine. In our study, we extend these results by considering the effect of the genomic GC content on the secondary structure of proteins. On a set of 192 representative prokaryotic genomes and proteome sequences, we identified through a bioinformatic study that the composition of the secondary structures of the proteomes varies in relation to the genomic GC content; random coils increase as the genomic GC content increases, while alpha-helices and beta-sheets present an inverse relationship. In addition, we found that the tendency of an amino acid to form part of a secondary structure of proteins is not ubiquitous, as previously expected, but varies according to the genomic GC content. Finally, we discovered that for some specific groups of orthologous proteins, the GC content of genes biases the composition of secondary structures of the proteins for which they code.

References
1.
Tatusov R, Fedorova N, Jackson J, Jacobs A, Kiryutin B, Koonin E . The COG database: an updated version includes eukaryotes. BMC Bioinformatics. 2003; 4:41. PMC: 222959. DOI: 10.1186/1471-2105-4-41. View

2.
Levitt M . Conformational preferences of amino acids in globular proteins. Biochemistry. 1978; 17(20):4277-85. DOI: 10.1021/bi00613a026. View

3.
Chen W, Shao Y, Chen F . Evolution of complete proteomes: guanine-cytosine pressure, phylogeny and environmental influences blend the proteomic architecture. BMC Evol Biol. 2013; 13:219. PMC: 3850711. DOI: 10.1186/1471-2148-13-219. View

4.
Yan R, Xu D, Yang J, Walker S, Zhang Y . A comparative assessment and analysis of 20 representative sequence alignment methods for protein structure prediction. Sci Rep. 2013; 3:2619. PMC: 3965362. DOI: 10.1038/srep02619. View

5.
Bernardi G . Codon usage and genome composition. J Mol Evol. 1985; 22(4):363-5. DOI: 10.1007/BF02115693. View