» Articles » PMID: 31362700

Reduced Alphabet of Prebiotic Amino Acids Optimally Encodes the Conformational Space of Diverse Extant Protein Folds

Overview
Journal BMC Evol Biol
Publisher Biomed Central
Specialty Biology
Date 2019 Aug 1
PMID 31362700
Citations 11
Authors
Affiliations
Soon will be listed here.
Abstract

Background: There is wide agreement that only a subset of the twenty standard amino acids existed prebiotically in sufficient concentrations to form functional polypeptides. We ask how this subset, postulated as {A,D,E,G,I,L,P,S,T,V}, could have formed structures stable enough to found metabolic pathways. Inspired by alphabet reduction experiments, we undertook a computational analysis to measure the structural coding behavior of sequences simplified by reduced alphabets. We sought to discern characteristics of the prebiotic set that would endow it with unique properties relevant to structure, stability, and folding.

Results: Drawing on a large dataset of single-domain proteins, we employed an information-theoretic measure to assess how well the prebiotic amino acid set preserves fold information against all other possible ten-amino acid sets. An extensive virtual mutagenesis procedure revealed that the prebiotic set excellently preserves sequence-dependent information regarding both backbone conformation and tertiary contact matrix of proteins. We observed that information retention is fold-class dependent: the prebiotic set sufficiently encodes the structure space of α/β and α + β folds, and to a lesser extent, of all-α and all-β folds. The prebiotic set appeared insufficient to encode the small proteins. Assessing how well the prebiotic set discriminates native vs. incorrect sequence-structure matches, we found that α/β and α + β folds exhibit more pronounced energy gaps with the prebiotic set than with nearly all alternatives.

Conclusions: The prebiotic set optimally encodes local backbone structures that appear in the folded environment and near-optimally encodes the tertiary contact matrix of extant proteins. The fold-class-specific patterns observed from our structural analysis confirm the postulated timeline of fold appearance in proteogenesis derived from proteomic sequence analyses. Polypeptides arising in a prebiotic environment will likely form α/β and α + β-like folds if any at all. We infer that the progressive expansion of the alphabet allowed the increased conformational stability and functional specificity of later folds, including all-α, all-β, and small proteins. Our results suggest that prebiotic sequences are amenable to mutations that significantly lower native conformational energies and increase discrimination amidst incorrect folds. This property may have assisted the genesis of functional proto-enzymes prior to the expansion of the full amino acid alphabet.

Citing Articles

The Genetic Code Assembles via Division and Fusion, Basic Cellular Events.

Yarus M Life (Basel). 2023; 13(10).

PMID: 37895450 PMC: 10608286. DOI: 10.3390/life13102069.


Early Selection of the Amino Acid Alphabet Was Adaptively Shaped by Biophysical Constraints of Foldability.

Makarov M, Sanchez Rocha A, Krystufek R, Cherepashuk I, Dzmitruk V, Charnavets T J Am Chem Soc. 2023; 145(9):5320-5329.

PMID: 36826345 PMC: 10017022. DOI: 10.1021/jacs.2c12987.


Modern and prebiotic amino acids support distinct structural profiles in proteins.

Tretyachenko V, Vymetal J, Neuwirthova T, Vondrasek J, Fujishima K, Hlouchova K Open Biol. 2022; 12(6):220040.

PMID: 35728622 PMC: 9213115. DOI: 10.1098/rsob.220040.


Probing the Role of Cysteine Thiyl Radicals in Biology: Eminently Dangerous, Difficult to Scavenge.

Moosmann B, Hajieva P Antioxidants (Basel). 2022; 11(5).

PMID: 35624747 PMC: 9137623. DOI: 10.3390/antiox11050885.


Determination of the Amino Acid Recruitment Order in Early Life by Genome-Wide Analysis of Amino Acid Usage Bias.

Zhao M, Ding R, Liu Y, Ji Z, Zhao Y Biomolecules. 2022; 12(2).

PMID: 35204672 PMC: 8961565. DOI: 10.3390/biom12020171.


References
1.
Baldwin R, Rose G . Is protein folding hierarchic? I. Local structure and peptide folding. Trends Biochem Sci. 1999; 24(1):26-33. DOI: 10.1016/s0968-0004(98)01346-2. View

2.
Govindarajan S, Recabarren R, Goldstein R . Estimating the total number of protein folds. Proteins. 1999; 35(4):408-14. View

3.
Solis A, Rackovsky S . Optimized representations and maximal information in proteins. Proteins. 2000; 38(2):149-64. View

4.
Wolf Y, Grishin N, Koonin E . Estimating the number of protein folds and families from complete genome data. J Mol Biol. 2000; 299(4):897-905. DOI: 10.1006/jmbi.2000.3786. View

5.
Silverman J, Balakrishnan R, Harbury P . Reverse engineering the (beta/alpha )8 barrel fold. Proc Natl Acad Sci U S A. 2001; 98(6):3092-7. PMC: 30612. DOI: 10.1073/pnas.041613598. View