The Common Origin of Symmetry and Structure in Genetic Sequences
Affiliations
Biologists have long sought a way to explain how statistical properties of genetic sequences emerged and are maintained through evolution. On the one hand, non-random structures at different scales indicate a complex genome organisation. On the other hand, single-strand symmetry has been scrutinised using neutral models in which correlations are not considered or irrelevant, contrary to empirical evidence. Different studies investigated these two statistical features separately, reaching minimal consensus despite sustained efforts. Here we unravel previously unknown symmetries in genetic sequences, which are organized hierarchically through scales in which non-random structures are known to be present. These observations are confirmed through the statistical analysis of the human genome and explained through a simple domain model. These results suggest that domain models which account for the cumulative action of mobile elements can explain simultaneously non-random structures and symmetries in genetic sequences.
Pflughaupt P, Sahakyan A Nucleic Acids Res. 2023; 51(14):7409-7423.
PMID: 37293966 PMC: 10415130. DOI: 10.1093/nar/gkad477.
A role for circular code properties in translation.
Giannerini S, Gonzalez D, Goracci G, Danielli A Sci Rep. 2021; 11(1):9218.
PMID: 33911089 PMC: 8080828. DOI: 10.1038/s41598-021-87534-y.
Driven progressive evolution of genome sequence complexity in Cyanobacteria.
Moya A, Oliver J, Verdu M, Delaye L, Arnau V, Bernaola-Galvan P Sci Rep. 2020; 10(1):19073.
PMID: 33149190 PMC: 7643063. DOI: 10.1038/s41598-020-76014-4.
DNA sequence symmetries from randomness: the origin of the Chargaff's second parity rule.
Fariselli P, Taccioli C, Pagani L, Maritan A Brief Bioinform. 2020; 22(2):2172-2181.
PMID: 32266404 PMC: 7986665. DOI: 10.1093/bib/bbaa041.