» Articles » PMID: 30361485

The Common Origin of Symmetry and Structure in Genetic Sequences

Overview
Journal Sci Rep
Specialty Science
Date 2018 Oct 27
PMID 30361485
Citations 4
Authors
Affiliations
Soon will be listed here.
Abstract

Biologists have long sought a way to explain how statistical properties of genetic sequences emerged and are maintained through evolution. On the one hand, non-random structures at different scales indicate a complex genome organisation. On the other hand, single-strand symmetry has been scrutinised using neutral models in which correlations are not considered or irrelevant, contrary to empirical evidence. Different studies investigated these two statistical features separately, reaching minimal consensus despite sustained efforts. Here we unravel previously unknown symmetries in genetic sequences, which are organized hierarchically through scales in which non-random structures are known to be present. These observations are confirmed through the statistical analysis of the human genome and explained through a simple domain model. These results suggest that domain models which account for the cumulative action of mobile elements can explain simultaneously non-random structures and symmetries in genetic sequences.

Citing Articles

Generalised interrelations among mutation rates drive the genomic compliance of Chargaff's second parity rule.

Pflughaupt P, Sahakyan A Nucleic Acids Res. 2023; 51(14):7409-7423.

PMID: 37293966 PMC: 10415130. DOI: 10.1093/nar/gkad477.


A role for circular code properties in translation.

Giannerini S, Gonzalez D, Goracci G, Danielli A Sci Rep. 2021; 11(1):9218.

PMID: 33911089 PMC: 8080828. DOI: 10.1038/s41598-021-87534-y.


Driven progressive evolution of genome sequence complexity in Cyanobacteria.

Moya A, Oliver J, Verdu M, Delaye L, Arnau V, Bernaola-Galvan P Sci Rep. 2020; 10(1):19073.

PMID: 33149190 PMC: 7643063. DOI: 10.1038/s41598-020-76014-4.


DNA sequence symmetries from randomness: the origin of the Chargaff's second parity rule.

Fariselli P, Taccioli C, Pagani L, Maritan A Brief Bioinform. 2020; 22(2):2172-2181.

PMID: 32266404 PMC: 7986665. DOI: 10.1093/bib/bbaa041.

References
1.
Chargaff E . Structure and function of nucleic acids as cell constituents. Fed Proc. 1951; 10(3):654-9. View

2.
Mitchell D, Bridge R . A test of Chargaff's second rule. Biochem Biophys Res Commun. 2005; 340(1):90-4. DOI: 10.1016/j.bbrc.2005.11.160. View

3.
Bogachev M, Kayumov A, Bunde A . Universal internucleotide statistics in full genomes: a footprint of the DNA structure and packaging?. PLoS One. 2014; 9(12):e112534. PMC: 4249851. DOI: 10.1371/journal.pone.0112534. View

4.
Tavares A, Pinho A, M Silva R, Rodrigues J, Bastos C, Ferreira P . DNA word analysis based on the distribution of the distances between symmetric words. Sci Rep. 2017; 7(1):728. PMC: 5428789. DOI: 10.1038/s41598-017-00646-2. View

5.
Peng C, Buldyrev S, Goldberger A, Havlin S, Sciortino F, Simons M . Long-range correlations in nucleotide sequences. Nature. 1992; 356(6365):168-70. DOI: 10.1038/356168a0. View