Evolution of Genome Size and Complexity in Pinus
Overview
Authors
Affiliations
Background: Genome evolution in the gymnosperm lineage of seed plants has given rise to many of the most complex and largest plant genomes, however the elements involved are poorly understood.
Methodology/principal Findings: Gymny is a previously undescribed retrotransposon family in Pinus that is related to Athila elements in Arabidopsis. Gymny elements are dispersed throughout the modern Pinus genome and occupy a physical space at least the size of the Arabidopsis thaliana genome. In contrast to previously described retroelements in Pinus, the Gymny family was amplified or introduced after the divergence of pine and spruce (Picea). If retrotransposon expansions are responsible for genome size differences within the Pinaceae, as they are in angiosperms, then they have yet to be identified. In contrast, molecular divergence of Gymny retrotransposons together with other families of retrotransposons can account for the large genome complexity of pines along with protein-coding genic DNA, as revealed by massively parallel DNA sequence analysis of Cot fractionated genomic DNA.
Conclusions/significance: Most of the enormous genome complexity of pines can be explained by divergence of retrotransposons, however the elements responsible for genome size variation are yet to be identified. Genomic resources for Pinus including those reported here should assist in further defining whether and how the roles of retrotransposons differ in the evolution of angiosperm and gymnosperm genomes.
gymnotoa-db: a database and application to optimize functional annotation in gymnosperms.
Mora-Marquez F, Hurtado M, Lopez de Heredia U Database (Oxford). 2025; 2025.
PMID: 40052362 PMC: 11886576. DOI: 10.1093/database/baaf019.
Exploring Taxonomic and Genetic Relationships in the Complex Using Genome Skimming Data.
Sikora J, Celinski K Int J Mol Sci. 2024; 25(18).
PMID: 39337663 PMC: 11432513. DOI: 10.3390/ijms251810178.
From tradition to innovation: conventional and deep learning frameworks in genome annotation.
Chen Z, Ain N, Zhao Q, Zhang X Brief Bioinform. 2024; 25(3).
PMID: 38581418 PMC: 10998533. DOI: 10.1093/bib/bbae138.
LocoGSE, a sequence-based genome size estimator for plants.
Guenzi-Tiberi P, Istace B, Alsos I, Coissac E, Lavergne S, Aury J Front Plant Sci. 2024; 15:1328966.
PMID: 38550287 PMC: 10972871. DOI: 10.3389/fpls.2024.1328966.
Comprehensive Organ-Specific Profiling of Douglas Fir () Proteome.
Teyssier C, Rogier O, Claverol S, Gautier F, Lelu-Walter M, Durufle H Biomolecules. 2023; 13(9).
PMID: 37759800 PMC: 10526743. DOI: 10.3390/biom13091400.