PMID: 24653211

Unique Features of the Loblolly Pine (Pinus taeda L.) Megagenome Revealed Through Sequence Annotation

Abstract

The largest genus in the conifer family Pinaceae is Pinus, with over 100 species. The size and complexity of pine genomes (∼20-40 Gb, 2n = 24) have delayed the arrival of a well-annotated reference sequence. In this study, we present the annotation of the first whole-genome shotgun assembly of loblolly pine (Pinus taeda L.), which comprises 20.1 Gb of sequence. The MAKER-P annotation pipeline combined evidence-based alignments and ab initio predictions to generate 50,172 gene models, of which 15,653 are classified as high confidence. Clustering these gene models with those of 13 other plant species resulted in 20,646 gene families, of which 1,554 are predicted to be unique to conifers. Among the conifer gene families, 159 are composed exclusively of loblolly pine members. The gene models for loblolly pine have the highest median and mean intron lengths of 24 fully sequenced plant genomes. Conifer genomes are rich in repetitive DNA, with the most significant contributions from long-terminal-repeat retrotransposons. In-depth analysis of the tandem and interspersed repetitive content yielded a combined repeat-content estimate of 82%.

Citing Articles

Genetic Architecture Underlying Response to the Fungal Pathogen in Lodgepole Pine, Jack Pine, and Their Hybrids.

Lu M, Feau N, Lind B, Obreht Vidakovic D, Singh P, Aitken S. Evol Appl. 2025; 18(2):e70078.

PMID: 39925618 PMC: 11802335. DOI: 10.1111/eva.70078.


Optimising Exome Captures in Species With Large Genomes Using Species-Specific Repetitive DNA Blocker.

Kesalahti R, Kumpula T, Cervantes S, Kujala S, Mattila T, Tyrmi J. Mol Ecol Resour. 2024; 25(3):e14053.

PMID: 39692189 PMC: 11887611. DOI: 10.1111/1755-0998.14053.


Current status and trends in forest genomics.

Borthakur D, Busov V, Cao X, Du Q, Gailing O, Isik F. For Res (Fayettev). 2024; 2:11.

PMID: 39525413 PMC: 11524260. DOI: 10.48130/FR-2022-0011.


Colonization of root endophytic fungus improves drought tolerance of seedlings by regulating metabolome and proteome.

Wu C, Yang Y, Wang Y, Zhang W, Sun H. Front Microbiol. 2024; 15:1294833.

PMID: 38559354 PMC: 10978793. DOI: 10.3389/fmicb.2024.1294833.


Long-insert sequence capture detects high copy numbers in a defence-related beta-glucosidase gene βglu-1 with large variations in white spruce but not Norway spruce.

Hung T, Wu E, Zeltins P, Jansons A, Ullah A, Erbilgin N. BMC Genomics. 2024; 25(1):118.

PMID: 38281030 PMC: 10821269. DOI: 10.1186/s12864-024-09978-6.

