» Articles » PMID: 37875130

Assembly and Annotation of the Black Spruce Genome Provide Insights on Spruce Phylogeny and Evolution of Stress Response

Abstract

Black spruce (Picea mariana [Mill.] B.S.P.) is a dominant conifer species in the North American boreal forest that plays important ecological and economic roles. Here, we present the first genome assembly of P. mariana with a reconstructed genome size of 18.3 Gbp and NG50 scaffold length of 36.0 kbp. A total of 66,332 protein-coding sequences were predicted in silico and annotated based on sequence homology. We analyzed the evolutionary relationships between P. mariana and 5 other spruces for which complete nuclear and organelle genome sequences were available. The phylogenetic tree estimated from mitochondrial genome sequences agrees with biogeography; specifically, P. mariana was strongly supported as a sister lineage to P. glauca and 3 other taxa found in western North America, followed by the European Picea abies. We obtained mixed topologies with weaker statistical support in phylogenetic trees estimated from nuclear and chloroplast genome sequences, indicative of ancient reticulate evolution affecting these 2 genomes. Clustering of protein-coding sequences from the 6 Picea taxa and 2 Pinus species resulted in 34,776 orthogroups, 560 of which appeared to be specific to P. mariana. Analysis of these specific orthogroups and dN/dS analysis of positive selection signatures for 497 single-copy orthogroups identified gene functions mostly related to plant development and stress response. The P. mariana genome assembly and annotation provides a valuable resource for forest genetics research and applications in this broadly distributed species, especially in relation to climate adaptation.

References
1.
Neale D, Kremer A . Forest tree genomics: growing resources and applications. Nat Rev Genet. 2011; 12(2):111-22. DOI: 10.1038/nrg2931. View

2.
Molhoj M, Jorgensen B, Ulvskov P, Borkhardt B . Two Arabidopsis thaliana genes, KOR2 and KOR3, which encode membrane-anchored endo-1,4-beta-D-glucanases, are differentially expressed in developing leaf trichomes and their support cells. Plant Mol Biol. 2001; 46(3):263-75. DOI: 10.1023/a:1010688726755. View

3.
Pelgas B, Bousquet J, Meirmans P, Ritland K, Isabel N . QTL mapping in white spruce: gene maps and genomic regions underlying adaptive traits across pedigrees, years and environments. BMC Genomics. 2011; 12:145. PMC: 3068112. DOI: 10.1186/1471-2164-12-145. View

4.
Sullivan A, Eldfjell Y, Schiffthaler B, Delhomme N, Asp T, Hebelstrup K . The Mitogenome of Norway Spruce and a Reappraisal of Mitochondrial Recombination in Plants. Genome Biol Evol. 2019; 12(1):3586-3598. PMC: 6944214. DOI: 10.1093/gbe/evz263. View

5.
Niu S, Li J, Bo W, Yang W, Zuccolo A, Giacomello S . The Chinese pine genome and methylome unveil key features of conifer evolution. Cell. 2021; 185(1):204-217.e14. DOI: 10.1016/j.cell.2021.12.006. View