» Articles » PMID: 38002978

A High-Quality Reference Genome Assembly of (Rosaceae)

Overview
Journal Genes (Basel)
Publisher MDPI
Date 2023 Nov 25
PMID 38002978
Authors
Affiliations
Soon will be listed here.
Abstract

This study introduces a meticulously constructed genome assembly at the chromosome level for the Rosaceae family species , a traditional Chinese medicinal herb. The final assembly encompasses 1272.71 megabases (Mb) distributed across 16 pseudochromosomes, boasting contig and super-scaffold N50 values of 2.77 and 79.32 Mb, respectively. Annotated within this genome is a substantial 875.99 Mb of repetitive sequences, with transposable elements occupying 777.28 Mb, constituting 61.07% of the entire genome. Our predictive efforts identified 49,261 protein-coding genes within the repeat-masked assembly, with 45,256 (91.87%) having functional annotations, 5127 (10.41%) demonstrating tandem duplication, and 2373 (4.82%) classified as transcription factor genes. Additionally, our investigation unveiled 3080 non-coding RNAs spanning 0.51 Mb of the genome sequences. According to our evolutionary study, underwent recent whole-genome duplication following its separation from . The presented reference-level genome assembly and annotation for will significantly facilitate the in-depth exploration of genomic information pertaining to this species, offering substantial utility in comparative genomics and evolutionary analyses involving Rosaceae species.

References
1.
Wang M, Zhang L, Tong S, Jiang D, Fu Z . Chromosome-level genome assembly of a xerophytic plant, Haloxylon ammodendron. DNA Res. 2022; 29(2). PMC: 8946665. DOI: 10.1093/dnares/dsac006. View

2.
Simao F, Waterhouse R, Ioannidis P, Kriventseva E, Zdobnov E . BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015; 31(19):3210-2. DOI: 10.1093/bioinformatics/btv351. View

3.
Ellinghaus D, Kurtz S, Willhoeft U . LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinformatics. 2008; 9:18. PMC: 2253517. DOI: 10.1186/1471-2105-9-18. View

4.
Louwers M, Splinter E, van Driel R, de Laat W, Stam M . Studying physical chromatin interactions in plants using Chromosome Conformation Capture (3C). Nat Protoc. 2009; 4(8):1216-29. DOI: 10.1038/nprot.2009.113. View

5.
Bairoch A, Apweiler R . The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res. 1999; 28(1):45-8. PMC: 102476. DOI: 10.1093/nar/28.1.45. View