Chromosome-level Genome Assembly of the Highly-polymorphic Peacock Blenny (Salaria pavo)

Overview
Journal Sci Data
Specialty Science
Date 2024 Dec 23
PMID 39715741
Abstract

The peacock blenny Salaria pavo is notorious for its extreme male sexual polymorphism, with large males defending nests and younger reproductive males mimicking the appearance and behavior of females to parasitically fertilize eggs. The lack of a reference genome has, to date, limited the understanding of the genetic basis of the species' phenotypic plasticity. Here, we present the first reference genome assembly of the peacock blenny using PacBio HiFi long reads and Hi-C sequencing data. The final assembly of the S. pavo genome spanned 735.90 Mbp, with a contig N50 of 3.69 Mbp and a scaffold N50 of 31.87 Mbp. A total of 98.77% of the assembly was anchored to 24 chromosomes. In total, 24,008 protein-coding genes were annotated, and 99.0% of BUSCO genes were fully represented. Comparative analyses with closely related species showed that 86.2% of these genes were assigned to orthogroups. This high-quality genome of S. pavo will be a valuable resource for future research on this species' reproductive plasticity and evolutionary history.
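
For readers unfamiliar with the contiguity statistics quoted above, the following is a minimal sketch of how an N50 value and an anchored fraction are conventionally computed from a list of sequence lengths. It is not the authors' pipeline: the function names `n50` and `anchored_fraction` and the toy lengths are hypothetical and chosen only for illustration.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total assembly size.
    """
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


def anchored_fraction(chromosome_lengths, assembly_size):
    """Fraction of the assembly placed on chromosome-scale scaffolds."""
    return sum(chromosome_lengths) / assembly_size


# Toy data (hypothetical, not taken from the S. pavo assembly).
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_lengths))                    # 70
print(anchored_fraction([60, 30], 100))    # 0.9
```

Applied to real contig or scaffold lengths (for example, parsed from a FASTA index), the same calculation yields figures of the kind reported in the abstract, such as the contig N50 and the percentage of the assembly anchored to chromosomes.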

Citing Articles

Chromosome-level genome assembly of the highly-polymorphic peacock blenny (Salaria pavo).

Cardoso S, Jiang C, Sun L, Zhang L, Goncalves D. Sci Data. 2024; 11(1):1424.

PMID: 39715741 PMC: 11666546. DOI: 10.1038/s41597-024-04242-8.
