
Improvement of the Threespine Stickleback Genome Using a Hi-C-Based Proximity-Guided Assembly

Overview
Journal: J Hered
Specialty: Genetics
Date: 2017 Aug 20
PMID: 28821183
Citations: 34
Abstract

Scaffolding genomes into complete chromosome assemblies remains challenging even with the rapidly increasing sequence coverage generated by current next-generation sequencing technologies. Even with scaffolding information, many genome assemblies remain incomplete. The genome of the threespine stickleback (Gasterosteus aculeatus), a fish model system in evolutionary genetics and genomics, is not completely assembled despite scaffolding with high-density linkage maps. Here, we first test the ability of a Hi-C-based proximity-guided assembly (PGA) to perform a de novo genome assembly from relatively short contigs. Using Hi-C-based PGA, we generated complete chromosome assemblies from a distribution of short contigs (20-100 kb). We found that 96.40% of contigs were correctly assigned to linkage groups (LGs), with ordering nearly identical to the previous genome assembly. Using available bacterial artificial chromosome (BAC) end sequences, we provide evidence that some of the few discrepancies between the Hi-C assembly and the existing assembly are due to structural variation between the populations used for the two assemblies or to errors in the existing assembly. This Hi-C assembly also allowed us to improve the existing assembly, assigning over 60% (13.35 Mb) of the previously unassigned (~21.7 Mb) contigs to LGs. Together, our results highlight the potential of the Hi-C-based PGA method to be used in combination with short-read data to perform relatively inexpensive de novo genome assemblies. This approach will be particularly useful in organisms in which it is difficult to perform linkage mapping or to obtain the high-molecular-weight DNA required for other scaffolding methods.
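As a quick check on the figures quoted in the abstract, the minimal Python sketch below recomputes the share of previously unassigned sequence that the Hi-C assembly placed on linkage groups. The function and variable names are illustrative assumptions and are not part of the published work. Only the two megabase values come from the abstract, and the ~21.7 Mb total is itself approximate.

```python
def assigned_fraction(assigned_mb: float, unassigned_total_mb: float) -> float:
    """Fraction of previously unassigned sequence that is now placed on LGs."""
    if unassigned_total_mb <= 0:
        raise ValueError("total unassigned sequence must be positive")
    return assigned_mb / unassigned_total_mb


# Values quoted in the abstract (the total is given as approximately 21.7 Mb).
NEWLY_ASSIGNED_MB = 13.35
PREVIOUSLY_UNASSIGNED_MB = 21.7

fraction = assigned_fraction(NEWLY_ASSIGNED_MB, PREVIOUSLY_UNASSIGNED_MB)
remaining_mb = PREVIOUSLY_UNASSIGNED_MB - NEWLY_ASSIGNED_MB

# Report the assigned share and the sequence that is still unplaced.
print(f"{fraction:.1%} assigned, about {remaining_mb:.2f} Mb still unplaced")
```

The result, roughly 61.5%, is consistent with the abstract's "over 60%" and leaves about 8.35 Mb of sequence unassigned under these rounded inputs.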

Citing Articles

A pangenome reveals LTR repeat dynamics as a major driver of genome evolution in Chenopodium.

Jaggi K, Krak K, Storchova H, Mandak B, Marcheschi A, Belyayev A Plant Genome. 2025; 18(1):e70010.

PMID: 40018873 PMC: 11869160. DOI: 10.1002/tpg2.70010.


Genome Sequence of a Marine Threespine Stickleback (Gasterosteus aculeatus) from Rabbit Slough in the Cook Inlet.

Au E, Weaver S, Katikaneni A, Wucherpfennig J, Luo Y, Mangan R bioRxiv. 2025.

PMID: 39975098 PMC: 11839064. DOI: 10.1101/2025.02.06.636934.


Genomes of Aegilops umbellulata provide new insights into unique structural variations and genetic diversity in the U-genome for wheat improvement.

Singh J, Gudi S, Maughan P, Liu Z, Kolmer J, Wang M Plant Biotechnol J. 2024; 22(12):3505-3519.

PMID: 39292731 PMC: 11606429. DOI: 10.1111/pbi.14470.


A chromosome-level reference genome of the Antarctic blackfin icefish Chaenocephalus aceratus.

Lee S, Kim J, Choi E, Jo E, Cho M, Kim J Sci Data. 2023; 10(1):657.

PMID: 37752129 PMC: 10522714. DOI: 10.1038/s41597-023-02561-w.


Single-Cell RNA Sequencing Reveals Microevolution of the Stickleback Immune System.

Fuess L, Bolnick D Genome Biol Evol. 2023; 15(4).

PMID: 37039516 PMC: 10116603. DOI: 10.1093/gbe/evad053.

