Finding Nemo: Hybrid Assembly with Oxford Nanopore and Illumina Reads Greatly Improves the Clownfish (Amphiprion Ocellaris) Genome Assembly

Overview
Journal: GigaScience
Specialties: Biology, Genetics
Date: 2018 Jan 18
PMID: 29342277
Citations: 49
Abstract

Background: Some of the most widely recognized coral reef fishes are clownfish or anemonefish, members of the family Pomacentridae (subfamily: Amphiprioninae). They are popular aquarium species due to their bright colors, adaptability to captivity, and fascinating behavior. Their breeding biology (sequential hermaphroditism) and symbiotic mutualism with sea anemones have attracted much scientific interest. Moreover, some curious geographically based phenotypes warrant investigation. Leveraging advances in Nanopore long-read technology, we report the first hybrid assembly of the clown anemonefish (Amphiprion ocellaris) genome using Illumina and Nanopore reads, further demonstrating the substantial impact of modest long-read sequencing data sets on genome assembly statistics.

Results: We generated 43 Gb of short Illumina reads and 9 Gb of long Nanopore reads, representing approximate genome coverage of 54× and 11×, respectively, based on the range of estimated k-mer-predicted genome sizes of between 791 and 967 Mbp. The final assembled genome is contained in 6404 scaffolds with an accumulated length of 880 Mb (96.3% BUSCO-calculated genome completeness). Compared with the Illumina-only assembly, the hybrid approach generated 94% fewer scaffolds with an 18-fold increase in N50 length (401 kb) and increased the genome completeness by an additional 16%. A total of 27 240 high-quality protein-coding genes were predicted from the clown anemonefish, 26 211 (96%) of which were annotated functionally with information from either sequence homology or protein signature searches.
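The coverage and N50 figures quoted in the Results can be illustrated with a short calculation. The sketch below is not the authors' pipeline; it assumes coverage is simply total sequenced bases divided by genome size, and it uses the standard definition of N50 (the length of the scaffold at which the cumulative sum of lengths, sorted longest first, reaches half of the assembly). The function names and the toy scaffold lengths are hypothetical and chosen only for illustration. Under these assumptions, 43 Gb and 9 Gb of reads give roughly 54× and 11× at the lower genome size estimate (791 Mbp), and roughly 44× and 9× at the upper estimate (967 Mbp), so the reported values appear to correspond to the lower bound.

```python
from typing import Iterable


def coverage(total_bases: float, genome_size: float) -> float:
    """Approximate sequencing depth: total sequenced bases / genome size."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return total_bases / genome_size


def n50(lengths: Iterable[int]) -> int:
    """Length L such that scaffolds of length >= L hold at least half the assembly."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("no scaffold lengths given")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise AssertionError("unreachable: cumulative sum always reaches half the total")


if __name__ == "__main__":
    # Read totals and k-mer-based genome size range taken from the abstract.
    illumina_bases = 43e9
    nanopore_bases = 9e9
    for genome_size in (791e6, 967e6):
        print(
            f"genome {genome_size / 1e6:.0f} Mbp: "
            f"Illumina {coverage(illumina_bases, genome_size):.1f}x, "
            f"Nanopore {coverage(nanopore_bases, genome_size):.1f}x"
        )

    # Hypothetical scaffold lengths (not from the paper), to show how N50 is derived.
    toy_scaffolds = [2, 2, 2, 3, 3, 4, 8, 8]
    print("toy N50:", n50(toy_scaffolds))
```

Because N50 depends only on the sorted length distribution, joining many short scaffolds into fewer long ones raises it sharply, which is consistent with the 18-fold increase reported for the hybrid assembly.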

Conclusions: We present the first genome of any anemonefish and demonstrate the value of low coverage (∼11×) long Nanopore read sequencing in improving both genome assembly contiguity and completeness. The near-complete assembly of the A. ocellaris genome will be an invaluable molecular resource for supporting a range of genetic, genomic, and phylogenetic studies specifically for clownfish and more generally for other related fish species of the family Pomacentridae.

Citing Articles

The genome of the sapphire damselfish: a new resource to support further investigation of the evolution of Pomacentrids.

Gairin E, Miura S, Takamiyagi H, Herrera M, Laudet V. GigaByte. 2025; 2024:gigabyte144.

PMID: 39791000 PMC: 11711634. DOI: 10.46471/gigabyte.144.


Convergent gene losses and pseudogenizations in multiple lineages of stomachless fishes.

Kato A, Pipil S, Ota C, Kusakabe M, Watanabe T, Nagashima A. Commun Biol. 2024; 7(1):408.

PMID: 38570609 PMC: 10991444. DOI: 10.1038/s42003-024-06103-x.


Whole genome assembly and annotation of the King Angelfish gives insight into the evolution of marine fishes of the Tropical Eastern Pacific.

Gatins R, Arias C, Sanchez C, Bernardi G, De Leon L. GigaByte. 2024; 2024:gigabyte115.

PMID: 38550358 PMC: 10973836. DOI: 10.46471/gigabyte.115.


Convergent genomic signatures associated with vertebrate viviparity.

Eastment R, Wong B, McGee M. BMC Biol. 2024; 22(1):34.

PMID: 38331819 PMC: 10854053. DOI: 10.1186/s12915-024-01837-w.


Genomic resources for the Yellowfin tuna Thunnus albacares.

Dimens P, Jones K, Margulies D, Scholey V, Cusatti S, McPeak B. Mol Biol Rep. 2024; 51(1):232.

PMID: 38281308 DOI: 10.1007/s11033-023-09117-6.

