Dataset for Genome Sequencing and De Novo Assembly of the Vietnamese Bighead Catfish (Clarias macrocephalus Günther, 1864)

Overview
Journal Data Brief
Date 2020 Jul 9
PMID 32637481
Citations 5
Abstract

Freshwater catfish of the genus Clarias, known as the airbreathing catfish, are widespread and important for food security through small-scale inland fisheries and aquaculture. Limited genomic data are available for this important group of fishes. The bighead catfish (Clarias macrocephalus) is a commercial aquaculture species in Southeast Asia that is threatened in its natural environment by habitat destruction, over-exploitation and competition from other introduced species of Clarias. Despite its commercial importance and the threats to natural populations, public databases do not include any genomic data for C. macrocephalus. We present the first genomic data for the bighead catfish, generated by Illumina sequencing. A total of 128 Gb of sequence data in paired-end 150 bp reads were assembled de novo, generating a final assembly of 883 Mbp contained in 27,833 scaffolds (N50 length: 80.8 kbp) with BUSCO completeness assessments of 96.3% and 87.6% based on metazoan and Actinopterygii ortholog datasets, respectively. Annotation of the genome predicted 21,124 gene sequences, which were assigned putative functions based on homology to existing protein sequences in public databases. Raw fastq reads and the final version of the genome assembly have been deposited in the NCBI (BioProject: PRJNA604477, WGS: JAAGKR000000000, SRA: SRR11188453). The complete mitochondrial genome was also recovered from the same sequence read dataset and is available on NCBI (accession: MT109097), representing the first mitogenome for this species. Lastly, we find an expansion of two genes thought to be associated with adaptations to air-breathing and a semi-terrestrial lifestyle in this genus of catfish.
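The N50 statistic quoted for the assembly can be illustrated with a minimal sketch. The function name `n50` and the toy scaffold lengths below are assumptions made for illustration; they are not taken from the dataset or from the authors' pipeline. The last lines estimate the raw sequencing depth implied by the reported totals, dividing by the assembly size as a stand-in for the true genome size, which the abstract does not state.

```python
def n50(lengths):
    """Return the N50: the largest length L such that scaffolds of
    length >= L together contain at least half of the assembled bases."""
    if not lengths:
        raise ValueError("no scaffold lengths given")
    total = sum(lengths)
    running = 0
    # Walk scaffolds from longest to shortest, accumulating bases.
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy scaffold lengths in bp (illustrative only, not from the dataset).
toy_scaffolds = [100, 80, 60, 40, 20]
toy_n50 = n50(toy_scaffolds)

# Rough depth implied by the reported figures: 128 Gb of reads over an
# 883 Mbp assembly. Assumption: assembly size approximates genome size.
raw_bases = 128e9
assembly_size = 883e6
approx_depth = raw_bases / assembly_size

print(f"toy N50: {toy_n50} bp")
print(f"approximate raw depth: {approx_depth:.0f}x")
```

On real data, the scaffold lengths would be read from the deposited assembly FASTA rather than typed in by hand.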

Citing Articles

Genome sequencing and assembly of near threatened Clarias dussumieri (Valenciennes, 1840), an endemic catfish of peninsular India.

Mohindra V, Chowdhury L, Charan R, Basheer V, Jena J. Sci Data. 2024; 11(1):1406.

PMID: 39702573 PMC: 11659475. DOI: 10.1038/s41597-024-04272-2.


The complete mitochondrial genome of the blackskin catfish (: Clariidae) from Rokan River, Riau, Indonesia.

Marnis H, Syahputra K, Iswanto B, Cartealy I, Sularto, Darmawan J. Mitochondrial DNA B Resour. 2024; 9(8):1093-1097.

PMID: 39165382 PMC: 11334743. DOI: 10.1080/23802359.2024.2392742.


Mitochondriomics of Fishes (Siluriformes: Clariidae) with a New Assembly of : Insights into the Genetic Characterization and Diversification.

De Alwis P, Kundu S, Gietbong F, Amin M, Lee S, Kim H. Life (Basel). 2023; 13(2).

PMID: 36836839 PMC: 9960581. DOI: 10.3390/life13020482.


Chromosome-level assembly and annotation of the blue catfish Ictalurus furcatus, an aquaculture species for hybrid catfish reproduction, epigenetics, and heterosis studies.

Wang H, Su B, Butts I, Dunham R, Wang X. Gigascience. 2022; 11.

PMID: 35809049 PMC: 9270728. DOI: 10.1093/gigascience/giac070.


Endogenic upregulations of HIF/VEGF signaling pathway genes promote air breathing organ angiogenesis in bimodal respiration fish.

Huang S, Yang L, Zhang L, Sun B, Gao J, Chen Z. Funct Integr Genomics. 2021; 22(1):65-76.

PMID: 34839401 DOI: 10.1007/s10142-021-00822-8.
