» Articles » PMID: 39799174

Chromosome-level Genome Assembly of Korean Long-tailed Chicken and Pangenome of 40 Gallus Gallus Assemblies

Overview
Journal Sci Data
Date 2025 Jan 11
PMID 39799174
Authors
Affiliations
Soon will be listed here.
Abstract

This study presents the first chromosome-level genome assembly of the Korean long-tailed chicken (KLC), a unique breed of Gallus gallus known as Ginkkoridak. Our assembly achieved a super contig N50 of 5.7 Mbp and a scaffold N50 exceeding 90 Mb, with a genome completeness of 96.3% as assessed by BUSCO using the aves_odb10 set. We also constructed a comprehensive pangenome graph, incorporating 40 Gallus gallus assemblies, including the KLC genome. This graph comprises 87,934,214 nodes, 121,720,974 edges, and a total sequence length of 1,709,850,352 bp. Notably, our KLC assembly contributed 1,919,925 bp of new sequences to the pangenome, underscoring the unique genetic makeup of this breed. Furthermore, in comparison with the pangenome, we identified 36,818 structural variants in KLC, which included 2,529 insertions, 27,743 deletions, and 6,546 of either insertions or deletions shorter than 1 kb. We also successfully identified pan-genome wide non-reference sequences. Our KLC assembly and pangenome graph provide valuable genomic resources for studying G. gallus populations.

References
1.
Chen N . Using RepeatMasker to identify repetitive elements in genomic sequences. Curr Protoc Bioinformatics. 2008; Chapter 4:Unit 4.10. DOI: 10.1002/0471250953.bi0410s05. View

2.
Li M, Sun C, Xu N, Bian P, Tian X, Wang X . De Novo Assembly of 20 Chicken Genomes Reveals the Undetectable Phenomenon for Thousands of Core Genes on Microchromosomes and Subtelomeric Regions. Mol Biol Evol. 2022; 39(4). PMC: 9021737. DOI: 10.1093/molbev/msac066. View

3.
Bolger A, Lohse M, Usadel B . Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014; 30(15):2114-20. PMC: 4103590. DOI: 10.1093/bioinformatics/btu170. View

4.
Rice E, Alberdi A, Alfieri J, Athrey G, Balacco J, Bardou P . A pangenome graph reference of 30 chicken genomes allows genotyping of large and complex structural variants. BMC Biol. 2023; 21(1):267. PMC: 10664547. DOI: 10.1186/s12915-023-01758-0. View

5.
Rhie A, McCarthy S, Fedrigo O, Damas J, Formenti G, Koren S . Towards complete and error-free genome assemblies of all vertebrate species. Nature. 2021; 592(7856):737-746. PMC: 8081667. DOI: 10.1038/s41586-021-03451-0. View