PMID: 37993882

A Pangenome Graph Reference of 30 Chicken Genomes Allows Genotyping of Large and Complex Structural Variants

Abstract

Background: The red junglefowl, the wild outgroup of domestic chickens, has historically served as a reference for genomic studies of domestic chickens. These studies have provided insight into the etiology of traits of commercial importance. However, the use of a single reference genome does not capture the diversity present among modern breeds, many of which have accumulated molecular changes due to drift and selection. While reference-based resequencing is well-suited to cataloging simple variants such as single-nucleotide changes and short insertions and deletions, it is mostly inadequate for discovering more complex structural variation in the genome.

Methods: We present a pangenome for the domestic chicken consisting of thirty assemblies of chickens from different breeds and research lines.

Results: We demonstrate how this pangenome can be used to catalog structural variants present in modern breeds and untangle complex nested variation. We show that alignment of short reads from 100 diverse wild and domestic chickens to this pangenome reduces reference bias by 38%, which affects downstream genotyping results. This approach also allows for the accurate genotyping of a large and complex pair of structural variants at the K feathering locus using short reads, which would not be possible using a linear reference.

Conclusions: We expect that this new paradigm of genomic reference will allow better pinpointing of exact mutations responsible for specific phenotypes, which will in turn be necessary for breeding chickens that meet new sustainability criteria and are resilient to quickly evolving pathogen threats.

Citing Articles

Comparative population pangenomes reveal unexpected complexity and fitness effects of structural variants.

Edwards S, Fang B, Khost D, Kolyfetis G, Cheek R, Deraad D bioRxiv. 2025.

PMID: 39990470 PMC: 11844517. DOI: 10.1101/2025.02.11.637762.


Near telomere-to-telomere genome assemblies of Silkie Gallus gallus and Mallard Anas platyrhynchos restored the structure of chromosomes and "missing" genes in birds.

Zhao Q, Yin Z, Hou Z J Anim Sci Biotechnol. 2025; 16(1):9.

PMID: 39828703 PMC: 11745021. DOI: 10.1186/s40104-024-01141-1.


Chromosome-level Genome Assembly of Korean Long-tailed Chicken and Pangenome of 40 Gallus gallus Assemblies.

Shin H, Park W, Chai H, Lee Y, Jung J, Ko B Sci Data. 2025; 12(1):51.

PMID: 39799174 PMC: 11724944. DOI: 10.1038/s41597-024-04287-9.


Pangenome graphs and their applications in biodiversity genomics.

Secomandi S, Gallo G, Rossi R, Rodriguez Fernandes C, Jarvis E, Bonisoli-Alquati A Nat Genet. 2025; 57(1):13-26.

PMID: 39779953 DOI: 10.1038/s41588-024-02029-6.


Genome-Wide Structural Variation Analysis and Breed Comparison of Local Domestic Ducks in Shandong Province, China.

Ren P, Zhang M, Khan M, Yang L, Jing Y, Liu X Animals (Basel). 2025; 14(24).

PMID: 39765561 PMC: 11672513. DOI: 10.3390/ani14243657.

