» Articles » PMID: 39665690

A Graph-based Goat Pangenome Reveals Structural Variations Involved in Domestication and Adaptation

Abstract

Pangenomes can facilitate a deeper understanding of genome complexity. Using de novo phased long-read assemblies of eight representative goat breeds, we constructed a graph-based pangenome of goats (Capra hircus) and discovered 113-Mb autosomal novel sequences. Combining this multi-assembly pangenome with low-coverage PacBio HiFi sequences, we constructed a long-read structural variations (SVs) database containing 59,325 SV deletions, 84,910 SV insertions, and 24,954 other complex SV alleles. This resource allowed reliable graph-based genotyping from short reads of 79 wild and 1,148 worldwide domestic goats. Selection signal analysis of SV captured a novel immune-related domestication locus containing the galectin-9 gene and extra copies of the ruminant-specific galectin-9-like genes (LGALS9L), which have high tissue specificity. A segmental duplication in domestic goats generates three additional LGALS9L copies. Ancient goat genome sequences show a gradual increase in frequency of this duplication from the Neolithic to the present. Two other newly detected SVs also have higher selection signals than adjacent SNPs, a truncated-LINE1 deletion in EDAR2 associated with cashmere production and a VNTR-related insertion in PAPSS2 linked to high-altitude adaptation. In summary, the multi-assembly goat pangenome and long-read SV database facilitates detecting complex variations that are important in evolution and selection.

References
1.
Dai X, Bian P, Hu D, Luo F, Huang Y, Jiao S . A Chinese indicine pangenome reveals a wealth of novel structural variants introgressed from other species. Genome Res. 2023; 33(8):1284-1298. PMC: 10547261. DOI: 10.1101/gr.277481.122. View

2.
Foissac S, Djebali S, Munyard K, Vialaneix N, Rau A, Muret K . Multi-species annotation of transcriptome and chromatin structure in domesticated animals. BMC Biol. 2019; 17(1):108. PMC: 6936065. DOI: 10.1186/s12915-019-0726-5. View

3.
Tettelin H, Riley D, Cattuto C, Medini D . Comparative genomics: the bacterial pan-genome. Curr Opin Microbiol. 2008; 11(5):472-7. DOI: 10.1016/j.mib.2008.09.006. View

4.
Yanai I, Benjamin H, Shmoish M, Chalifa-Caspi V, Shklar M, Ophir R . Genome-wide midrange transcription profiles reveal expression level relationships in human tissue specification. Bioinformatics. 2004; 21(5):650-9. DOI: 10.1093/bioinformatics/bti042. View

5.
Worley K . A golden goat genome. Nat Genet. 2017; 49(4):485-486. DOI: 10.1038/ng.3824. View