» Articles » PMID: 19749023

Genomic Selection in Admixed and Crossbred Populations

Overview
Journal J Anim Sci
Date 2009 Sep 15
PMID 19749023
Citations 72
Authors
Affiliations
Soon will be listed here.
Abstract

In livestock, genomic selection (GS) has primarily been investigated by simulation of purebred populations. Traits of interest are, however, often measured in crossbred or mixed populations with uncertain breed composition. If such data are used as the training data for GS without accounting for breed composition, estimates of marker effects may be biased due to population stratification and admixture. To investigate this, a genome of 100 cM was simulated with varying marker densities (5 to 40 segregating markers per cM). After 1,000 generations of random mating in a population of effective size 500, 4 lines with effective size 100 were isolated and mated for another 50 generations to create 4 pure breeds. These breeds were used to generate combined, F(1), F(2), 3- and 4-way crosses, and admixed training data sets of 1,000 individuals with phenotypes for an additive trait controlled by 100 segregating QTL and heritability of 0.30. The validation data set was a sample of 1,000 genotyped individuals from one pure breed. Method Bayes-B was used to simultaneously estimate the effects of all markers for breeding value estimation. With 5 (40) markers per cM, the correlation of true with estimated breeding value of selection candidates (accuracy) was greatest, 0.79 (0.85), when data from the same pure breed were used for training. When the training data set consisted of crossbreds, the accuracy ranged from 0.66 (0.79) to 0.74 (0.83) for the 2 marker densities, respectively. The admixed training data set resulted in nearly the same accuracies as when training was in the breed to which selection candidates belonged. However, accuracy was greatly reduced when genes from the target pure breed were not included in the admixed or crossbred population. This implies that, with high-density markers, admixed and crossbred populations can be used to develop GS prediction equations for all pure breeds that contributed to the population, without a substantial loss of accuracy compared with training on purebred data, even if breed origin has not been explicitly taken into account. In addition, using GS based on high-density marker data, purebreds can be accurately selected for crossbred performance without the need for pedigree or breed information. Results also showed that haplotype segments with strong linkage disequilibrium are shorter in crossbred and admixed populations than in purebreds, providing opportunities for QTL fine mapping.

Citing Articles

Accuracy of Genomic Predictions Cross Populations with Different Linkage Disequilibrium Patterns.

Jin L, Xu L, Jin H, Zhao S, Jia Y, Li J Genes (Basel). 2024; 15(11).

PMID: 39596619 PMC: 11594128. DOI: 10.3390/genes15111419.


Machine Learning for the Genomic Prediction of Growth Traits in a Composite Beef Cattle Population.

Hay E Animals (Basel). 2024; 14(20).

PMID: 39457945 PMC: 11505319. DOI: 10.3390/ani14203014.


Effective population size in field pea.

Johnson J, Piche L, Worral H, Atanda S, Coyne C, McGee R BMC Genomics. 2024; 25(1):695.

PMID: 39009980 PMC: 11251210. DOI: 10.1186/s12864-024-10587-6.


Population genomics of American mink using genotype data.

Hu G, Do D, Manafiazar G, Kelvin A, Sargolzaei M, Plastow G Front Genet. 2023; 14:1175408.

PMID: 37274788 PMC: 10234291. DOI: 10.3389/fgene.2023.1175408.


Genomic prediction in pigs using data from a commercial crossbred population: insights from the Duroc x (Landrace x Yorkshire) three-way crossbreeding system.

Liu S, Yao T, Chen D, Xiao S, Chen L, Zhang Z Genet Sel Evol. 2023; 55(1):21.

PMID: 36977978 PMC: 10053053. DOI: 10.1186/s12711-023-00794-2.