» Articles » PMID: 30458700

Impact of Rare and Low-frequency Sequence Variants on Reliability of Genomic Prediction in Dairy Cattle

Overview
Journal Genet Sel Evol
Publisher Biomed Central
Specialties Biology
Genetics
Date 2018 Nov 22
PMID 30458700
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Availability of whole-genome sequence data for a large number of cattle and efficient imputation methodologies open a new opportunity to include rare and low-frequency variants (RLFV) in genomic prediction in dairy cattle. The objective of this study was to examine the impact of including RLFV that are within genes and selected from whole-genome sequence variants, on the reliability of genomic prediction for fertility, health and longevity in dairy cattle.

Results: All genic RLFV with a minor allele frequency lower than 0.05 were extracted from imputed sequence data and subsets were created using different strategies. These subsets were subsequently combined with Illumina 50 k single nucleotide polymorphism (SNP) data and used for genomic prediction. Reliability of prediction obtained by using 50 k SNP data alone was used as reference value and absolute changes in reliabilities are referred to as changes in percentage points. Adding a component that included either all the genic or a subset of selected RLFV into the model in addition to the 50 k component changed the reliability of predictions by - 2.2 to 1.1%, i.e. hardly no change in reliability of prediction was found, regardless of how the RLFV were selected. In addition to these empirical analyses, a simulation study was performed to evaluate the potential impact of adding RLFV in the model on the reliability of prediction. Three sets of causal RLFV (containing 21,468, 1348 and 235 RLFV) that were randomly selected from different numbers of genes were generated and accounted for 10% additional genetic variance of the estimated variance explained by the 50 k SNPs. When genic RLFV based on mapping results were included in the prediction model, reliabilities improved by up to 4.0% and when the causal RLFV were included they improved by up to 6.8%.

Conclusions: Using selected RLFV from whole-genome sequence data had only a small impact on the empirical reliability of genomic prediction in dairy cattle. Our simulations revealed that for sequence data to bring a benefit, the key is to identify causal RLFV.

Citing Articles

The effect of marker types and density on genomic prediction and GWAS of key performance traits in tetraploid potato.

Aalborg T, Sverrisdottir E, Kristensen H, Nielsen K Front Plant Sci. 2024; 15:1340189.

PMID: 38525152 PMC: 10957621. DOI: 10.3389/fpls.2024.1340189.


Haplotype blocks for genomic prediction: a comparative evaluation in multiple crop datasets.

Weber S, Frisch M, Snowdon R, Voss-Fels K Front Plant Sci. 2023; 14:1217589.

PMID: 37731980 PMC: 10507710. DOI: 10.3389/fpls.2023.1217589.


Impact of linkage disequilibrium heterogeneity along the genome on genomic prediction and heritability estimation.

Ren D, Cai X, Lin Q, Ye H, Teng J, Li J Genet Sel Evol. 2022; 54(1):47.

PMID: 35761182 PMC: 9235212. DOI: 10.1186/s12711-022-00737-3.


On the use of whole-genome sequence data for across-breed genomic prediction and fine-scale mapping of QTL.

Meuwissen T, van den Berg I, Goddard M Genet Sel Evol. 2021; 53(1):19.

PMID: 33637049 PMC: 7908738. DOI: 10.1186/s12711-021-00607-4.


A multi-breed reference panel and additional rare variants maximize imputation accuracy in cattle.

Rowan T, Hoff J, Crum T, Taylor J, Schnabel R, Decker J Genet Sel Evol. 2019; 51(1):77.

PMID: 31878893 PMC: 6933688. DOI: 10.1186/s12711-019-0519-x.

References
1.
van den Berg I, Boichard D, Guldbrandtsen B, Lund M . Using Sequence Variants in Linkage Disequilibrium with Causative Mutations to Improve Across-Breed Prediction in Dairy Cattle: A Simulation Study. G3 (Bethesda). 2016; 6(8):2553-61. PMC: 4978908. DOI: 10.1534/g3.116.027730. View

2.
Hayes B, Visscher P, Goddard M . Increased accuracy of artificial selection by using the realized relationship matrix. Genet Res (Camb). 2009; 91(1):47-60. DOI: 10.1017/S0016672308009981. View

3.
Caballero A, Tenesa A, Keightley P . The Nature of Genetic Variation for Complex Traits Revealed by GWAS and Regional Heritability Mapping Analyses. Genetics. 2015; 201(4):1601-13. PMC: 4676519. DOI: 10.1534/genetics.115.177220. View

4.
VanRaden P . Efficient methods to compute genomic predictions. J Dairy Sci. 2008; 91(11):4414-23. DOI: 10.3168/jds.2007-0980. View

5.
Calus M, Bouwman A, Schrooten C, Veerkamp R . Efficient genomic prediction based on whole-genome sequence data using split-and-merge Bayesian variable selection. Genet Sel Evol. 2016; 48(1):49. PMC: 4926307. DOI: 10.1186/s12711-016-0225-x. View