
Predicted Residual Error Sum of Squares of Mixed Models: An Application for Genomic Prediction

Overview
Journal G3 (Bethesda)
Date 2017 Jan 22
PMID 28108552
Citations 14
Abstract

Genomic prediction is a statistical method to predict phenotypes of polygenic traits using high-throughput genomic data. Most diseases and behaviors in humans and animals are polygenic traits. The majority of agronomic traits in crops are also polygenic. Accurate prediction of these traits can help medical professionals diagnose acute diseases and help breeders increase food production, and can therefore contribute significantly to human health and global food security. The best linear unbiased prediction (BLUP) is an important tool to analyze high-throughput genomic data for prediction. However, to judge the efficacy of the BLUP model with a particular set of predictors for a given trait, one has to provide an unbiased mechanism to evaluate the predictability. Cross-validation (CV) is an essential tool to achieve this goal: a sample is partitioned into K parts of roughly equal size, one part is predicted using parameters estimated from the remaining K - 1 parts, and eventually every part is predicted using a sample excluding that part. Such a CV is called the K-fold CV. Unfortunately, CV substantially increases the computational burden. We developed an alternative method, the HAT method, to replace CV. The new method corrects the estimated residual errors from the whole-sample analysis using the leverage values of a hat matrix of the random effects to obtain the predicted residual errors. Properties of the HAT method were investigated using seven agronomic and 1000 metabolomic traits of an inbred rice population. Results showed that the HAT method is a very good approximation of the CV method. The method was also applied to 10 traits in 1495 rice hybrids with 1.6 million SNPs, and to human height of 6161 subjects with roughly 0.5 million SNPs from the Framingham Heart Study data. Predictabilities of the HAT and CV methods were all similar. The HAT method allows us to easily evaluate the predictabilities of genomic prediction for large numbers of traits in very large populations.
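
The leverage correction described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and makes simplifying assumptions that the abstract does not state: there are no fixed effects, the variance ratio `lam` is treated as known rather than estimated, and the comparison is against leave-one-out CV (K equal to the sample size) rather than a general K-fold CV. Under those assumptions the random-effect hat matrix is taken to be H = K(K + lam I)^-1, and dividing each whole-sample residual by 1 - h_ii reproduces the leave-one-out residual. The function names, the simulated data, and the summary 1 - PRESS/SS are illustrative choices.

```python
import numpy as np


def hat_predicted_residuals(K, y, lam):
    """Predicted residuals from one whole-sample fit (sketch, no fixed effects).

    K   : n x n symmetric kinship/kernel matrix (assumed given)
    y   : length-n phenotype vector
    lam : assumed known ratio of residual to genetic variance
    """
    n = len(y)
    # H = K (K + lam I)^-1; K and (K + lam I)^-1 commute, so solve() gives H.
    H = np.linalg.solve(K + lam * np.eye(n), K)
    resid = y - H @ y                      # estimated (whole-sample) residuals
    return resid / (1.0 - np.diag(H))      # leverage-corrected residuals


def loo_predicted_residuals(K, y, lam):
    """Explicit leave-one-out residuals: refit n times, predict the held-out unit."""
    n = len(y)
    out = np.empty(n)
    for i in range(n):
        keep = np.arange(n) != i
        K_train = K[np.ix_(keep, keep)]
        alpha = np.linalg.solve(K_train + lam * np.eye(n - 1), y[keep])
        out[i] = y[i] - K[i, keep] @ alpha
    return out


def predictability(pred_resid, y):
    """1 - PRESS / SS, one common way to summarize predicted residuals."""
    press = np.sum(pred_resid ** 2)
    ss = np.sum((y - y.mean()) ** 2)
    return 1.0 - press / ss


# Simulated example: 40 individuals, 200 markers (all values are made up).
rng = np.random.default_rng(0)
Z = rng.standard_normal((40, 200))
K_demo = Z @ Z.T / Z.shape[1]
y_demo = Z @ rng.standard_normal(200) * 0.1 + rng.standard_normal(40)
y_demo = y_demo - y_demo.mean()

e_hat = hat_predicted_residuals(K_demo, y_demo, lam=1.0)
e_loo = loo_predicted_residuals(K_demo, y_demo, lam=1.0)
print("max |HAT - LOO| =", np.max(np.abs(e_hat - e_loo)))
print("predictability (HAT) =", predictability(e_hat, y_demo))
```

The single whole-sample fit replaces n separate refits, which is where the computational saving comes from. When fixed effects are present or the variance components are re-estimated within each fold, the two sets of residuals need not match exactly, consistent with the abstract describing HAT as an approximation of CV.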

Citing Articles

GA-GBLUP: leveraging the genetic algorithm to improve the predictability of genomic selection.

Xu Y, Zhang Y, Cui Y, Zhou K, Yu G, Yang W Brief Bioinform. 2024; 25(5).

PMID: 39101500 PMC: 11299030. DOI: 10.1093/bib/bbae385.


An expression-directed linear mixed model discovering low-effect genetic variants.

Li Q, Bian J, Qian Y, Kossinna P, Gau C, Gordon P Genetics. 2024; 226(4).

PMID: 38314848 PMC: 11630775. DOI: 10.1093/genetics/iyae018.


Total Reflection X-ray Fluorescence Spectrometric Analysis of Ten Lanthanides at the Ultratrace Level Having a High Degree of Overlap in the Emission Lines.

Sanyal K, Saha A, Sarkar A, Deb S, Pai R, Saxena M ACS Omega. 2023; 8(44):41402-41410.

PMID: 37970058 PMC: 10633917. DOI: 10.1021/acsomega.3c05139.


Integrating genome-wide association study into genomic selection for the prediction of agronomic traits in rice (Oryza sativa L.).

Zhang Y, Zhang M, Ye J, Xu Q, Feng Y, Xu S Mol Breed. 2023; 43(11):81.

PMID: 37965378 PMC: 10641074. DOI: 10.1007/s11032-023-01423-y.


Genome-wide association study and genomic prediction for yield and grain quality traits of hybrid rice.

Yu P, Ye C, Li L, Yin H, Zhao J, Wang Y Mol Breed. 2023; 42(4):16.

PMID: 37309463 PMC: 10248665. DOI: 10.1007/s11032-022-01289-6.

