A Multivariate Poisson Deep Learning Model for Genomic Prediction of Count Data
Overview
Molecular Biology
Genomic selection (GS) is a revolutionary approach to developing new plants and animals. It is a predictive methodology that relies on learning methods. Unfortunately, no single model works for all types of prediction, so each type of response variable requires its own methodology. Because efficient methods for multivariate count outcomes are lacking, this paper proposes a multivariate Poisson deep neural network (MPDN) model for the simultaneous genomic prediction of several count outcomes. The MPDN model uses the negative log-likelihood of a Poisson distribution as its loss function, the rectified linear unit (ReLU) activation function in the hidden layers to capture nonlinear patterns, and the exponential activation function in the output layer to produce predictions on the same scale as the counts. The proposed MPDN model was compared with conventional generalized Poisson regression models and univariate Poisson deep learning models on two experimental count data sets. The MPDN outperformed the univariate Poisson deep neural network models but, in terms of prediction, did not outperform the univariate generalized Poisson regression models. All deep learning models were implemented with TensorFlow as the back-end and Keras as the front-end, which allows them to be applied to moderate and large data sets, a significant advantage over previous GS models for multivariate count data.
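The three ingredients named in the abstract (ReLU hidden layers, an exponential output layer, and a Poisson negative log-likelihood loss) can be illustrated with a minimal, framework-free sketch. This is not the authors' TensorFlow/Keras implementation: the function names (`init_params`, `mpdn_forward`, `poisson_nll`), the layer sizes, the random initialization, and the toy marker vector are all assumptions made for illustration, and no training step is shown.

```python
import math
import random


def relu(v):
    # Rectified linear unit applied element-wise.
    return [max(0.0, x) for x in v]


def dense(x, W, b):
    # Affine layer: W holds one weight row per output unit.
    return [sum(w * xi for w, xi in zip(row, x)) + bj for row, bj in zip(W, b)]


def init_params(n_inputs, hidden_sizes, n_traits, seed=0):
    # Hypothetical initialization: small Gaussian weights, zero biases.
    rng = random.Random(seed)
    sizes = [n_inputs] + list(hidden_sizes)
    hidden = []
    for n_in, n_out in zip(sizes[:-1], sizes[1:]):
        W = [[rng.gauss(0.0, 0.1) for _ in range(n_in)] for _ in range(n_out)]
        hidden.append((W, [0.0] * n_out))
    W_out = [[rng.gauss(0.0, 0.1) for _ in range(sizes[-1])] for _ in range(n_traits)]
    return {"hidden": hidden, "output": (W_out, [0.0] * n_traits)}


def mpdn_forward(x, params):
    # Hidden layers use ReLU to capture nonlinear patterns.
    h = x
    for W, b in params["hidden"]:
        h = relu(dense(h, W, b))
    # The exponential output keeps every predicted mean strictly positive,
    # i.e. on the same scale as the observed counts.
    W_out, b_out = params["output"]
    return [math.exp(z) for z in dense(h, W_out, b_out)]


def poisson_nll(y, mu):
    # Poisson negative log-likelihood averaged over traits:
    #   mu - y * log(mu) + log(y!)
    # The log(y!) term does not depend on the network weights, so a
    # training loss can drop it without changing the gradients.
    terms = [m - t * math.log(m) + math.lgamma(t + 1) for t, m in zip(y, mu)]
    return sum(terms) / len(terms)


# Toy example: one individual, five markers, two count traits (made-up values).
params = init_params(n_inputs=5, hidden_sizes=[4, 3], n_traits=2)
x = [0.0, 1.0, 2.0, 1.0, 0.0]
y = [3, 0]
mu = mpdn_forward(x, params)
loss = poisson_nll(y, mu)
```

The multivariate aspect shows up only in the output layer: all traits share the hidden representation, and the loss averages the per-trait Poisson terms, so a single network is fitted to all count outcomes at once.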