A Copula Based Approach for Design of Multivariate Random Forests for Drug Sensitivity Prediction
Overview
Affiliations
Modeling sensitivity to drugs based on genetic characterizations is a significant challenge in the area of systems medicine. Ensemble based approaches such as Random Forests have been shown to perform well in both individual sensitivity prediction studies and team science based prediction challenges. However, Random Forests generate a deterministic predictive model for each drug based on the genetic characterization of the cell lines and ignores the relationship between different drug sensitivities during model generation. This application motivates the need for generation of multivariate ensemble learning techniques that can increase prediction accuracy and improve variable importance ranking by incorporating the relationships between different output responses. In this article, we propose a novel cost criterion that captures the dissimilarity in the output response structure between the training data and node samples as the difference in the two empirical copulas. We illustrate that copulas are suitable for capturing the multivariate structure of output responses independent of the marginal distributions and the copula based multivariate random forest framework can provide higher accuracy prediction and improved variable selection. The proposed framework has been validated on genomics of drug sensitivity for cancer and cancer cell line encyclopedia database.
ITNR: Inversion Transformer-based Neural Ranking for cancer drug recommendations.
Sotudian S, Paschalidis I Comput Biol Med. 2024; 172:108312.
PMID: 38503090 PMC: 10990436. DOI: 10.1016/j.compbiomed.2024.108312.
Multivariate random forest prediction of poverty and malnutrition prevalence.
Browne C, Matteson D, McBride L, Hu L, Liu Y, Sun Y PLoS One. 2021; 16(9):e0255519.
PMID: 34495951 PMC: 8425567. DOI: 10.1371/journal.pone.0255519.
A Review of Current Methods for Repositioning Drugs and Chemical Compounds.
He B, Hou F, Ren C, Bing P, Xiao X Front Oncol. 2021; 11:711225.
PMID: 34367996 PMC: 8340770. DOI: 10.3389/fonc.2021.711225.
Sotudian S, Paschalidis I IEEE/ACM Trans Comput Biol Bioinform. 2021; 19(4):2324-2333.
PMID: 34043512 PMC: 9642333. DOI: 10.1109/TCBB.2021.3084562.
Evaluating the consistency of large-scale pharmacogenomic studies.
Rahman R, Dhruba S, Matlock K, De-Niz C, Ghosh S, Pal R Brief Bioinform. 2019; 20(5):1734-1753.
PMID: 31846027 PMC: 6917220. DOI: 10.1093/bib/bby046.