PROMISE: a Tool to Identify Genomic Features with a Specific Biologically Interesting Pattern of Associations with Multiple Endpoint Variables
Overview
Authors
Affiliations
Motivation: In some applications, prior biological knowledge can be used to define a specific pattern of association of multiple endpoint variables with a genomic variable that is biologically most interesting. However, to our knowledge, there is no statistical procedure designed to detect specific patterns of association with multiple endpoint variables.
Results: Projection onto the most interesting statistical evidence (PROMISE) is proposed as a general procedure to identify genomic variables that exhibit a specific biologically interesting pattern of association with multiple endpoint variables. Biological knowledge of the endpoint variables is used to define a vector that represents the biologically most interesting values for statistics that characterize the associations of the endpoint variables with a genomic variable. A test statistic is defined as the dot-product of the vector of the observed association statistics and the vector of the most interesting values of the association statistics. By definition, this test statistic is proportional to the length of the projection of the observed vector of correlations onto the vector of most interesting associations. Statistical significance is determined via permutation. In simulation studies and an example application, PROMISE shows greater statistical power to identify genes with the interesting pattern of associations than classical multivariate procedures, individual endpoint analyses or listing genes that have the pattern of interest and are significant in more than one individual endpoint analysis.
Availability: Documented R routines are freely available from www.stjuderesearch.org/depts/biostats and will soon be available as a Bioconductor package from www.bioconductor.org.
Seffernick A, Cao X, Cheng C, Yang W, Autry R, Yang J bioRxiv. 2024; .
PMID: 39131398 PMC: 11312528. DOI: 10.1101/2024.07.31.605805.
Statistical Methods Inspired by Challenges in Pediatric Cancer Multi-omics.
Cao X, H Elsayed A, Pounds S Methods Mol Biol. 2023; 2629:349-373.
PMID: 36929085 DOI: 10.1007/978-1-0716-2986-4_16.
Genetics of pleiotropic effects of dexamethasone.
Ramsey L, Pounds S, Cheng C, Cao X, Yang W, Smith C Pharmacogenet Genomics. 2017; 27(8):294-302.
PMID: 28628558 PMC: 5523978. DOI: 10.1097/FPC.0000000000000293.
Cao X, Crews K, Downing J, Lamba J, Pounds S BMC Bioinformatics. 2016; 17(Suppl 13):382.
PMID: 27766934 PMC: 5073973. DOI: 10.1186/s12859-016-1217-0.
Clinical significance of in vivo cytarabine-induced gene expression signature in AML.
Lamba J, Pounds S, Cao X, Crews K, Cogle C, Bhise N Leuk Lymphoma. 2015; 57(4):909-20.
PMID: 26366682 PMC: 4794368. DOI: 10.3109/10428194.2015.1086918.