A Novel Approach to Detecting Epistasis Using Random Sampling Regularisation
Overview
Affiliations
Epistasis is a progressive approach that complements the 'common disease, common variant' hypothesis that highlights the potential for connected networks of genetic variants collaborating to produce a phenotypic expression. Epistasis is commonly performed as a pairwise or limitless-arity capacity that considers variant networks as either variant vs variant or as high order interactions. This type of analysis extends the number of tests that were previously performed in a standard approach such as Genome-Wide Association Study (GWAS), in which False Discovery Rate (FDR) is already an issue, therefore by multiplying the number of tests up to a factorial rate also increases the issue of FDR. Further to this, epistasis introduces its own limitations of computational complexity and intensity that are generated based on the analysis performed; to consider the most intense approach, a multivariate analysis introduces a time complexity of O(n!). Proposed in this paper is a novel methodology for the detection of epistasis using interpretable methods and best practice to outline interactions through filtering processes. Using a process of Random Sampling Regularisation which randomly splits and produces sample sets to conduct a voting system to regularise the significance and reliability of biological markers, SNPs. Preliminary results are promising, outlining a concise detection of interactions. Results for the detection of epistasis, in the classification of breast cancer patients, indicated eight outlined risk candidate interactions from five variants and a singular candidate variant with high protective association.
Carre C, Carluer J, Chaux C, Estoup-Streiff C, Roche N, Hosy E Genome Biol. 2024; 25(1):76.
PMID: 38523316 PMC: 10962106. DOI: 10.1186/s13059-024-03202-0.
Ahuja S, Shrimankar D, Durge A Curr Genomics. 2024; 24(4):207-235.
PMID: 38169652 PMC: 10758128. DOI: 10.2174/0113892029269523231101051455.
Genetic interactions effects for cancer disease identification using computational models: a review.
Manavalan R, Priya S Med Biol Eng Comput. 2021; 59(4):733-758.
PMID: 33839998 DOI: 10.1007/s11517-021-02343-9.
Walsh R, Voorhies K, McDonald M, McGeachie M, Sordillo J, Lange C J Pers Med. 2021; 11(2).
PMID: 33669563 PMC: 7922125. DOI: 10.3390/jpm11020145.