Prediction of Functional Regulatory SNPs in Monogenic and Complex Disease
Next-generation sequencing (NGS) technologies are yielding ever higher volumes of human genome sequence data. Given this large amount of data, it has become both possible and a priority to determine how disease-causing single nucleotide polymorphisms (SNPs) located within gene regulatory regions (rSNPs) exert their effects on gene expression. Recently, several studies have explored whether disease-causing polymorphisms have attributes that distinguish them from neutral ones, attaining moderate success at discriminating between functional and putatively neutral regulatory SNPs. Here, we have extended this work by assessing the utility of both SNP-based features (those associated only with the polymorphism site and the surrounding DNA) and gene-based features (those derived from the gene in whose regulatory region the SNP lies) in the identification of functional regulatory polymorphisms involved in either monogenic or complex disease. Gene-based features were found to be capable of both augmenting and enhancing the utility of SNP-based features in the prediction of known regulatory mutations. Adopting this approach, we achieved an area under the ROC curve (AUC) of 0.903 for predicting regulatory SNPs. Finally, our tool predicted 225 new regulatory SNPs with a high degree of confidence, with 105 of the 225 falling into linkage disequilibrium blocks of disease-associated SNPs reported by genome-wide association studies (GWAS).
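For readers unfamiliar with the AUC metric quoted above, the following is a minimal sketch of how an AUC can be computed from classifier scores. It is not the authors' pipeline: the function name `roc_auc` and the toy labels and scores are hypothetical, chosen only for illustration. The AUC equals the fraction of (functional, neutral) pairs in which the functional SNP receives the higher score, with ties counted as half.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via the pairwise (Mann-Whitney) identity.

    labels: 1 for a functional SNP, 0 for a putatively neutral one.
    scores: classifier outputs, higher meaning "more likely functional".
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")

    # Count positive/negative pairs that are ranked correctly; ties count half.
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data, not taken from the study.
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.1]
auc = roc_auc(labels, scores)  # 8 of the 9 pairs are ranked correctly
print(f"AUC = {auc:.3f}")
```

An AUC of 0.5 corresponds to random ranking and 1.0 to perfect separation, so a value of 0.903 means a functional SNP would outscore a neutral one in roughly nine of ten such pairs.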
Variation benchmark datasets: update, criteria, quality and applications.
Sarkar A, Yang Y, Vihinen M Database (Oxford). 2020; 2020.
PMID: 32016318 PMC: 6997940. DOI: 10.1093/database/baz117.
Yao Y, Ramsey S Pac Symp Biocomput. 2019; 25:535-546.
PMID: 31797625 PMC: 6897322.
CERENKOV2: improved detection of functional noncoding SNPs using data-space geometric features.
Yao Y, Liu Z, Wei Q, Ramsey S BMC Bioinformatics. 2019; 20(1):63.
PMID: 30727967 PMC: 6364436. DOI: 10.1186/s12859-019-2637-4.
Bryzgalov L, Korbolina E, Brusentsov I, Leberfarb E, Bondar N, Merkulova T BMC Neurosci. 2018; 19(Suppl 1):22.
PMID: 29745862 PMC: 5998904. DOI: 10.1186/s12868-018-0414-3.
Regulatory single nucleotide polymorphisms (rSNPs) at the promoters 1A and 1B of the human APC gene.
Matveeva M, Kashina E, Reshetnikov V, Bryzgalov L, Antontseva E, Bondar N BMC Genet. 2017; 17(Suppl 3):154.
PMID: 28105931 PMC: 5249005. DOI: 10.1186/s12863-016-0460-8.