» Articles » PMID: 21796725

Prediction of Functional Regulatory SNPs in Monogenic and Complex Disease

Overview
Journal Hum Mutat
Specialty Genetics
Date 2011 Jul 29
PMID 21796725
Citations 12
Authors
Affiliations
Soon will be listed here.
Abstract

Next-generation sequencing (NGS) technologies are yielding ever higher volumes of human genome sequence data. Given this large amount of data, it has become both a possibility and a priority to determine how disease-causing single nucleotide polymorphisms (SNPs) detected within gene regulatory regions (rSNPs) exert their effects on gene expression. Recently, several studies have explored whether disease-causing polymorphisms have attributes that can distinguish them from those that are neutral, attaining moderate success at discriminating between functional and putatively neutral regulatory SNPs. Here, we have extended this work by assessing the utility of both SNP-based features (those associated only with the polymorphism site and the surrounding DNA) and gene-based features (those derived from the associated gene in whose regulatory region the SNP lies) in the identification of functional regulatory polymorphisms involved in either monogenic or complex disease. Gene-based features were found to be capable of both augmenting and enhancing the utility of SNP-based features in the prediction of known regulatory mutations. Adopting this approach, we achieved an AUC of 0.903 for predicting regulatory SNPs. Finally, our tool predicted 225 new regulatory SNPs with a high degree of confidence, with 105 of the 225 falling into linkage disequilibrium blocks of reported disease-associated genome-wide association studies SNPs.

Citing Articles

Variation benchmark datasets: update, criteria, quality and applications.

Sarkar A, Yang Y, Vihinen M Database (Oxford). 2020; 2020.

PMID: 32016318 PMC: 6997940. DOI: 10.1093/database/baz117.


CERENKOV3: Clustering and molecular network-derived features improve computational prediction of functional noncoding SNPs.

Yao Y, Ramsey S Pac Symp Biocomput. 2019; 25:535-546.

PMID: 31797625 PMC: 6897322.


CERENKOV2: improved detection of functional noncoding SNPs using data-space geometric features.

Yao Y, Liu Z, Wei Q, Ramsey S BMC Bioinformatics. 2019; 20(1):63.

PMID: 30727967 PMC: 6364436. DOI: 10.1186/s12859-019-2637-4.


Novel functional variants at the GWAS-implicated loci might confer risk to major depressive disorder, bipolar affective disorder and schizophrenia.

Bryzgalov L, Korbolina E, Brusentsov I, Leberfarb E, Bondar N, Merkulova T BMC Neurosci. 2018; 19(Suppl 1):22.

PMID: 29745862 PMC: 5998904. DOI: 10.1186/s12868-018-0414-3.


Regulatory single nucleotide polymorphisms (rSNPs) at the promoters 1A and 1B of the human APC gene.

Matveeva M, Kashina E, Reshetnikov V, Bryzgalov L, Antontseva E, Bondar N BMC Genet. 2017; 17(Suppl 3):154.

PMID: 28105931 PMC: 5249005. DOI: 10.1186/s12863-016-0460-8.


References
1.
Guo Y, Jamison D . The distribution of SNPs in human gene regulatory regions. BMC Genomics. 2005; 6:140. PMC: 1260019. DOI: 10.1186/1471-2164-6-140. View

2.
Pruitt K, Tatusova T, Maglott D . NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 2006; 35(Database issue):D61-5. PMC: 1716718. DOI: 10.1093/nar/gkl842. View

3.
Sherry S, Ward M, Kholodov M, Baker J, Phan L, Smigielski E . dbSNP: the NCBI database of genetic variation. Nucleic Acids Res. 2000; 29(1):308-11. PMC: 29783. DOI: 10.1093/nar/29.1.308. View

4.
Lopez-Bigas N, Ouzounis C . Genome-wide identification of genes likely to be involved in human genetic disease. Nucleic Acids Res. 2004; 32(10):3108-14. PMC: 434425. DOI: 10.1093/nar/gkh605. View

5.
Karolchik D, Kuhn R, Baertsch R, Barber G, Clawson H, Diekhans M . The UCSC Genome Browser Database: 2008 update. Nucleic Acids Res. 2007; 36(Database issue):D773-9. PMC: 2238835. DOI: 10.1093/nar/gkm966. View