» Articles » PMID: 11281279

Selecting SNPs in Two-stage Analysis of Disease Association Data: a Model-free Approach

Overview
Journal Ann Hum Genet
Date 2001 Apr 3
PMID 11281279
Citations 30
Authors
Affiliations
Soon will be listed here.
Abstract

For large numbers of marker loci in a genomic scan for disease loci, we propose a novel 2-stage approach for linkage or association analysis. The two stages are (1) selection of a subset of markers that are 'important' for the trait studied, and (2) modelling interactions among markers and between markers and trait. Here we focus on stage 1 and develop a selection method based on a 2-level nested bootstrap procedure. The method is applied to single nucleotide polymorphisms (SNPs) data in a cohort study of heart disease patients. Out of the 89 original SNPs the method selects 11 markers as being 'important'. Conventional backward stepwise logistic regression on the 89 SNPs selects 7 markers, which are a subset of the 11 markers chosen by our method.

Citing Articles

Machine learning on genome-wide association studies to predict the risk of radiation-associated contralateral breast cancer in the WECARE Study.

Lee S, Liang X, Woods M, Reiner A, Concannon P, Bernstein L PLoS One. 2020; 15(2):e0226157.

PMID: 32106268 PMC: 7046218. DOI: 10.1371/journal.pone.0226157.


An adaptive threshold determination method of feature screening for genomic selection.

Fu G, Wang G, Dai X BMC Bioinformatics. 2017; 18(1):212.

PMID: 28403836 PMC: 5389084. DOI: 10.1186/s12859-017-1617-9.


Genome-wide association data classification and SNPs selection using two-stage quality-based Random Forests.

Nguyen T, Huang J, Wu Q, Nguyen T, Li M BMC Genomics. 2015; 16 Suppl 2:S5.

PMID: 25708662 PMC: 4331719. DOI: 10.1186/1471-2164-16-S2-S5.


AprioriGWAS, a new pattern mining strategy for detecting genetic variants associated with disease through interaction effects.

Zhang Q, Long Q, Ott J PLoS Comput Biol. 2014; 10(6):e1003627.

PMID: 24901472 PMC: 4046917. DOI: 10.1371/journal.pcbi.1003627.


A nonparametric test to detect quantitative trait loci where the phenotypic distribution differs by genotypes.

Aschard H, Zaitlen N, Tamimi R, Lindstrom S, Kraft P Genet Epidemiol. 2013; 37(4):323-33.

PMID: 23512279 PMC: 4088942. DOI: 10.1002/gepi.21716.