Optimal Selection of SNP Markers for Disease Association Studies
Genetic association studies with population samples hold the promise of uncovering the susceptibility genes underlying the heritability of complex or common diseases. Most association studies rely on surrogate markers, with single-nucleotide polymorphisms (SNPs) being the most suitable because of their abundance and ease of scoring. SNP marker selection aims to increase the chance that at least one typed SNP is in linkage disequilibrium (LD) with the disease-causing variant, while controlling the cost of the study in terms of the number of markers and samples genotyped. Empirical studies reporting block-like segments of the genome with high LD and low haplotype diversity have motivated a marker selection strategy in which subsets of SNPs that 'tag' the common haplotypes of a region are picked for genotyping, so that redundant SNPs are not typed. Building on these initial observations, a plethora of 'tagging' algorithms for selecting minimum informative subsets of SNPs have recently appeared in the literature. They differ mainly in two respects: the quality or correlation measure used to define tagging, and the algorithm used to minimize the final number of tagging SNPs. In this review, we describe the available tagging algorithms within a unifying 3-step framework, point out their methodological and conceptual differences, and assess their assumptions, performance, and scalability.
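To make the two ingredients named above concrete (a correlation measure that defines tagging, and a procedure that minimizes the number of tags), the following is a minimal sketch of one possible combination: pairwise r² as the measure and a greedy set-cover heuristic as the minimization. It is not the review's 3-step framework or any specific published algorithm. The function names `r_squared` and `greedy_tag_snps`, the 0.8 threshold, and the toy haplotypes are illustrative assumptions, and the sketch assumes phased, biallelic data coded as 0/1.

```python
from itertools import combinations


def r_squared(haps, i, j):
    """Pairwise r^2 between biallelic SNPs i and j over phased 0/1 haplotypes."""
    n = len(haps)
    p_i = sum(h[i] for h in haps) / n          # allele-1 frequency at SNP i
    p_j = sum(h[j] for h in haps) / n          # allele-1 frequency at SNP j
    p_ij = sum(h[i] * h[j] for h in haps) / n  # frequency of the 1-1 haplotype
    denom = p_i * (1 - p_i) * p_j * (1 - p_j)
    if denom == 0:
        # A monomorphic SNP has undefined r^2; treat it as uncorrelated (assumption).
        return 0.0
    d = p_ij - p_i * p_j                       # LD coefficient D
    return d * d / denom


def greedy_tag_snps(haps, threshold=0.8):
    """Greedily pick SNPs until every SNP has r^2 >= threshold with some tag."""
    m = len(haps[0])
    # covers[i] is the set of SNPs that SNP i would tag (always including itself).
    covers = {i: {i} for i in range(m)}
    for i, j in combinations(range(m), 2):
        if r_squared(haps, i, j) >= threshold:
            covers[i].add(j)
            covers[j].add(i)
    untagged = set(range(m))
    tags = []
    while untagged:
        # Choose the SNP covering the most still-untagged SNPs (ties: lowest index).
        best = max(range(m), key=lambda i: len(covers[i] & untagged))
        tags.append(best)
        untagged -= covers[best]
    return tags


# Toy data (hypothetical): SNPs 0 and 1 are identical, SNPs 2 and 3 are complementary.
haps = [
    [0, 0, 0, 1],
    [0, 0, 1, 0],
    [1, 1, 0, 1],
    [1, 1, 1, 0],
]
print(greedy_tag_snps(haps))  # -> [0, 2]
```

In this toy example two of the four SNPs suffice, because each remaining SNP is perfectly correlated with one of the chosen tags. Other correlation measures (for example haplotype-based ones) or exact minimization methods could be substituted for either component.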