
Data Mining

Overview
Journal: Genet Epidemiol
Specialties: Genetics, Public Health
Date: 2005 Dec 13
PMID: 16342179
Citations: 4
Abstract

Group 14 used data-mining strategies to evaluate a number of issues, including appropriate diagnosis, haplotype estimation, genetic linkage and association studies, and type I error. Methods ranged from exploratory analyses, to machine learning strategies (neural networks, supervised learning, and tree-based methods), to false discovery rate control of type I errors. The general motivations were to find the "story" in the data and to summarize information from a multitude of measures. Several methods illustrated strategies for better trait definition, using summarization of related traits. In the few studies that sought to identify genes for alcoholism, there was little agreement among the different strategies, likely reflecting the complexities of the disease. Nevertheless, Group 14 found that these methods offered strategies to gain a better understanding of the complex pathways by which disease develops.
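
As an illustration of what false discovery rate control of type I errors involves, the following is a minimal sketch of the Benjamini-Hochberg step-up procedure, one common FDR method. The abstract does not say which FDR procedure the Group 14 contributions used, so the choice of method, the function name `benjamini_hochberg`, and the p-values below are assumptions made purely for illustration and are not taken from the study.

```python
def benjamini_hochberg(p_values, q=0.05):
    """Return a list of booleans, True where a test is rejected at FDR level q.

    Illustrative sketch of the Benjamini-Hochberg step-up procedure;
    not the specific method used by any Group 14 contribution.
    """
    m = len(p_values)
    if m == 0:
        return []
    # Indices of the tests, ordered by ascending p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest 1-based rank k with p_(k) <= (k / m) * q.
    max_k = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * q:
            max_k = rank
    # Reject every hypothesis ranked at or below max_k.
    rejected = [False] * m
    for idx in order[:max_k]:
        rejected[idx] = True
    return rejected


# Hypothetical p-values (e.g. one per marker tested); not real data.
p_values = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
print(benjamini_hochberg(p_values, q=0.05))
```

With these made-up values, an unadjusted 0.05 cutoff would flag five tests, whereas the step-up rule keeps only the two smallest p-values. This shows how FDR control trades some power for fewer false positives when many markers are tested at once.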

Citing Articles

Gene-environment interactions in genome-wide association studies: a comparative study of tests applied to empirical studies of type 2 diabetes.

Cornelis M, Tchetgen Tchetgen E, Liang L, Qi L, Chatterjee N, Hu F. Am J Epidemiol. 2011; 175(3):191-202.

PMID: 22199026 PMC: 3261439. DOI: 10.1093/aje/kwr368.


A conditional synergy index to assess biological interaction.

Foraita R. Eur J Epidemiol. 2009; 24(9):485-94.

PMID: 19669411 DOI: 10.1007/s10654-009-9378-z.


Application of two machine learning algorithms to genetic association studies in the presence of covariates.

Nonyane B, Foulkes A. BMC Genet. 2008; 9:71.

PMID: 19014573 PMC: 2620353. DOI: 10.1186/1471-2156-9-71.


A parallel genetic algorithm to discover patterns in genetic markers that indicate predisposition to multifactorial disease.

Rausch T, Thomas A, Camp N, Cannon-Albright L, Facelli J. Comput Biol Med. 2008; 38(7):826-36.

PMID: 18547558 PMC: 2532987. DOI: 10.1016/j.compbiomed.2008.04.011.