PMID: 15601538

Development of an Integrated Genome Informatics, Data Management and Workflow Infrastructure: a Toolbox for the Study of Complex Disease Genetics

Abstract

The genetic dissection of complex disease remains a significant challenge. Sample tracking, together with the recording, processing and storage of high-throughput laboratory data alongside public domain data, requires the integration of databases, genome informatics and genetic analyses in an easily updated and scalable format. To find genes involved in multifactorial diseases such as type 1 diabetes (T1D), chromosome regions are defined based on functional candidate gene content, linkage information from humans and mapping information from animal models. For each region, genomic information is extracted from Ensembl, converted and loaded into ACeDB for manual gene annotation. Homology information is examined using ACeDB tools and the gene structure is verified. Manually curated genes are then extracted from ACeDB and read into a feature database, which holds relevant local genomic feature data and an audit trail of laboratory investigations. Public domain information, manually curated genes, polymorphisms, primers, and linkage and association analyses, with links to our genotyping database, are displayed in GBrowse. The system scales to include genetic, statistical, quality control (QC) and biological data, such as expression analyses of RNA or protein, all linked from an integrative genomics display. Our system is applicable to any genetic study of complex disease, whether large or small in scale.
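As an illustration of the step in which manually curated genes are passed on to a genome browser, the sketch below renders a curated gene as GFF3 feature lines, a format that GBrowse can load. It is a minimal sketch under stated assumptions and not the authors' implementation: the abstract does not say which exchange format or schema was used, and the `CuratedGene` record, the `gene_to_gff3` function, the `curated` source label and the single-transcript simplification are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class CuratedGene:
    """Hypothetical record for one manually curated gene (not the paper's schema)."""
    gene_id: str
    seqid: str                      # chromosome or contig name
    strand: str                     # "+" or "-"
    exons: List[Tuple[int, int]] = field(default_factory=list)  # 1-based, inclusive


def gene_to_gff3(gene: CuratedGene, source: str = "curated") -> List[str]:
    """Render a gene, one transcript and its exons as tab-separated GFF3 lines.

    Assumption: each gene has a single transcript spanning all of its exons.
    """
    if not gene.exons:
        raise ValueError("gene has no exons")
    exons = sorted(gene.exons)
    start = exons[0][0]
    end = max(exon_end for _, exon_end in exons)
    transcript_id = f"{gene.gene_id}.t1"

    # Columns: seqid, source, type, start, end, score, strand, phase, attributes
    rows = [
        (gene.seqid, source, "gene", start, end, ".", gene.strand, ".",
         f"ID={gene.gene_id}"),
        (gene.seqid, source, "mRNA", start, end, ".", gene.strand, ".",
         f"ID={transcript_id};Parent={gene.gene_id}"),
    ]
    for i, (exon_start, exon_end) in enumerate(exons, 1):
        rows.append(
            (gene.seqid, source, "exon", exon_start, exon_end, ".", gene.strand, ".",
             f"ID={transcript_id}.exon{i};Parent={transcript_id}")
        )
    return ["\t".join(str(value) for value in row) for row in rows]


if __name__ == "__main__":
    example = CuratedGene("GENE1", "chr1", "+", [(300, 400), (100, 200)])
    print("##gff-version 3")
    print("\n".join(gene_to_gff3(example)))
```

Keeping the curated record separate from its rendering means the same record could also be written to a local feature database; only the output function would change.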

Citing Articles

Modeling complex workflow in molecular diagnostics: design specifications of laboratory software for support of personalized medicine.

Gomah M, Turley J, Lu H, Jones D. J Mol Diagn. 2009; 12(1):51-7.

PMID: 20007844 PMC: 2797718. DOI: 10.2353/jmoldx.2010.090082.


T1DBase: integration and presentation of complex data for type 1 diabetes research.

Hulbert E, Smink L, Adlem E, Allen J, Burdick D, Burren O. Nucleic Acids Res. 2006; 35(Database issue):D742-6.

PMID: 17169983 PMC: 1781218. DOI: 10.1093/nar/gkl933.


Discovery, linkage disequilibrium and association analyses of polymorphisms of the immune complement inhibitor, decay-accelerating factor gene (DAF/CD55) in type 1 diabetes.

Taniguchi H, Lowe C, Cooper J, Smyth D, Bailey R, Nutland S. BMC Genet. 2006; 7:22.

PMID: 16626483 PMC: 1479364. DOI: 10.1186/1471-2156-7-22.


Analysis of polymorphisms in 16 genes in type 1 diabetes that have been associated with other immune-mediated diseases.

Smyth D, Howson J, Payne F, Maier L, Bailey R, Holland K. BMC Med Genet. 2006; 7:20.

PMID: 16519819 PMC: 1420277. DOI: 10.1186/1471-2350-7-20.


Polymorphism discovery and association analyses of the interferon genes in type 1 diabetes.

Morris G, Lowe C, Cooper J, Payne F, Vella A, Godfrey L. BMC Genet. 2006; 7:12.

PMID: 16504056 PMC: 1402321. DOI: 10.1186/1471-2156-7-12.

