» Articles » PMID: 17217505

Determination of the Minimum Number of Microarray Experiments for Discovery of Gene Expression Patterns

Overview
Publisher Biomed Central
Specialty Biology
Date 2007 Jan 16
PMID 17217505
Citations 1
Authors
Affiliations
Soon will be listed here.
Abstract

Background: One type of DNA microarray experiment is discovery of gene expression patterns for a cell line undergoing a biological process over a series of time points. Two important issues with such an experiment are the number of time points, and the interval between them. In the absence of biological knowledge regarding appropriate values, it is natural to question whether the behaviour of progressively generated data may by itself determine a threshold beyond which further microarray experiments do not contribute to pattern discovery. Additionally, such a threshold implies a minimum number of microarray experiments, which is important given the cost of these experiments.

Results: We have developed a method for determining the minimum number of microarray experiments (i.e. time points) for temporal gene expression, assuming that the span between time points is given and the hierarchical clustering technique is used for gene expression pattern discovery. The key idea is a similarity measure for two clusterings which is expressed as a function of the data for progressive time points. While the experiments are underway, this function is evaluated. When the function reaches its maximum, it indicates the set of experiments reach a saturated state. Therefore, further experiments do not contribute to the discrimination of patterns.

Conclusion: The method has been verified with two previously published gene expression datasets. For both experiments, the number of time points determined with our method is less than in the published experiments. It is noted that the overall approach is applicable to other clustering techniques.

Citing Articles

Development of computations in bioscience and bioinformatics and its application: review of the Symposium of Computations in Bioinformatics and Bioscience (SCBB06).

Deng Y, Ni J, Zhang C BMC Bioinformatics. 2007; 7 Suppl 4:S1.

PMID: 17217501 PMC: 1780134. DOI: 10.1186/1471-2105-7-S4-S1.

References
1.
Filkov V, Skiena S, Zhi J . Analysis techniques for microarray time-series data. J Comput Biol. 2002; 9(2):317-30. DOI: 10.1089/10665270252935485. View

2.
Hwang D, Schmitt W, Stephanopoulos G, Stephanopoulos G . Determination of minimum sample size and discriminatory expression patterns in microarray data. Bioinformatics. 2002; 18(9):1184-93. DOI: 10.1093/bioinformatics/18.9.1184. View

3.
Schena M, Shalon D, Davis R, Brown P . Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science. 1995; 270(5235):467-70. DOI: 10.1126/science.270.5235.467. View

4.
Langmead C, Yan A, McClung C, Donald B . Phase-independent rhythmic analysis of genome-wide expression patterns. J Comput Biol. 2003; 10(3-4):521-36. DOI: 10.1089/10665270360688165. View

5.
Iyer V, Eisen M, Ross D, Schuler G, Moore T, Lee J . The transcriptional program in the response of human fibroblasts to serum. Science. 1999; 283(5398):83-7. DOI: 10.1126/science.283.5398.83. View