Adding Confidence to Gene Expression Clustering

Overview
Journal Genetics
Specialty Genetics
Date 2005 Jun 10
PMID 15944369
Citations 6
Abstract

It has been well established that gene expression data contain large amounts of random variation that affects both the analysis and the results of microarray experiments. Typically, microarray data are either tested for differential expression between conditions or grouped on the basis of profiles that are assessed temporally or across genetic or environmental conditions. While testing differential expression relies on levels of certainty to evaluate the relative worth of various analyses, cluster analysis is exploratory in nature and has not had the benefit of any judgment of statistical inference. By using a novel dissimilarity function to ascertain gene expression clusters and conditional randomization of the data space to illuminate distinctions between statistically significant clusters of gene expression patterns, we aim to provide a level of confidence to inferred clusters of gene expression data. We apply both permutation and convex hull approaches for randomization of the data space and show that both methods can provide an effective assessment of gene expression profiles whose coregulation is statistically different from that expected by random chance alone.
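The permutation idea in the abstract can be illustrated with a minimal sketch. The abstract does not specify the paper's dissimilarity function or the details of its conditional randomization, so everything below is an assumption made for illustration: `correlation_dissimilarity` (1 minus Pearson correlation) stands in for the novel dissimilarity function, `cluster_tightness` (mean pairwise dissimilarity) is a hypothetical test statistic, and the null distribution is built by shuffling each gene's values across conditions independently. The convex hull variant is not shown.

```python
import random
from itertools import combinations
from statistics import mean


def correlation_dissimilarity(x, y):
    """Return 1 - Pearson correlation (a stand-in, not the paper's function)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 1.0  # treat a flat profile as uncorrelated
    return 1.0 - sxy / (sxx * syy) ** 0.5


def cluster_tightness(profiles):
    """Mean pairwise dissimilarity among the profiles of one candidate cluster."""
    return mean(correlation_dissimilarity(a, b)
                for a, b in combinations(profiles, 2))


def permutation_p_value(profiles, n_perm=1000, seed=0):
    """Estimate how often randomized profiles are at least as tight as observed.

    Assumed null model: each gene's values are shuffled across conditions
    independently, which breaks any shared pattern between genes.
    """
    rng = random.Random(seed)
    observed = cluster_tightness(profiles)
    hits = 0
    for _ in range(n_perm):
        shuffled = [rng.sample(p, len(p)) for p in profiles]
        if cluster_tightness(shuffled) <= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)


# Hypothetical data: four genes following one pattern at different scales.
base = [0.1, 0.5, 1.2, 2.0, 2.9, 3.5, 4.1, 5.0]
cluster = [[scale * v + offset for v in base]
           for scale, offset in [(1.0, 0.0), (2.0, 1.0), (0.5, 3.0), (1.5, -1.0)]]
p_value = permutation_p_value(cluster, n_perm=200, seed=1)
```

A small `p_value` here means the candidate cluster is tighter than almost all randomized versions of the same data, which is the sense in which a cluster could be called statistically distinguishable from chance under these assumptions.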

Citing Articles

petal: Co-expression network modelling in R.

Petereit J, Smith S, Harris Jr F, Schlauch K. BMC Syst Biol. 2016; 10 Suppl 2:51.

PMID: 27490697 PMC: 4977474. DOI: 10.1186/s12918-016-0298-8.


Dynamic clustering of gene expression.

An L, Doerge R. ISRN Bioinform. 2015; 2012:537217.

PMID: 25969750 PMC: 4393063. DOI: 10.5402/2012/537217.


Significant distinct branches of hierarchical trees: a framework for statistical analysis and applications to biological data.

Sun G, Krasnitz A. BMC Genomics. 2014; 15:1000.

PMID: 25409689 PMC: 4253613. DOI: 10.1186/1471-2164-15-1000.


Comparative analysis of acute and chronic corticosteroid pharmacogenomic effects in rat liver: transcriptional dynamics and regulatory structures.

Nguyen T, Almon R, DuBois D, Jusko W, Androulakis I. BMC Bioinformatics. 2010; 11:515.

PMID: 20946642 PMC: 2973961. DOI: 10.1186/1471-2105-11-515.


Importance of replication in analyzing time-series gene expression data: corticosteroid dynamics and circadian patterns in rat liver.

Nguyen T, Almon R, DuBois D, Jusko W, Androulakis I. BMC Bioinformatics. 2010; 11:279.

PMID: 20500897 PMC: 2889936. DOI: 10.1186/1471-2105-11-279.

