» Articles » PMID: 14557111

Multivariate Exploratory Tools for Microarray Data Analysis

Overview
Journal Biostatistics
Specialty Public Health
Date 2003 Oct 15
PMID 14557111
Citations 16
Authors
Affiliations
Soon will be listed here.
Abstract

The ultimate success of microarray technology in basic and applied biological sciences depends critically on the development of statistical methods for gene expression data analysis. The most widely used tests for differential expression of genes are essentially univariate. Such tests disregard the multidimensional structure of microarray data. Multivariate methods are needed to utilize the information hidden in gene interactions and hence to provide more powerful and biologically meaningful methods for finding subsets of differentially expressed genes. The objective of this paper is to develop methods of multidimensional search for biologically significant genes, considering expression signals as mutually dependent random variables. To attain these ends, we consider the utility of a pertinent distance between random vectors and its empirical counterpart constructed from gene expression data. The distance furnishes exploratory procedures aimed at finding a target subset of differentially expressed genes. To determine the size of the target subset, we resort to successive elimination of smaller subsets resulting from each step of a random search algorithm based on maximization of the proposed distance. Different stopping rules associated with this procedure are evaluated. The usefulness of the proposed approach is illustrated with an application to the analysis of two sets of gene expression data.

Citing Articles

Gene Coexpression Analyses Differentiate Networks Associated with Diverse Cancers Harboring TP53 Missense or Null Mutations.

Klein K, Oualkacha K, Lafond M, Bhatnagar S, Tonin P, Greenwood C Front Genet. 2016; 7:137.

PMID: 27536319 PMC: 4971393. DOI: 10.3389/fgene.2016.00137.


Evaluation of bias-variance trade-off for commonly used post-summarizing normalization procedures in large-scale gene expression studies.

Qiu X, Hu R, Wu Z PLoS One. 2014; 9(6):e99380.

PMID: 24941114 PMC: 4062409. DOI: 10.1371/journal.pone.0099380.


The impact of quantile and rank normalization procedures on the testing power of gene differential expression analysis.

Qiu X, Wu H, Hu R BMC Bioinformatics. 2013; 14:124.

PMID: 23578321 PMC: 3660216. DOI: 10.1186/1471-2105-14-124.


Analysis of high dimensional data using pre-defined set and subset information, with applications to genomic data.

Guo W, Yang M, Xing C, Peddada S BMC Bioinformatics. 2012; 13:177.

PMID: 22827252 PMC: 3443674. DOI: 10.1186/1471-2105-13-177.


Hierarchical parallelization of gene differential association analysis.

Needham M, Hu R, Dwarkadas S, Qiu X BMC Bioinformatics. 2011; 12:374.

PMID: 21936916 PMC: 3248234. DOI: 10.1186/1471-2105-12-374.