
Conceptual Aspects of Large Meta-analyses with Publicly Available Microarray Data: a Case Study in Oncology

Overview
Publisher Sage Publications
Specialty Biology
Date 2011 Mar 23
PMID 21423405
Citations 2
Abstract

Large public repositories of microarray experiments offer an abundance of biological data. It is of interest to use and combine the available material to create new biological information and to develop a broader view of biological phenomena.

Meta-analyses recombine similar information over a series of experiments to sketch scientific aspects that were not accessible to any single experiment. Meta-analysis of high-throughput experiments has to handle methodological as well as technical challenges. Methodological aspects concern the identification of homogeneous material which can be combined by appropriate statistical procedures. Technical challenges come from the data management of large amounts of high-dimensional data, long computation time, and the handling of the stored phenotype data.

In a meta-analysis of a large series of microarray experiments, this paper compares the interaction structure within selected pathways between different tumour entities. The feasibility of such a study is explored, and a technical as well as a statistical framework for its completion is presented. Multiple obstacles were met during completion of this project. They are mainly related to the quality of the available data and influence the biological interpretation of the results derived.

The sobering experience of our study calls for combined efforts to improve the data quality in public repositories of high-throughput data. The exploration of the available data in large meta-analyses is limited by incomplete documentation of essential aspects of experiments and studies, by technical deficiencies in the data stored, and by careless duplications of data.

Citing Articles

Statistical Evidence Suggests that Inattention Drives Hyperactivity/Impulsivity in Attention Deficit-Hyperactivity Disorder.

Sokolova E, Groot P, Claassen T, van Hulzen K, Glennon J, Franke B. PLoS One. 2016; 11(10):e0165120.

PMID: 27768717 PMC: 5074570. DOI: 10.1371/journal.pone.0165120.


Rethinking Meta-Analysis: Applications for Air Pollution Data and Beyond.

Goodman J, Petito Boyce C, Sax S, Beyer L, Prueitt R. Risk Anal. 2015; 35(6):1017-39.

PMID: 25969128 PMC: 4690509. DOI: 10.1111/risa.12405.
