» Articles » PMID: 12217910

Determination of Minimum Sample Size and Discriminatory Expression Patterns in Microarray Data

Overview
Journal Bioinformatics
Specialty Biology
Date 2002 Sep 10
PMID 12217910
Citations 29
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Transcriptional profiling using microarrays can reveal important information about cellular and tissue expression phenotypes, but these measurements are costly and time consuming. Additionally, tissue sample availability poses further constraints on the number of arrays that can be analyzed in connection with a particular disease or state of interest. It is therefore important to provide a method for the determination of the minimum number of microarrays required to separate, with statistical reliability, distinct disease states or other physiological differences.

Results: Power analysis was applied to estimate the minimum sample size required for two-class and multi-class discrimination. The power analysis algorithm calculates the appropriate sample size for discrimination of phenotypic subtypes in a reduced dimensional space obtained by Fisher discriminant analysis (FDA). This approach was tested by applying the algorithm to existing data sets for estimation of the minimum sample size required for drawing certain conclusions on multi-class distinction with statistical reliability. It was confirmed that when the minimum number of samples estimated from power analysis is used, group means in the FDA discrimination space are statistically different.

Contact: gregstep@mit.edu

Citing Articles

Mental health literacy in a diverse sample of undergraduate students: demographic, psychological, and academic correlates.

Miles R, Rabin L, Krishnan A, Grandoit E, Kloskowski K BMC Public Health. 2020; 20(1):1699.

PMID: 33187487 PMC: 7663887. DOI: 10.1186/s12889-020-09696-0.


Unique molecular signatures of microRNAs in ocular fluids and plasma in diabetic retinopathy.

Smit-McBride Z, Nguyen A, Yu A, Modjtahedi S, Hunter A, Rashid S PLoS One. 2020; 15(7):e0235541.

PMID: 32692745 PMC: 7373301. DOI: 10.1371/journal.pone.0235541.


MinePath: Mining for Phenotype Differential Sub-paths in Molecular Pathways.

Koumakis L, Kanterakis A, Kartsaki E, Chatzimina M, Zervakis M, Tsiknakis M PLoS Comput Biol. 2016; 12(11):e1005187.

PMID: 27832067 PMC: 5104320. DOI: 10.1371/journal.pcbi.1005187.


Study design in high-dimensional classification analysis.

Sanchez B, Wu M, Song P, Wang W Biostatistics. 2016; 17(4):722-36.

PMID: 27154835 PMC: 5031947. DOI: 10.1093/biostatistics/kxw018.


MetSizeR: selecting the optimal sample size for metabolomic studies using an analysis based approach.

Nyamundanda G, Gormley I, Fan Y, Gallagher W, Brennan L BMC Bioinformatics. 2013; 14:338.

PMID: 24261687 PMC: 4222287. DOI: 10.1186/1471-2105-14-338.