
FairSubset: A Tool to Choose Representative Subsets of Data for Use with Replicates or Groups of Different Sample Sizes

Overview
Journal J Biol Methods
Specialty Biology
Date 2019 Oct 5
PMID 31583263
Citations 4
Abstract

High-impact journals are promoting transparency of data. Modern scientific methods can be automated and produce disparate sample sizes. In many cases, it is desirable to retain identical or pre-defined sample sizes between replicates or groups. However, choosing the subset of the originally acquired data that best matches the entire data set without introducing bias is not trivial. Here, we released a free online tool, FairSubset, and its constituent Shiny App R code to subset data in an unbiased fashion. Subsets were set at the same N across samples and retained representative average and standard deviation information. The method can be used for quantitation of entire fields of view or other replicates without biasing the data pool toward large-N samples. We showed examples of the tool's use with fluorescence data and DNA damage-related Comet tail quantitation. The FairSubset tool, and its method of retaining distribution information at the single-datum level, may be considered for standardized use in fair publishing practices.
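The idea in the abstract, subsetting every sample to the same N while keeping its average and standard deviation representative, can be illustrated with a minimal Python sketch. This is not the authors' Shiny App R code. It assumes one plausible strategy: draw many random subsets of size N and keep the one whose mean and standard deviation lie closest to those of the full sample. The function names `fair_subset` and `fair_subsets`, the `trials` and `seed` parameters, the scoring formula, and the choice of the smallest group size as N are all hypothetical choices made for illustration.

```python
import random
import statistics


def fair_subset(values, n, trials=1000, seed=0):
    """Return the random size-n subset whose mean and SD best match `values`.

    Illustrative assumption: candidates are scored by the summed absolute
    differences in mean and SD, scaled by the full-sample SD.
    """
    if not 2 <= n <= len(values):
        raise ValueError("n must be between 2 and len(values)")
    rng = random.Random(seed)
    target_mean = statistics.mean(values)
    target_sd = statistics.stdev(values)
    # Scale by the full-sample SD so the score is unitless; avoid dividing by 0.
    scale = target_sd or 1.0

    best, best_score = None, float("inf")
    for _ in range(trials):
        candidate = rng.sample(values, n)  # sampling without replacement
        score = (abs(statistics.mean(candidate) - target_mean)
                 + abs(statistics.stdev(candidate) - target_sd)) / scale
        if score < best_score:
            best, best_score = candidate, score
    return best


def fair_subsets(groups, trials=1000, seed=0):
    """Subset every group to the size of the smallest group (assumed N)."""
    n = min(len(v) for v in groups.values())
    return {name: fair_subset(list(v), n, trials, seed)
            for name, v in groups.items()}
```

Calling `fair_subsets({"replicate_1": [...], "replicate_2": [...]})` with lists of measurements returns one equally sized subset per replicate. Each datum in a subset is an original measurement, so single-datum distribution information is kept, and no replicate dominates the pooled data simply because it has a larger N.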

Citing Articles

Screening Methods to Discover the FDA-Approved Cancer Drug Encorafenib as Optimally Selective for Metallothionein Gene Loss Ovarian Cancer.

Rees A, Villamor E, Evans D, Gooz M, Fallon C, Mina-Abouda M Genes (Basel). 2025; 16(1.

PMID: 39858588 PMC: 11764637. DOI: 10.3390/genes16010042.


Automated hippocampal segmentation algorithms evaluated in stroke patients.

Schell M, Foltyn-Dumitru M, Bendszus M, Vollmuth P Sci Rep. 2023; 13(1):11712.

PMID: 37474622 PMC: 10359355. DOI: 10.1038/s41598-023-38833-z.


Combination of Autophagy Selective Therapeutics With Doxil: An Assessment of Pathological Toxicity.

Helke K, Gudi R, Vasu C, Delaney J Front Toxicol. 2022; 4:937150.

PMID: 35846434 PMC: 9276957. DOI: 10.3389/ftox.2022.937150.


An Exploratory Analysis of the Internal Structure of Test Through a Multimethods Exploratory Approach of the ASQ:SE in Brazil.

Anunciacao L, Squires J, Landeira-Fernandez J, Singh A J Neurosci Rural Pract. 2022; 13(2):186-195.

PMID: 35694052 PMC: 9187369. DOI: 10.1055/s-0041-1741503.


Single-cell analysis of copy-number alterations in serous ovarian cancer reveals substantial heterogeneity in both low- and high-grade tumors.

Kumar M, Bowers R, Delaney J Cell Cycle. 2020; 19(22):3154-3166.

PMID: 33121339 PMC: 7714496. DOI: 10.1080/15384101.2020.1836439.
