ReGenotyper: Detecting Mislabeled Samples in Genetic Data
Overview
Authors
Affiliations
In high-throughput molecular profiling studies, genotype labels can be wrongly assigned at various experimental steps; the resulting mislabeled samples seriously reduce the power to detect the genetic basis of phenotypic variation. We have developed an approach to detect potential mislabeling, recover the "ideal" genotype and identify "best-matched" labels for mislabeled samples. On average, we identified 4% of samples as mislabeled in eight published datasets, highlighting the necessity of applying a "data cleaning" step before standard data analysis.
Reassessing Hybridisation in Australian Stingless Bees Using Multiple Genetic Markers.
Hereward J, Smith T, Gloag R, Brookes D, Walter G Ecol Evol. 2025; 15(2):e70912.
PMID: 39896774 PMC: 11775563. DOI: 10.1002/ece3.70912.
Snoek B, Sterken M, Nijveen H, Volkers R, Riksen J, Rosenstiel P G3 (Bethesda). 2021; 11(10).
PMID: 34568931 PMC: 8496280. DOI: 10.1093/g3journal/jkab258.
Khalilisamani N, Thomson P, Raadsma H, Khatkar M Sci Rep. 2021; 11(1):18318.
PMID: 34526591 PMC: 8443606. DOI: 10.1038/s41598-021-97873-5.
iDEP Web Application for RNA-Seq Data Analysis.
Ge X Methods Mol Biol. 2021; 2284:417-443.
PMID: 33835455 DOI: 10.1007/978-1-0716-1307-8_22.
Tran P, Tran L, Nechtman J, Santos B, Purohit S, Bin Satter K Sci Rep. 2020; 10(1):20651.
PMID: 33244057 PMC: 7692499. DOI: 10.1038/s41598-020-77777-6.