Cdev: a Ground-truth Based Measure to Evaluate RNA-seq Normalization Performance
Normalization of RNA-seq data has been an active area of research since the problem was first recognized a decade ago. Despite the active development of new normalizers, measures of their performance have received little attention. To evaluate normalizers, researchers have relied on measures that are mostly qualitative, potentially biased, or easily confounded by parametric choices in downstream analysis. We propose a metric called condition-number based deviation, or cdev, to quantify normalization success. cdev measures how much one expression matrix differs from another. If a ground-truth normalization is given, cdev can then be used to evaluate the performance of normalizers. To establish experimental ground truth, we compiled an extensive set of public RNA-seq assays with external spike-ins. This data collection, together with cdev, provides a valuable toolset for benchmarking new and existing normalization methods.