» Articles » PMID: 34707933

Cdev: a Ground-truth Based Measure to Evaluate RNA-seq Normalization Performance

Overview
Journal PeerJ
Date 2021 Oct 28
PMID 34707933
Citations 2
Authors
Affiliations
Soon will be listed here.
Abstract

Normalization of RNA-seq data has been an active area of research since the problem was first recognized a decade ago. Despite the active development of new normalizers, their performance measures have been given little attention. To evaluate normalizers, researchers have been relying on measures, most of which are either qualitative, potentially biased, or easily confounded by parametric choices of downstream analysis. We propose a metric called condition-number based deviation, or to quantify normalization success. measures how much an expression matrix differs from another. If a ground truth normalization is given, can then be used to evaluate the performance of normalizers. To establish experimental ground truth, we compiled an extensive set of public RNA-seq assays with external spike-ins. This data collection, together with provides a valuable toolset for benchmarking new and existing normalization methods.

Citing Articles

Genomic variant benchmark: if you cannot measure it, you cannot improve it.

Majidian S, Agustinho D, Chin C, Sedlazeck F, Mahmoud M Genome Biol. 2023; 24(1):221.

PMID: 37798733 PMC: 10552390. DOI: 10.1186/s13059-023-03061-1.


Temporal progress of gene expression analysis with RNA-Seq data: A review on the relationship between computational methods.

Costa-Silva J, Domingues D, Menotti D, Hungria M, Lopes F Comput Struct Biotechnol J. 2022; 21:86-98.

PMID: 36514333 PMC: 9730150. DOI: 10.1016/j.csbj.2022.11.051.

References
1.
Zhuo B, Emerson S, Chang J, Di Y . Identifying stably expressed genes from multiple RNA-Seq data sets. PeerJ. 2016; 4:e2791. PMC: 5178351. DOI: 10.7717/peerj.2791. View

2.
Lesluyes T, Baud J, Perot G, Charon-Barra C, You A, Valo I . Genomic and transcriptomic comparison of post-radiation versus sporadic sarcomas. Mod Pathol. 2019; 32(12):1786-1794. DOI: 10.1038/s41379-019-0300-2. View

3.
Sun J, Nishiyama T, Shimizu K, Kadota K . TCC: an R package for comparing tag count data with robust normalization strategies. BMC Bioinformatics. 2013; 14:219. PMC: 3716788. DOI: 10.1186/1471-2105-14-219. View

4.
. A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium. Nat Biotechnol. 2014; 32(9):903-14. PMC: 4321899. DOI: 10.1038/nbt.2957. View

5.
Soneson C, Delorenzi M . A comparison of methods for differential expression analysis of RNA-seq data. BMC Bioinformatics. 2013; 14:91. PMC: 3608160. DOI: 10.1186/1471-2105-14-91. View