» Articles » PMID: 22914217

Systematic Comparison of RNA-Seq Normalization Methods Using Measurement Error Models

Overview
Journal Bioinformatics
Specialty Biology
Date 2012 Aug 24
PMID 22914217
Citations 14
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Further advancement of RNA-Seq technology and its application call for the development of effective normalization methods for RNA-Seq data. Currently, different normalization methods are compared and validated by their correlations with a certain gold standard. Gene expression measurements generated by a different technology or platform such as Real-time reverse transcription polymerase chain reaction (qRT-PCR) or Microarray are usually used as the gold standard. Although the current approach is intuitive and easy to implement, it becomes statistically inadequate when the gold standard is also subject to measurement error (ME). Furthermore, the current approach is not informative, because the correlation of a normalization method with a certain gold standard does not provide much information about the exact quality of the normalized RNA-Seq measurements.

Results: We propose to use the system of ME models based on qRT-PCR, Microarray and RNA-Seq gene expression data to compare and validate RNA-Seq normalization methods. This approach does not assume the existence of a gold standard. The performance of a normalization method can be characterized by a group of parameters of the system, which are referred to as the performance parameters, and these performance parameters can be consistently estimated. Different normalization methods can thus be compared by comparing their corresponding estimated performance parameters. We applied the proposed approach to compare five existing RNA-Seq normalization methods using the gene expression data of two RNA samples from the microArray Quality Control and Sequencing Quality Control projects and gained much insight about the pros and cons of these methods.

Citing Articles

Analyzing the relationship between gene expression and phenotype in space-flown mice using a causal inference machine learning ensemble.

Casaletto J, Scott R, Myrick M, Mackintosh G, Chok H, Saravia-Butler A Sci Rep. 2025; 15(1):2363.

PMID: 39824847 PMC: 11748630. DOI: 10.1038/s41598-024-81394-y.


Normalization of RNA-Seq data using adaptive trimmed mean with multi-reference.

Singh V, Kirtipal N, Song B, Lee S Brief Bioinform. 2024; 25(3).

PMID: 38770720 PMC: 11107385. DOI: 10.1093/bib/bbae241.


Error modelled gene expression analysis (EMOGEA) provides a superior overview of time course RNA-seq measurements and low count gene expression.

Barra J, Taverna F, Bong F, Ahmed I, Karakach T Brief Bioinform. 2024; 25(3).

PMID: 38770716 PMC: 11106635. DOI: 10.1093/bib/bbae233.


Transcriptome-wide profiling of acute stress induced changes in ribosome occupancy level using external standards.

Shieh A, Bansal S, Zuo Z, Wang S PLoS One. 2023; 18(11):e0294308.

PMID: 37988379 PMC: 10662766. DOI: 10.1371/journal.pone.0294308.


A field-wide assessment of differential expression profiling by high-throughput sequencing reveals widespread bias.

Pall T, Luidalepp H, Tenson T, Maivali U PLoS Biol. 2023; 21(3):e3002007.

PMID: 36862747 PMC: 10013925. DOI: 10.1371/journal.pbio.3002007.