
A Comprehensive Assessment of RNA-seq Accuracy, Reproducibility and Information Content by the Sequencing Quality Control Consortium

Overview
Journal: Nat Biotechnol
Specialty: Biotechnology
Date: 2014 Aug 25
PMID: 25150838
Citations: 508
Abstract

We present primary results from the Sequencing Quality Control (SEQC) project, coordinated by the US Food and Drug Administration. Examining Illumina HiSeq, Life Technologies SOLiD and Roche 454 platforms at multiple laboratory sites using reference RNA samples with built-in controls, we assess RNA sequencing (RNA-seq) performance for junction discovery and differential expression profiling and compare it to microarray and quantitative PCR (qPCR) data using complementary metrics. At all sequencing depths, we discover unannotated exon-exon junctions, with >80% validated by qPCR. We find that measurements of relative expression are accurate and reproducible across sites and platforms if specific filters are used. In contrast, RNA-seq and microarrays do not provide accurate absolute measurements, and gene-specific biases are observed for all examined platforms, including qPCR. Measurement performance depends on the platform and data analysis pipeline, and variation is large for transcript-level profiling. The complete SEQC data sets, comprising >100 billion reads (10Tb), provide unique resources for evaluating RNA-seq analyses for clinical and regulatory settings.

Citing Articles

Molecular subtyping of stage I lung adenocarcinoma via molecular alterations in pre-invasive lesion progression.

Shang J, Jiang H, Zhao Y, Yang J, Lin Y, Zhang N. J Transl Med. 2025; 23(1):263.

PMID: 40038757 PMC: 11877874. DOI: 10.1186/s12967-025-06316-6.


Emerging Roles of Long Non-Coding RNAs in Cardiovascular Diseases.

Kong X, Li F, Wang Y. J Cell Mol Med. 2025; 29(5):e70453.

PMID: 40032652 PMC: 11875779. DOI: 10.1111/jcmm.70453.


Sources of non-uniform coverage in short-read RNA-Seq data.

Brooks T, Lahens N, Mrcela A, Yang J, Purohit S, Naik A. bioRxiv. 2025.

PMID: 39975309 PMC: 11838458. DOI: 10.1101/2025.01.30.634337.


Worm Perturb-Seq: massively parallel whole-animal RNAi and RNA-seq.

Zhang H, Li X, Song D, Yukselen O, Nanda S, Kucukural A. bioRxiv. 2025.

PMID: 39975282 PMC: 11838469. DOI: 10.1101/2025.02.02.636107.


Phytochrome-mediated shade avoidance responses impact the structure and composition of the bacterial phyllosphere microbiome of Arabidopsis.

O'Rourke J, Vincent S, Williams I, Gascoyne E, Devlin P. Environ Microbiome. 2025; 20(1):20.

PMID: 39915883 PMC: 11800596. DOI: 10.1186/s40793-025-00679-5.

