» Articles » PMID: 18550803

RNA-seq: an Assessment of Technical Reproducibility and Comparison with Gene Expression Arrays

Overview
Journal Genome Res
Specialty Genetics
Date 2008 Jun 14
PMID 18550803
Citations 1453
Authors
Affiliations
Soon will be listed here.
Abstract

Ultra-high-throughput sequencing is emerging as an attractive alternative to microarrays for genotyping, analysis of methylation patterns, and identification of transcription factor binding sites. Here, we describe an application of the Illumina sequencing (formerly Solexa sequencing) platform to study mRNA expression levels. Our goals were to estimate technical variance associated with Illumina sequencing in this context and to compare its ability to identify differentially expressed genes with existing array technologies. To do so, we estimated gene expression differences between liver and kidney RNA samples using multiple sequencing replicates, and compared the sequencing data to results obtained from Affymetrix arrays using the same RNA samples. We find that the Illumina sequencing data are highly replicable, with relatively little technical variation, and thus, for many purposes, it may suffice to sequence each mRNA sample only once (i.e., using one lane). The information in a single lane of Illumina sequencing data appears comparable to that in a single array in enabling identification of differentially expressed genes, while allowing for additional analyses such as detection of low-expressed genes, alternative splice variants, and novel transcripts. Based on our observations, we propose an empirical protocol and a statistical framework for the analysis of gene expression using ultra-high-throughput sequencing technology.

Citing Articles

Exploration of RNA-binding proteins identified RPS27 as a potential regulator associated with Kaposi's sarcoma development.

Zhang J, Wang P, Li T, Luo D, Qu Y, Ding Y BMC Cancer. 2025; 25(1):362.

PMID: 40016701 PMC: 11866810. DOI: 10.1186/s12885-025-13790-0.


Robust Cluster Prediction Across Data Types Validates Association of Sex and Therapy Response in GBM.

Gibbs D, Cioffi G, Aguilar B, Waite K, Pan E, Mandel J Cancers (Basel). 2025; 17(3).

PMID: 39941811 PMC: 11815886. DOI: 10.3390/cancers17030445.


Simplicity within biological complexity.

Przulj N, Malod-Dognin N Bioinform Adv. 2025; 5(1):vbae164.

PMID: 39927291 PMC: 11805345. DOI: 10.1093/bioadv/vbae164.


Exploring RNA-Seq Data Analysis Through Visualization Techniques and Tools: A Systematic Review of Opportunities and Limitations for Clinical Applications.

Manzoor F, Tsurgeon C, Gupta V Bioengineering (Basel). 2025; 12(1).

PMID: 39851330 PMC: 11760846. DOI: 10.3390/bioengineering12010056.


edgeR v4: powerful differential analysis of sequencing data with expanded functionality and improved support for small counts and larger datasets.

Chen Y, Chen L, Lun A, Baldoni P, Smyth G Nucleic Acids Res. 2025; 53(2).

PMID: 39844453 PMC: 11754124. DOI: 10.1093/nar/gkaf018.


References
1.
Birney E, Stamatoyannopoulos J, Dutta A, Guigo R, Gingeras T, Margulies E . Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature. 2007; 447(7146):799-816. PMC: 2212820. DOI: 10.1038/nature05874. View

2.
Torres T, Metta M, Ottenwalder B, Schlotterer C . Gene expression profiling by massively parallel sequencing. Genome Res. 2007; 18(1):172-7. PMC: 2134766. DOI: 10.1101/gr.6984908. View

3.
White K . Functional genomics and the study of development, variation and evolution. Nat Rev Genet. 2001; 2(7):528-37. DOI: 10.1038/35080565. View

4.
Gautier L, Cope L, Bolstad B, Irizarry R . affy--analysis of Affymetrix GeneChip data at the probe level. Bioinformatics. 2004; 20(3):307-15. DOI: 10.1093/bioinformatics/btg405. View

5.
Mikkelsen T, Ku M, Jaffe D, Issac B, Lieberman E, Giannoukos G . Genome-wide maps of chromatin state in pluripotent and lineage-committed cells. Nature. 2007; 448(7153):553-60. PMC: 2921165. DOI: 10.1038/nature06008. View