» Articles » PMID: 21903743

Differential Expression in RNA-seq: a Matter of Depth

Overview
Journal Genome Res
Specialty Genetics
Date 2011 Sep 10
PMID 21903743
Citations 778
Authors
Affiliations
Soon will be listed here.
Abstract

Next-generation sequencing (NGS) technologies are revolutionizing genome research, and in particular, their application to transcriptomics (RNA-seq) is increasingly being used for gene expression profiling as a replacement for microarrays. However, the properties of RNA-seq data have not been yet fully established, and additional research is needed for understanding how these data respond to differential expression analysis. In this work, we set out to gain insights into the characteristics of RNA-seq data analysis by studying an important parameter of this technology: the sequencing depth. We have analyzed how sequencing depth affects the detection of transcripts and their identification as differentially expressed, looking at aspects such as transcript biotype, length, expression level, and fold-change. We have evaluated different algorithms available for the analysis of RNA-seq and proposed a novel approach--NOISeq--that differs from existing methods in that it is data-adaptive and nonparametric. Our results reveal that most existing methodologies suffer from a strong dependency on sequencing depth for their differential expression calls and that this results in a considerable number of false positives that increases as the number of reads grows. In contrast, our proposed method models the noise distribution from the actual data, can therefore better adapt to the size of the data set, and is more effective in controlling the rate of false discoveries. This work discusses the true potential of RNA-seq for studying regulation at low expression ranges, the noise within RNA-seq data, and the issue of replication.

Citing Articles

FoxO3 controls cardiomyocyte proliferation and heart regeneration by regulating Sfrp2 expression in postnatal mice.

Xia J, Liu K, Lin X, Li H, Lin J, Li L Nat Commun. 2025; 16(1):2532.

PMID: 40087279 DOI: 10.1038/s41467-025-57962-9.


Prion protein promotes copper toxicity in Wilson disease.

Petruzzelli R, Catalano F, Crispino R, Polishchuk E, Elia M, Masone A Nat Commun. 2025; 16(1):1468.

PMID: 39922819 PMC: 11807206. DOI: 10.1038/s41467-025-56740-x.


The Convergent Evolution of Hummingbird Pollination Results in Repeated Floral Scent Loss Through Gene Downregulation.

Darragh K, Kay K, Ramirez S Mol Biol Evol. 2025; 42(2).

PMID: 39899329 PMC: 11848715. DOI: 10.1093/molbev/msaf027.


Comparative transcriptome profiling reveals the mechanism of increasing lysine and tryptophan content through pyramiding , and genes in maize.

Wu P, Yuan Y, Ma Z, Zhang K, Deng L, Ren H Breed Sci. 2025; 74(4):311-323.

PMID: 39872326 PMC: 11769590. DOI: 10.1270/jsbbs.23051.


Combined transcriptome and whole genome sequencing analyses reveal candidate drug-resistance genes of .

Yu Y, Dong H, Zhao Q, Zhu S, Wang H, Yao Y iScience. 2025; 28(1):111592.

PMID: 39811641 PMC: 11732515. DOI: 10.1016/j.isci.2024.111592.


References
1.
Lemay J, DAmours A, Lemieux C, Lackner D, St-Sauveur V, Bahler J . The nuclear poly(A)-binding protein interacts with the exosome to promote synthesis of noncoding small nucleolar RNAs. Mol Cell. 2010; 37(1):34-45. DOI: 10.1016/j.molcel.2009.12.019. View

2.
Kim V, Han J, Siomi M . Biogenesis of small RNAs in animals. Nat Rev Mol Cell Biol. 2009; 10(2):126-39. DOI: 10.1038/nrm2632. View

3.
Bloom J, Khan Z, Kruglyak L, Singh M, Caudy A . Measuring differential gene expression by short read sequencing: quantitative comparison to 2-channel gene expression microarrays. BMC Genomics. 2009; 10:221. PMC: 2686739. DOI: 10.1186/1471-2164-10-221. View

4.
Park P . ChIP-seq: advantages and challenges of a maturing technology. Nat Rev Genet. 2009; 10(10):669-80. PMC: 3191340. DOI: 10.1038/nrg2641. View

5.
Argout X, Salse J, Aury J, Guiltinan M, Droc G, Gouzy J . The genome of Theobroma cacao. Nat Genet. 2010; 43(2):101-8. DOI: 10.1038/ng.736. View