» Articles » PMID: 28535263

Gene Expression Variability and the Analysis of Large-scale RNA-seq Studies with the MDSeq

Overview
Specialty Biochemistry
Date 2017 May 24
PMID 28535263
Citations 18
Authors
Affiliations
Soon will be listed here.
Abstract

Rapidly decreasing cost of next-generation sequencing has led to the recent availability of large-scale RNA-seq data, that empowers the analysis of gene expression variability, in addition to gene expression means. In this paper, we present the MDSeq, based on the coefficient of dispersion, to provide robust and computationally efficient analysis of both gene expression means and variability on RNA-seq counts. The MDSeq utilizes a novel reparametrization of the negative binomial to provide flexible generalized linear models (GLMs) on both the mean and dispersion. We address challenges of analyzing large-scale RNA-seq data via several new developments to provide a comprehensive toolset that models technical excess zeros, identifies outliers efficiently, and evaluates differential expressions at biologically interesting levels. We evaluated performances of the MDSeq using simulated data when the ground truths are known. Results suggest that the MDSeq often outperforms current methods for the analysis of gene expression mean and variability. Moreover, the MDSeq is applied in two real RNA-seq studies, in which we identified functionally relevant genes and gene pathways. Specifically, the analysis of gene expression variability with the MDSeq on the GTEx human brain tissue data has identified pathways associated with common neurodegenerative disorders when gene expression means were conserved.

Citing Articles

Luminal epithelial cells integrate variable responses to aging into stereotypical changes that underlie breast cancer susceptibility.

Sayaman R, Miyano M, Carlson E, Senapati P, Zirbes A, Shalabi S Elife. 2024; 13.

PMID: 39545637 PMC: 11723586. DOI: 10.7554/eLife.95720.


Gene coexpression networks reveal a broad role for lncRNAs in inflammatory bowel disease.

Johnson J, Sargsyan D, Neiman E, Hart A, Stojmirovic A, Kosoy R JCI Insight. 2024; 9(3).

PMID: 38329124 PMC: 10967393. DOI: 10.1172/jci.insight.168988.


Measuring cell-to-cell expression variability in single-cell RNA-sequencing data: a comparative analysis and applications to B cell aging.

Zheng H, Vijg J, Fard A, Mar J Genome Biol. 2023; 24(1):238.

PMID: 37864221 PMC: 10588274. DOI: 10.1186/s13059-023-03036-2.


clrDV: a differential variability test for RNA-Seq data based on the skew-normal distribution.

Li H, Khang T PeerJ. 2023; 11:e16126.

PMID: 37790621 PMC: 10544356. DOI: 10.7717/peerj.16126.


Quantifying transcriptome diversity: a review.

Jones E, Haldar A, Oza V, Lasseigne B Brief Funct Genomics. 2023; 23(2):83-94.

PMID: 37225889 PMC: 11484519. DOI: 10.1093/bfgp/elad019.


References
1.
Brown A, Buil A, Vinuela A, Lappalainen T, Zheng H, Richards J . Genetic interactions affecting human gene expression identified by variance association mapping. Elife. 2014; 3:e01381. PMC: 4017648. DOI: 10.7554/eLife.01381. View

2.
Plaisier S, Taschereau R, Wong J, Graeber T . Rank-rank hypergeometric overlap: identification of statistically significant overlap between gene-expression signatures. Nucleic Acids Res. 2010; 38(17):e169. PMC: 2943622. DOI: 10.1093/nar/gkq636. View

3.
Richer J, Jacobsen B, Manning N, Abel M, Wolf D, Horwitz K . Differential gene regulation by the two progesterone receptor isoforms in human breast cancer cells. J Biol Chem. 2001; 277(7):5209-18. DOI: 10.1074/jbc.M110090200. View

4.
Smyth G . Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Stat Appl Genet Mol Biol. 2006; 3:Article3. DOI: 10.2202/1544-6115.1027. View

5.
Baran Y, Subramaniam M, Biton A, Tukiainen T, Tsang E, Rivas M . The landscape of genomic imprinting across diverse adult human tissues. Genome Res. 2015; 25(7):927-36. PMC: 4484390. DOI: 10.1101/gr.192278.115. View