» Articles » PMID: 22087737

ReCount: a Multi-experiment Resource of Analysis-ready RNA-seq Gene Count Datasets

Overview
Publisher Biomed Central
Specialty Biology
Date 2011 Nov 18
PMID 22087737
Citations 78
Authors
Affiliations
Soon will be listed here.
Abstract

Unlabelled:

Background: RNA sequencing is a flexible and powerful new approach for measuring gene, exon, or isoform expression. To maximize the utility of RNA sequencing data, new statistical methods are needed for clustering, differential expression, and other analyses. A major barrier to the development of new statistical methods is the lack of RNA sequencing datasets that can be easily obtained and analyzed in common statistical software packages such as R. To speed up the development process, we have created a resource of analysis-ready RNA-sequencing datasets. 2 DESCRIPTION: ReCount is an online resource of RNA-seq gene count tables and auxilliary data. Tables were built from raw RNA sequencing data from 18 different published studies comprising 475 samples and over 8 billion reads. Using the Myrna package, reads were aligned, overlapped with gene models and tabulated into gene-by-sample count tables that are ready for statistical analysis. Count tables and phenotype data were combined into Bioconductor ExpressionSet objects for ease of analysis. ReCount also contains the Myrna manifest files and R source code used to process the samples, allowing statistical and computational scientists to consider alternative parameter values. 3 CONCLUSIONS: By combining datasets from many studies and providing data that has already been processed from. fastq format into ready-to-use. RData and. txt files, ReCount facilitates analysis and methods development for RNA-seq count data. We anticipate that ReCount will also be useful for investigators who wish to consider cross-study comparisons and alternative normalization strategies for RNA-seq.

Citing Articles

To Tweak or Not to Tweak. How Exploiting Flexibilities in Gene Set Analysis Leads to Overoptimism.

Wunsch M, Sauer C, Herrmann M, Hinske L, Boulesteix A Biom J. 2024; 67(1):e70016.

PMID: 39698741 PMC: 11656295. DOI: 10.1002/bimj.70016.


An evaluation of RNA-seq differential analysis methods.

Li D, Zand M, Dye T, Goniewicz M, Rahman I, Xie Z PLoS One. 2022; 17(9):e0264246.

PMID: 36112652 PMC: 9480998. DOI: 10.1371/journal.pone.0264246.


Sparse sliced inverse regression for high dimensional data analysis.

Hilafu H, Safo S BMC Bioinformatics. 2022; 23(1):168.

PMID: 35525975 PMC: 9080177. DOI: 10.1186/s12859-022-04700-3.


Addressing the mean-correlation relationship in co-expression analysis.

Wang Y, Hicks S, Hansen K PLoS Comput Biol. 2022; 18(3):e1009954.

PMID: 35353807 PMC: 9009771. DOI: 10.1371/journal.pcbi.1009954.


Comprehensive analysis of an immune infiltrate-related competitive endogenous RNA network reveals potential prognostic biomarkers for non-small cell lung cancer.

Yang C, Hu L, Huang Z, Deng L, Guo W, Liu S PLoS One. 2021; 16(12):e0260720.

PMID: 34855841 PMC: 8639052. DOI: 10.1371/journal.pone.0260720.


References
1.
Pickrell J, Marioni J, Pai A, Degner J, Engelhardt B, Nkadori E . Understanding mechanisms underlying human gene expression variation with RNA sequencing. Nature. 2010; 464(7289):768-72. PMC: 3089435. DOI: 10.1038/nature08872. View

2.
Storey J, Madeoy J, Strout J, Wurfel M, Ronald J, Akey J . Gene-expression variation within and among human populations. Am J Hum Genet. 2007; 80(3):502-9. PMC: 1821107. DOI: 10.1086/512017. View

3.
Auer P, Doerge R . Statistical design and analysis of RNA sequencing data. Genetics. 2010; 185(2):405-16. PMC: 2881125. DOI: 10.1534/genetics.110.114983. View

4.
Hammer P, Banck M, Amberg R, Wang C, Petznick G, Luo S . mRNA-seq with agnostic splice site discovery for nervous system transcriptomics tested in chronic pain. Genome Res. 2010; 20(6):847-60. PMC: 2877581. DOI: 10.1101/gr.101204.109. View

5.
Wang E, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C . Alternative isoform regulation in human tissue transcriptomes. Nature. 2008; 456(7221):470-6. PMC: 2593745. DOI: 10.1038/nature07509. View