» Articles » PMID: 28369191

The RNASeq-er API-a Gateway to Systematically Updated Analysis of Public RNA-seq Data

Overview
Journal Bioinformatics
Specialty Biology
Date 2017 Apr 4
PMID 28369191
Citations 19
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: The exponential growth of publicly available RNA-sequencing (RNA-Seq) data poses an increasing challenge to researchers wishing to discover, analyse and store such data, particularly those based in institutions with limited computational resources. EMBL-EBI is in an ideal position to address these challenges and to allow the scientific community easy access to not just raw, but also processed RNA-Seq data. We present a Web service to access the results of a systematically and continually updated standardized alignment as well as gene and exon expression quantification of all public bulk (and in the near future also single-cell) RNA-Seq runs in 264 species in European Nucleotide Archive, using Representational State Transfer.

Results: The RNASeq-er API (Application Programming Interface) enables ontology-powered search for and retrieval of CRAM, bigwig and bedGraph files, gene and exon expression quantification matrices (Fragments Per Kilobase Of Exon Per Million Fragments Mapped, Transcripts Per Million, raw counts) as well as sample attributes annotated with ontology terms. To date over 270 00 RNA-Seq runs in nearly 10 000 studies (1PB of raw FASTQ data) in 264 species in ENA have been processed and made available via the API.

Availability And Implementation: The RNASeq-er API can be accessed at http://www.ebi.ac.uk/fg/rnaseq/api . The commands used to analyse the data are available in supplementary materials and at https://github.com/nunofonseca/irap/wiki/iRAP-single-library .

Contact: rnaseq@ebi.ac.uk ; rpetry@ebi.ac.uk.

Supplementary Information: Supplementary data are available at Bioinformatics online.

Citing Articles

transcriptome assembly from a compendium of RNA-seq data sets.

Tjaden B RNA Biol. 2023; 20(1):77-84.

PMID: 36920168 PMC: 10392735. DOI: 10.1080/15476286.2023.2189331.


Differential Expression Enrichment Tool (DEET): an interactive atlas of human differential gene expression.

Sokolowski D, Ahn J, Erdman L, Hou H, Ellis K, Wang L NAR Genom Bioinform. 2023; 5(1):lqad003.

PMID: 36694664 PMC: 9869326. DOI: 10.1093/nargab/lqad003.


The miR-429 suppresses proliferation and migration in glioblastoma cells and induces cell-cycle arrest and apoptosis via modulating several target genes of ERBB signaling pathway.

Gheidari F, Arefian E, Saadatpour F, Kabiri M, Seyedjafari E, Teimoori-Toolabi L Mol Biol Rep. 2022; 49(12):11855-11866.

PMID: 36219319 DOI: 10.1007/s11033-022-07903-2.


Pan-phylum Analyses of Nematode Endocannabinoid Signalling Systems Highlight Novel Opportunities for Parasite Drug Target Discovery.

Crooks B, McKenzie D, Cadd L, McCoy C, McVeigh P, Marks N Front Endocrinol (Lausanne). 2022; 13:892758.

PMID: 35846343 PMC: 9283691. DOI: 10.3389/fendo.2022.892758.


Evaluation of critical data processing steps for reliable prediction of gene co-expression from large collections of RNA-seq data.

Vandenbon A PLoS One. 2022; 17(1):e0263344.

PMID: 35089979 PMC: 8797241. DOI: 10.1371/journal.pone.0263344.


References
1.
Howe K, Bolt B, Cain S, Chan J, Chen W, Davis P . WormBase 2016: expanding to enable helminth genomic research. Nucleic Acids Res. 2015; 44(D1):D774-80. PMC: 4702863. DOI: 10.1093/nar/gkv1217. View

2.
Anders S, Pyl P, Huber W . HTSeq--a Python framework to work with high-throughput sequencing data. Bioinformatics. 2014; 31(2):166-9. PMC: 4287950. DOI: 10.1093/bioinformatics/btu638. View

3.
Kolesnikov N, Hastings E, Keays M, Melnichuk O, Tang Y, Williams E . ArrayExpress update--simplifying data submissions. Nucleic Acids Res. 2014; 43(Database issue):D1113-6. PMC: 4383899. DOI: 10.1093/nar/gku1057. View

4.
Petryszak R, Burdett T, Fiorelli B, Fonseca N, Gonzalez-Porta M, Hastings E . Expression Atlas update--a database of gene and transcript expression from microarray- and sequencing-based functional genomics experiments. Nucleic Acids Res. 2013; 42(Database issue):D926-32. PMC: 3964963. DOI: 10.1093/nar/gkt1270. View

5.
Petryszak R, Keays M, Tang Y, Fonseca N, Barrera E, Burdett T . Expression Atlas update--an integrated database of gene and protein expression in humans, animals and plants. Nucleic Acids Res. 2015; 44(D1):D746-52. PMC: 4702781. DOI: 10.1093/nar/gkv1045. View