GEOfetch: a Command-line Tool for Downloading Data and Standardized Metadata from GEO and SRA
Overview
Affiliations
Motivation: The Gene Expression Omnibus has become an important source of biological data for secondary analysis. However, there is no simple, programmatic way to download data and metadata from Gene Expression Omnibus (GEO) in a standardized annotation format.
Results: To address this, we present GEOfetch-a command-line tool that downloads and organizes data and metadata from GEO and SRA. GEOfetch formats the downloaded metadata as a Portable Encapsulated Project, providing universal format for the reanalysis of public data.
Availability And Implementation: GEOfetch is available on Bioconda and the Python Package Index (PyPI).
Methods for evaluating unsupervised vector representations of genomic regions.
Zheng G, Rymuza J, Gharavi E, LeRoy N, Zhang A, Sheffield N NAR Genom Bioinform. 2024; 6(3):lqae086.
PMID: 39131817 PMC: 11316252. DOI: 10.1093/nargab/lqae086.
LeRoy N, Khoroshevskyi O, OBrien A, Stepien R, Arslan A, Sheffield N Gigascience. 2024; 13.
PMID: 38991851 PMC: 11238423. DOI: 10.1093/gigascience/giae033.
Rostami F, Tavakol Hamedani Z, Sadoughi A, Mehrabadi M, Kouhkan F Sci Rep. 2024; 14(1):13542.
PMID: 38866824 PMC: 11169246. DOI: 10.1038/s41598-024-62064-5.
OMD Curation Toolkit: a workflow for in-house curation of public omics datasets.
Piquer-Esteban S, Arnau V, Diaz W, Moya A BMC Bioinformatics. 2024; 25(1):184.
PMID: 38724907 PMC: 11084137. DOI: 10.1186/s12859-024-05803-9.
Joint Representation Learning for Retrieval and Annotation of Genomic Interval Sets.
Gharavi E, LeRoy N, Zheng G, Zhang A, Brown D, Sheffield N Bioengineering (Basel). 2024; 11(3).
PMID: 38534537 PMC: 10967841. DOI: 10.3390/bioengineering11030263.