» Articles » PMID: 37624924

Gonomics: Uniting High Performance and Readability for Genomics with Go

Overview
Journal Bioinformatics
Specialty Biology
Date 2023 Aug 25
PMID 37624924
Authors
Affiliations
Soon will be listed here.
Abstract

Summary: Many existing software libraries for genomics require researchers to pick between competing considerations: the performance of compiled languages and the accessibility of interpreted languages. Go, a modern compiled language, provides an opportunity to address this conflict. We introduce Gonomics, an open-source collection of command line programs and bioinformatic libraries implemented in Go that unites readability and performance for genomic analyses. Gonomics contains packages to read, write, and manipulate a wide array of file formats (e.g. FASTA, FASTQ, BED, BEDPE, SAM, BAM, and VCF), and can convert and interface between these formats. Furthermore, our modular library structure provides a flexible platform for researchers developing their own software tools to address specific questions. These commands can be combined and incorporated into complex pipelines to meet the growing need for high-performance bioinformatic resources.

Availability And Implementation: Gonomics is implemented in the Go programming language. Source code, installation instructions, and documentation are freely available at https://github.com/vertgenlab/gonomics.

Citing Articles

Genome Sequence of a Marine Threespine Stickleback () from Rabbit Slough in the Cook Inlet.

Au E, Weaver S, Katikaneni A, Wucherpfennig J, Luo Y, Mangan R bioRxiv. 2025; .

PMID: 39975098 PMC: 11839064. DOI: 10.1101/2025.02.06.636934.


Epigenetic control of metabolic identity across cell types.

Pacheco M, Gerard D, Mangan R, Chapman A, Hecker D, Kellis M bioRxiv. 2024; .

PMID: 39091778 PMC: 11291179. DOI: 10.1101/2024.07.24.604914.

References
1.
Bonfield J, Marshall J, Danecek P, Li H, Ohan V, Whitwham A . HTSlib: C library for reading/writing high-throughput sequencing data. Gigascience. 2021; 10(2). PMC: 7931820. DOI: 10.1093/gigascience/giab007. View

2.
Neph S, Kuehn M, Reynolds A, Haugen E, Thurman R, Johnson A . BEDOPS: high-performance genomic feature operations. Bioinformatics. 2012; 28(14):1919-20. PMC: 3389768. DOI: 10.1093/bioinformatics/bts277. View

3.
Mao C, Eran A, Luo Y . Efficient Genomic Interval Queries Using Augmented Range Trees. Sci Rep. 2019; 9(1):5059. PMC: 6434014. DOI: 10.1038/s41598-019-41451-3. View

4.
Mangan R, Alsina F, Mosti F, Sotelo-Fonseca J, Snellings D, Au E . Adaptive sequence divergence forged new neurodevelopmental enhancers in humans. Cell. 2022; 185(24):4587-4603.e23. PMC: 10013929. DOI: 10.1016/j.cell.2022.10.016. View

5.
Mousavi N, Margoliash J, Pusarla N, Saini S, Yanicky R, Gymrek M . TRTools: a toolkit for genome-wide analysis of tandem repeats. Bioinformatics. 2020; 37(5):731-733. PMC: 8097685. DOI: 10.1093/bioinformatics/btaa736. View