» Articles » PMID: 30609939

Plyranges: a Grammar of Genomic Data Transformation

Overview
Journal Genome Biol
Specialties Biology
Genetics
Date 2019 Jan 6
PMID 30609939
Citations 61
Authors
Affiliations
Soon will be listed here.
Abstract

Bioconductor is a widely used R-based platform for genomics, but its host of complex genomic data structures places a cognitive burden on the user. For most tasks, the GRanges object would suffice, but there are gaps in the API that prevent its general use. By recognizing that the GRanges class follows "tidy" data principles, we create a grammar of genomic data transformation, defining verbs for performing actions on and between genomic interval data and providing a way of performing common data analysis tasks through a coherent interface to existing Bioconductor infrastructure. We implement this grammar as a Bioconductor/R package called plyranges.

Citing Articles

Alternative splicing of transposable elements in human breast cancer.

Nesta A, Veiga D, Banchereau J, Anczukow O, Beck C Mob DNA. 2025; 16(1):6.

PMID: 39987084 PMC: 11846448. DOI: 10.1186/s13100-025-00341-4.


A Bioconductor/R Workflow for the Detection and Visualization of Differential Chromatin Loops.

Flores J, Davis E, Kramer N, Love M, Phanstiel D F1000Res. 2025; 13:1346.

PMID: 39931328 PMC: 11809633. DOI: 10.12688/f1000research.153949.1.


Coordinated repression of totipotency-associated gene loci by histone methyltransferase EHMT2 through binding to LINE-1 regulatory elements.

Chatterjee K, Uyehara C, Kasliwal K, Madhuranath S, Scourzic L, Polyzos A bioRxiv. 2025; .

PMID: 39763795 PMC: 11702699. DOI: 10.1101/2024.12.18.629181.


Integrative multiomics reveals common endotypes across PSEN1, PSEN2, and APP mutations in familial Alzheimer's disease.

Valdes P, Caldwell A, Liu Q, Fitzgerald M, Ramachandran S, Karch C Alzheimers Res Ther. 2025; 17(1):5.

PMID: 39754192 PMC: 11699654. DOI: 10.1186/s13195-024-01659-6.


METTL3/MYCN cooperation drives neural crest differentiation and provides therapeutic vulnerability in neuroblastoma.

Thombare K, Vaid R, Pucci P, Ihrmark Lundberg K, Ayyalusamy R, Baig M EMBO J. 2024; 43(24):6310-6335.

PMID: 39528654 PMC: 11649786. DOI: 10.1038/s44318-024-00299-8.


References
1.
Yin T, Cook D, Lawrence M . ggbio: an R package for extending the grammar of graphics for genomic data. Genome Biol. 2012; 13(8):R77. PMC: 4053745. DOI: 10.1186/gb-2012-13-8-r77. View

2.
Riemondy K, Sheridan R, Gillen A, Yu Y, Bennett C, Hesselberth J . valr: Reproducible genome interval analysis in R. F1000Res. 2017; 6:1025. PMC: 5506536. DOI: 10.12688/f1000research.11997.1. View

3.
Lawrence M, Huber W, Pages H, Aboyoun P, Carlson M, Gentleman R . Software for computing and annotating genomic ranges. PLoS Comput Biol. 2013; 9(8):e1003118. PMC: 3738458. DOI: 10.1371/journal.pcbi.1003118. View

4.
Lee S, Cook D, Lawrence M . plyranges: a grammar of genomic data transformation. Genome Biol. 2019; 20(1):4. PMC: 6320618. DOI: 10.1186/s13059-018-1597-8. View

5.
Dale R, Pedersen B, Quinlan A . Pybedtools: a flexible Python library for manipulating genomic datasets and annotations. Bioinformatics. 2011; 27(24):3423-4. PMC: 3232365. DOI: 10.1093/bioinformatics/btr539. View