MatchRanges: Generating Null Hypothesis Genomic Ranges Via Covariate-matched Sampling
Overview
Affiliations
Motivation: Deriving biological insights from genomic data commonly requires comparing attributes of selected genomic loci to a null set of loci. The selection of this null set is non-trivial, as it requires careful consideration of potential covariates, a problem that is exacerbated by the non-uniform distribution of genomic features including genes, enhancers, and transcription factor binding sites. Propensity score-based covariate matching methods allow the selection of null sets from a pool of possible items while controlling for multiple covariates; however, existing packages do not operate on genomic data classes and can be slow for large data sets making them difficult to integrate into genomic workflows.
Results: To address this, we developed matchRanges, a propensity score-based covariate matching method for the efficient and convenient generation of matched null ranges from a set of background ranges within the Bioconductor framework.
Availability And Implementation: Package: https://bioconductor.org/packages/nullranges, Code: https://github.com/nullranges, Documentation: https://nullranges.github.io/nullranges.
Kramer N, Byun S, Coryell P, DCosta S, Thulson E, Kim H Cell Genom. 2025; 5(1):100738.
PMID: 39788104 PMC: 11770232. DOI: 10.1016/j.xgen.2024.100738.
Gaussian processes for time series with lead-lag effects with applications to biology data.
Mu W, Chen J, Davis E, Reed K, Phanstiel D, Love M Biometrics. 2025; 81(1).
PMID: 39775854 PMC: 11704948. DOI: 10.1093/biomtc/ujae156.
Kramer N, Byun S, Coryell P, DCosta S, Thulson E, Kim H bioRxiv. 2024; .
PMID: 38952796 PMC: 11216363. DOI: 10.1101/2024.05.05.592567.
The tidyomics ecosystem: enhancing omic data analyses.
Hutchison W, Keyes T, Crowell H, Serizay J, Soneson C, Davis E Nat Methods. 2024; 21(7):1166-1170.
PMID: 38877315 DOI: 10.1038/s41592-024-02299-2.
The ecosystem: Enhancing omic data analyses.
Hutchison W, Keyes T, Crowell H, Serizay J, Soneson C, Davis E bioRxiv. 2024; .
PMID: 38826347 PMC: 11142095. DOI: 10.1101/2023.09.10.557072.