
A Comparison of Peak Callers Used for DNase-Seq Data

Overview
Journal PLoS One
Date 2014 May 10
PMID 24810143
Citations 42
Abstract

Genome-wide profiling of open chromatin regions using DNase I and high-throughput sequencing (DNase-seq) is an increasingly popular approach for finding and studying regulatory elements. A variety of algorithms have been developed to identify regions of open chromatin from raw sequence-tag data, which has motivated us to assess and compare their performance. In this study, four published, publicly available peak calling algorithms used for DNase-seq data analysis (F-seq, Hotspot, MACS and ZINBA) are assessed at a range of signal thresholds on two published DNase-seq datasets for three cell types. The results were benchmarked against an independent dataset of regulatory regions derived from ENCODE in vivo transcription factor binding data for each particular cell type. The level of overlap between the peak regions reported by each algorithm and this ENCODE-derived reference set was used to assess the sensitivity and specificity of the algorithms. Our study suggests that F-seq has a slightly higher sensitivity than the next best algorithms. Hotspot and the ChIP-seq oriented method, MACS, both perform competitively when used with their default parameters. However, the generic peak finder ZINBA appears to be less sensitive than the other three. We also assess the accuracy of each algorithm over a range of signal thresholds. In particular, we show that the accuracy of F-seq can be considerably improved by using a threshold setting that is different from the default value.
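
To make the overlap-based assessment concrete, the following is a minimal sketch of how sensitivity and specificity could be computed at base-pair resolution from a set of called peaks and a reference set. It is an illustration under stated assumptions, not the procedure used in the study: the abstract does not specify the overlap definition, so per-base overlap, half-open (chrom, start, end) intervals, the function names, and the `genome_size` parameter are all hypothetical choices made here.

```python
from collections import defaultdict


def merge(intervals):
    """Merge overlapping half-open (chrom, start, end) intervals per chromosome."""
    by_chrom = defaultdict(list)
    for chrom, start, end in intervals:
        by_chrom[chrom].append((start, end))
    merged = {}
    for chrom, spans in by_chrom.items():
        spans.sort()
        out = [list(spans[0])]
        for start, end in spans[1:]:
            if start <= out[-1][1]:
                # Overlapping or touching: extend the current merged span.
                out[-1][1] = max(out[-1][1], end)
            else:
                out.append([start, end])
        merged[chrom] = [tuple(span) for span in out]
    return merged


def total_bases(merged):
    """Number of bases covered by a merged interval set."""
    return sum(end - start for spans in merged.values() for start, end in spans)


def overlap_bases(a, b):
    """Bases shared by two merged interval sets (two-pointer sweep)."""
    shared = 0
    for chrom in a.keys() & b.keys():
        xs, ys = a[chrom], b[chrom]
        i = j = 0
        while i < len(xs) and j < len(ys):
            lo = max(xs[i][0], ys[j][0])
            hi = min(xs[i][1], ys[j][1])
            if lo < hi:
                shared += hi - lo
            # Advance whichever interval ends first.
            if xs[i][1] < ys[j][1]:
                i += 1
            else:
                j += 1
    return shared


def sensitivity_specificity(peaks, reference, genome_size):
    """Per-base sensitivity and specificity of peaks against a reference set.

    genome_size is an assumed total number of assessable bases; it is
    needed to count true negatives. Assumes a non-empty reference.
    """
    p, r = merge(peaks), merge(reference)
    tp = overlap_bases(p, r)
    fn = total_bases(r) - tp
    fp = total_bases(p) - tp
    tn = genome_size - tp - fn - fp
    return tp / (tp + fn), tn / (tn + fp)
```

Repeating this calculation for the peak sets produced at each signal threshold would give one (sensitivity, specificity) pair per threshold, which is one way to compare callers across a range of settings.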

Citing Articles

A genome scale transcriptional regulatory model of the human placenta.

Paquette A, Ahuna K, Hwang Y, Pearl J, Liao H, Shannon P Sci Adv. 2024; 10(26):eadf3411.

PMID: 38941464 PMC: 11212735. DOI: 10.1126/sciadv.adf3411.


Chromatin accessibility profiling reveals that human fibroblasts respond to mechanical stimulation in a cell-specific manner.

Logan N, Broda K, Pantelireis N, Williams G, Higgins C JBMR Plus. 2024; 8(5):ziae025.

PMID: 38682000 PMC: 11055960. DOI: 10.1093/jbmrpl/ziae025.


Chromatin accessibility profiling methods.

Minnoye L, Marinov G, Krausgruber T, Pan L, Marand A, Secchia S Nat Rev Methods Primers. 2024; 1.

PMID: 38410680 PMC: 10895463. DOI: 10.1038/s43586-020-00008-9.


JMnorm: a novel joint multi-feature normalization method for integrative and comparative epigenomics.

Xiang G, Guo Y, Bumcrot D, Sigova A Nucleic Acids Res. 2023; 52(2):e11.

PMID: 38055833 PMC: 10810286. DOI: 10.1093/nar/gkad1146.


Evaluating deep learning for predicting epigenomic profiles.

Toneyan S, Tang Z, Koo P Nat Mach Intell. 2023; 4(12):1088-1100.

PMID: 37324054 PMC: 10270674. DOI: 10.1038/s42256-022-00570-9.

