Review and Evaluate the Bioinformatics Analysis Strategies of ATAC-seq and CUT&Tag Data

Overview
Specialty Biology
Date 2024 Sep 10
PMID 39255248
Abstract

Efficient and reliable profiling methods are essential to study epigenetics. Tn5, one of the first identified prokaryotic transposases with high DNA-binding and tagmentation efficiency, is widely adopted in genomic and epigenomic protocols for high-throughput exploration of the genome and epigenome. Based on Tn5, the Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) and Cleavage Under Targets and Tagmentation (CUT&Tag) were developed to measure chromatin accessibility and detect DNA-protein interactions, respectively. These methods can be applied to large numbers of low-input biological samples, such as rare tissues, embryos, and sorted single cells. However, fast and proper processing of these epigenomic data has become a bottleneck as data production continues to grow rapidly. Furthermore, inappropriate data analysis can lead to biased or misleading conclusions. Therefore, it is essential to evaluate the performance of the bioinformatics tools used to process Tn5-based ATAC-seq and CUT&Tag data, many of which were developed mainly for analyzing chromatin immunoprecipitation followed by sequencing (ChIP-seq) data. Here, we conducted a comprehensive benchmarking analysis to evaluate the performance of eight popular software tools for processing ATAC-seq and CUT&Tag data. We compared the sensitivity, specificity, and peak width distribution for both narrow-type and broad-type peak calling. We also tested how the availability of a control IgG input influences CUT&Tag data analysis. Finally, we evaluated the differential analysis strategies commonly used for CUT&Tag data. Our study provided comprehensive guidance for selecting bioinformatics tools and recommended analysis strategies, which were implemented in Docker/Singularity images for streamlined data analysis.
