WGBSSuite: Simulating Whole-genome Bisulphite Sequencing Data and Benchmarking Differential DNA Methylation Analysis Tools
Motivation: As the number of studies examining differences in DNA methylation increases, there is a growing demand to develop and benchmark statistical methods to analyse these data. To date, no objective approach for comparing these methods has been developed, and it therefore remains difficult to assess which analysis tool is most appropriate for a given experiment. There is thus an unmet need for a DNA methylation data simulator that can accurately reproduce a wide range of experimental setups and can be routinely used to compare the performance of different statistical models.
Results: We have developed WGBSSuite, a flexible stochastic simulation tool that generates single-base resolution DNA methylation data genome-wide. Several simulator parameters can be derived directly from real datasets provided by the user in order to mimic real-case scenarios. This makes it possible to choose the most appropriate statistical analysis tool for a given simulated design. To show the usefulness of our simulator, we also report a benchmark of commonly used methods for differential methylation analysis.
Availability and Implementation: WGBSSuite code and documentation are available under a GNU licence at http://www.wgbssuite.org.uk/
Contact: owen.rackham@imperial.ac.uk or l.bottolo@imperial.ac.uk
Supplementary Information: Supplementary data are available at Bioinformatics online.