» Articles » PMID: 19447789

Hierarchical Hidden Markov Model with Application to Joint Analysis of ChIP-chip and ChIP-seq Data

Overview
Journal Bioinformatics
Specialty Biology
Date 2009 May 19
PMID 19447789
Citations 15
Authors
Affiliations
Soon will be listed here.
Abstract

Motivation: Chromatin immunoprecipitation (ChIP) experiments followed by array hybridization, or ChIP-chip, is a powerful approach for identifying transcription factor binding sites (TFBS) and has been widely used. Recently, massively parallel sequencing coupled with ChIP experiments (ChIP-seq) has been increasingly used as an alternative to ChIP-chip, offering cost-effective genome-wide coverage and resolution up to a single base pair. For many well-studied TFs, both ChIP-seq and ChIP-chip experiments have been applied and their data are publicly available. Previous analyses have revealed substantial technology-specific binding signals despite strong correlation between the two sets of results. Therefore, it is of interest to see whether the two data sources can be combined to enhance the detection of TFBS.

Results: In this work, hierarchical hidden Markov model (HHMM) is proposed for combining data from ChIP-seq and ChIP-chip. In HHMM, inference results from individual HMMs in ChIP-seq and ChIP-chip experiments are summarized by a higher level HMM. Simulation studies show the advantage of HHMM when data from both technologies co-exist. Analysis of two well-studied TFs, NRSF and CCCTC-binding factor (CTCF), also suggests that HHMM yields improved TFBS identification in comparison to analyses using individual data sources or a simple merger of the two.

Availability: Source code for the software ChIPmeta is freely available for download at http://www.umich.edu/~hwchoi/HHMMsoftware.zip, implemented in C and supported on linux.

Citing Articles

Statistical Issues in the Analysis of ChIP-Seq and RNA-Seq Data.

Ghosh D, Qin Z Genes (Basel). 2014; 1(2):317-34.

PMID: 24710049 PMC: 3954086. DOI: 10.3390/genes1020317.


Genome-wide signatures of transcription factor activity: connecting transcription factors, disease, and small molecules.

Chen J, Hu Z, Phatak M, Reichard J, Freudenberg J, Sivaganesan S PLoS Comput Biol. 2013; 9(9):e1003198.

PMID: 24039560 PMC: 3764016. DOI: 10.1371/journal.pcbi.1003198.


jMOSAiCS: joint analysis of multiple ChIP-seq datasets.

Zeng X, Sanalkumar R, Bresnick E, Li H, Chang Q, Keles S Genome Biol. 2013; 14(4):R38.

PMID: 23844871 PMC: 4053760. DOI: 10.1186/gb-2013-14-4-r38.


Joint analysis of expression profiles from multiple cancers improves the identification of microRNA-gene interactions.

Chen X, Slack F, Zhao H Bioinformatics. 2013; 29(17):2137-45.

PMID: 23772050 PMC: 3740623. DOI: 10.1093/bioinformatics/btt341.


Differential principal component analysis of ChIP-seq.

Ji H, Li X, Wang Q, Ning Y Proc Natl Acad Sci U S A. 2013; 110(17):6789-94.

PMID: 23569280 PMC: 3637734. DOI: 10.1073/pnas.1204398110.


References
1.
Orlando V, Paro R . Mapping Polycomb-repressed domains in the bithorax complex using in vivo formaldehyde cross-linked chromatin. Cell. 1993; 75(6):1187-98. DOI: 10.1016/0092-8674(93)90328-n. View

2.
Solomon M, Larsen P, Varshavsky A . Mapping protein-DNA interactions in vivo with formaldehyde: evidence that histone H4 is retained on a highly transcribed gene. Cell. 1988; 53(6):937-47. DOI: 10.1016/s0092-8674(88)90469-2. View

3.
Ji H, Jiang H, Ma W, Johnson D, Myers R, Wong W . An integrated software system for analyzing ChIP-chip and ChIP-seq data. Nat Biotechnol. 2008; 26(11):1293-300. PMC: 2596672. DOI: 10.1038/nbt.1505. View

4.
Iyer V, Horak C, Scafe C, Botstein D, Snyder M, Brown P . Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF. Nature. 2001; 409(6819):533-8. DOI: 10.1038/35054095. View

5.
Huber W, Toedling J, Steinmetz L . Transcript mapping with high-density oligonucleotide tiling arrays. Bioinformatics. 2006; 22(16):1963-70. DOI: 10.1093/bioinformatics/btl289. View