
The ENCODE Blacklist: Identification of Problematic Regions of the Genome

Overview
Journal Sci Rep
Specialty Science
Date 2019 Jun 29
PMID 31249361
Citations 741
Abstract

Functional genomics assays based on high-throughput sequencing greatly expand our ability to understand the genome. Here, we define the ENCODE blacklist: a comprehensive set of regions in the human, mouse, worm, and fly genomes that have anomalous, unstructured, or high signal in next-generation sequencing experiments independent of cell line or experiment. Removing ENCODE blacklist regions is an essential quality measure when analyzing functional genomics data.
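As an illustration of what removing blacklisted regions can look like in practice, here is a minimal sketch, not the authors' implementation. It assumes peaks and blacklist entries are BED-style half-open intervals given as (chrom, start, end) tuples. The function names (build_index, overlaps_blacklist, filter_peaks) are hypothetical, and the coordinates are toy values, not entries from the real ENCODE blacklist.

```python
from bisect import bisect_right
from collections import defaultdict


def build_index(blacklist):
    """Group blacklist intervals by chromosome and merge overlapping ones."""
    by_chrom = defaultdict(list)
    for chrom, start, end in blacklist:
        by_chrom[chrom].append((start, end))
    index = {}
    for chrom, intervals in by_chrom.items():
        intervals.sort()
        merged = [intervals[0]]
        for start, end in intervals[1:]:
            last_start, last_end = merged[-1]
            if start <= last_end:
                # Overlapping or adjacent: extend the previous interval.
                merged[-1] = (last_start, max(last_end, end))
            else:
                merged.append((start, end))
        # Keep the start positions separately for binary search.
        index[chrom] = ([s for s, _ in merged], merged)
    return index


def overlaps_blacklist(index, chrom, start, end):
    """Return True if the half-open interval [start, end) hits the blacklist."""
    if chrom not in index:
        return False
    starts, merged = index[chrom]
    # Last merged interval whose start lies before the query end.
    i = bisect_right(starts, end - 1) - 1
    return i >= 0 and merged[i][1] > start


def filter_peaks(peaks, blacklist):
    """Keep only the peaks that do not overlap any blacklisted interval."""
    index = build_index(blacklist)
    return [p for p in peaks if not overlaps_blacklist(index, *p)]


# Toy coordinates for illustration only (not real blacklist entries).
toy_blacklist = [("chr1", 100, 200), ("chr1", 150, 300), ("chr2", 50, 60)]
toy_peaks = [("chr1", 50, 100), ("chr1", 299, 400), ("chr2", 10, 55), ("chr3", 0, 10)]
kept = filter_peaks(toy_peaks, toy_blacklist)
```

Merging the blacklist first keeps the intervals disjoint and sorted, so each peak needs only one binary search. Whether to drop a peak on any overlap, as here, or only above some overlap fraction is an analysis choice that this sketch does not settle.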

Citing Articles

Lethal co-expression intolerance underlies the mutually exclusive expression of ASCL1 and NEUROD1 in SCLC cells.

Watanabe H, Inoue Y, Tsuchiya K, Asada K, Suzuki M, Ogawa H. NPJ Precis Oncol. 2025; 9(1):74.

PMID: 40082639 PMC: 11906894. DOI: 10.1038/s41698-025-00860-6.


Optical genome and epigenome mapping of clear cell renal cell carcinoma.

Margalit S, Tulpova Z, Michaeli Y, Zur T, Deek J, Louzoun-Zada S. NAR Cancer. 2025; 7(1):zcaf008.

PMID: 40061565 PMC: 11886815. DOI: 10.1093/narcan/zcaf008.


Epstein-Barr virus hijacks histone demethylase machinery to drive epithelial malignancy progression through KDM5B upregulation.

Zhou Y, Jiang J, He S, Li Y, Cheng X, Liu S. Signal Transduct Target Ther. 2025; 10(1):83.

PMID: 40059116 PMC: 11891327. DOI: 10.1038/s41392-025-02163-5.


UTAP2: an enhanced user-friendly transcriptome and epigenome analysis pipeline.

Lindner J, Dassa B, Wigoda N, Stelzer G, Feldmesser E, Prilusky J. BMC Bioinformatics. 2025; 26(1):79.

PMID: 40055635 PMC: 11889741. DOI: 10.1186/s12859-025-06090-8.


Multi-ancestry GWAS reveals loci linked to human variation in LINE-1- and Alu-insertion numbers.

Bravo J, Zhang L, Benayoun B. Transl Med Aging. 2025; 9:25-40.

PMID: 40051556 PMC: 11883834. DOI: 10.1016/j.tma.2025.02.001.


References
1.
Carroll T, Liang Z, Salama R, Stark R, de Santiago I. Impact of artifact removal on ChIP quality metrics in ChIP-seq and ChIP-exo data. Front Genet. 2014; 5:75. PMC: 3989762. DOI: 10.3389/fgene.2014.00075.

2.
Boyle A, Araya C, Brdlik C, Cayting P, Cheng C, Cheng Y. Comparative analysis of regulatory information and circuits across distant species. Nature. 2014; 512(7515):453-6. PMC: 4336544. DOI: 10.1038/nature13668.

3.
Pickrell J, Gaffney D, Gilad Y, Pritchard J. False positive peaks in ChIP-seq and other sequencing-based functional assays caused by unannotated high copy number regions. Bioinformatics. 2011; 27(15):2144-6. PMC: 3137225. DOI: 10.1093/bioinformatics/btr354.

4.
Yue F, Cheng Y, Breschi A, Vierstra J, Wu W, Ryba T. A comparative encyclopedia of DNA elements in the mouse genome. Nature. 2014; 515(7527):355-64. PMC: 4266106. DOI: 10.1038/nature13992.

5.
Li W, Freudenberg J. Characterizing regions in the human genome unmappable by next-generation-sequencing at the read length of 1000 bases. Comput Biol Chem. 2014; 53 Pt A:108-17. DOI: 10.1016/j.compbiolchem.2014.08.015.