» Articles » PMID: 26771021

The Molecular Signatures Database (MSigDB) Hallmark Gene Set Collection

Overview
Journal Cell Syst
Publisher Cell Press
Date 2016 Jan 16
PMID 26771021
Citations 6010
Authors
Affiliations
Soon will be listed here.
Abstract

The Molecular Signatures Database (MSigDB) is one of the most widely used and comprehensive databases of gene sets for performing gene set enrichment analysis. Since its creation, MSigDB has grown beyond its roots in metabolic disease and cancer to include >10,000 gene sets. These better represent a wider range of biological processes and diseases, but the utility of the database is reduced by increased redundancy across, and heterogeneity within, gene sets. To address this challenge, here we use a combination of automated approaches and expert curation to develop a collection of "hallmark" gene sets as part of MSigDB. Each hallmark in this collection consists of a "refined" gene set, derived from multiple "founder" sets, that conveys a specific biological state or process and displays coherent expression. The hallmarks effectively summarize most of the relevant information of the original founder sets and, by reducing both variation and redundancy, provide more refined and concise inputs for gene set enrichment analysis.

Citing Articles

Single-cell and chromatin accessibility profiling reveals regulatory programs of pathogenic Th2 cells in allergic asthma.

Khan M, Alteneder M, Reiter W, Krausgruber T, Dobnikar L, Madern M Nat Commun. 2025; 16(1):2565.

PMID: 40089475 DOI: 10.1038/s41467-025-57590-3.


Combination therapy with Chicoric acid and PD-1/PD-L1 blockade improves the immunotherapy response in patient-derived ovarian cancer xenograft model.

Lan H, Zhu J, Hou H, Zhang C, Huo X, Zhang Y Cell Commun Signal. 2025; 23(1):137.

PMID: 40087780 DOI: 10.1186/s12964-025-02146-7.


NLRP4 unlocks an NK/macrophages-centered ecosystem to suppress non-small cell lung cancer.

Meng Z, Li J, Wang H, Cao Z, Lu W, Niu X Biomark Res. 2025; 13(1):44.

PMID: 40087771 DOI: 10.1186/s40364-025-00756-4.


Astrocytic pleiotrophin deficiency in the prefrontal cortex contributes to stress-induced depressive-like responses in male mice.

Chi D, Zhang K, Zhang J, He Z, Zhou H, Huang W Nat Commun. 2025; 16(1):2528.

PMID: 40087317 DOI: 10.1038/s41467-025-57924-1.


Exploring the potential mechanisms of sorafenib resistance in hepatocellular carcinoma cell lines based on RNA sequencing.

Sun M, Zhang Z, Chen C, Zhong J, Long Z, Shen L Cancer Cell Int. 2025; 25(1):91.

PMID: 40082884 PMC: 11907981. DOI: 10.1186/s12935-025-03728-8.


References
1.
Subramanian A, Kuehn H, Gould J, Tamayo P, Mesirov J . GSEA-P: a desktop application for Gene Set Enrichment Analysis. Bioinformatics. 2007; 23(23):3251-3. DOI: 10.1093/bioinformatics/btm369. View

2.
Ferbeyre G, Moriggl R . The role of Stat5 transcription factors as tumor suppressors or oncogenes. Biochim Biophys Acta. 2010; 1815(1):104-14. DOI: 10.1016/j.bbcan.2010.10.004. View

3.
Subramanian A, Tamayo P, Mootha V, Mukherjee S, Ebert B, Gillette M . Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005; 102(43):15545-50. PMC: 1239896. DOI: 10.1073/pnas.0506580102. View

4.
Akhurst R, Hata A . Targeting the TGFβ signalling pathway in disease. Nat Rev Drug Discov. 2012; 11(10):790-811. PMC: 3520610. DOI: 10.1038/nrd3810. View

5.
Cho Y, Tsherniak A, Tamayo P, Santagata S, Ligon A, Greulich H . Integrative genomic analysis of medulloblastoma identifies a molecular subgroup that drives poor clinical outcome. J Clin Oncol. 2010; 29(11):1424-30. PMC: 3082983. DOI: 10.1200/JCO.2010.28.5148. View