» Articles » PMID: 36812605

An Integrated Analysis of the Cancer Genome Atlas Data Discovers a Hierarchical Association Structure Across Thirty Three Cancer Types

Overview
Date 2023 Feb 22
PMID 36812605
Authors
Affiliations
Soon will be listed here.
Abstract

Cancer cells harbor molecular alterations at all levels of information processing. Genomic/epigenomic and transcriptomic alterations are inter-related between genes, within and across cancer types and may affect clinical phenotypes. Despite the abundant prior studies of integrating cancer multi-omics data, none of them organizes these associations in a hierarchical structure and validates the discoveries in extensive external data. We infer this Integrated Hierarchical Association Structure (IHAS) from the complete data of The Cancer Genome Atlas (TCGA) and compile a compendium of cancer multi-omics associations. Intriguingly, diverse alterations on genomes/epigenomes from multiple cancer types impact transcriptions of 18 Gene Groups. Half of them are further reduced to three Meta Gene Groups enriched with (1) immune and inflammatory responses, (2) embryonic development and neurogenesis, (3) cell cycle process and DNA repair. Over 80% of the clinical/molecular phenotypes reported in TCGA are aligned with the combinatorial expressions of Meta Gene Groups, Gene Groups, and other IHAS subunits. Furthermore, IHAS derived from TCGA is validated in more than 300 external datasets including multi-omics measurements and cellular responses upon drug treatments and gene perturbations in tumors, cancer cell lines, and normal tissues. To sum up, IHAS stratifies patients in terms of molecular signatures of its subunits, selects targeted genes or drugs for precision cancer therapy, and demonstrates that associations between survival times and transcriptional biomarkers may vary with cancer types. These rich information is critical for diagnosis and treatments of cancers.

Citing Articles

MPAC: a computational framework for inferring cancer pathway activities from multi-omic data.

Liu P, Page D, Ahlquist P, Ong I, Gitter A bioRxiv. 2024; .

PMID: 38948762 PMC: 11212914. DOI: 10.1101/2024.06.15.599113.


Assessing transcriptomic heterogeneity of single-cell RNASeq data by bulk-level gene expression data.

Tiong K, Luzhbin D, Yeang C BMC Bioinformatics. 2024; 25(1):209.

PMID: 38867193 PMC: 11167951. DOI: 10.1186/s12859-024-05825-3.

References
1.
Subramanian A, Tamayo P, Mootha V, Mukherjee S, Ebert B, Gillette M . Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005; 102(43):15545-50. PMC: 1239896. DOI: 10.1073/pnas.0506580102. View

2.
Chen M, Li J, Wang Y, Akbani R, Lu Y, Mills G . TCPA v3.0: An Integrative Platform to Explore the Pan-Cancer Analysis of Functional Proteomic Data. Mol Cell Proteomics. 2019; 18(8 suppl 1):S15-S25. PMC: 6692772. DOI: 10.1074/mcp.RA118.001260. View

3.
Akavia U, Litvin O, Kim J, Sanchez-Garcia F, Kotliar D, Causton H . An integrated approach to uncover drivers of cancer. Cell. 2010; 143(6):1005-17. PMC: 3013278. DOI: 10.1016/j.cell.2010.11.013. View

4.
Ruan P, Wang Y, Shen R, Wang S . Using association signal annotations to boost similarity network fusion. Bioinformatics. 2019; 35(19):3718-3726. PMC: 6761966. DOI: 10.1093/bioinformatics/btz124. View

5.
Jackson H, Fischer J, Zanotelli V, Ali H, Mechera R, Soysal S . The single-cell pathology landscape of breast cancer. Nature. 2020; 578(7796):615-620. DOI: 10.1038/s41586-019-1876-x. View