» Articles » PMID: 35672401

Somatic Point Mutations Are Enriched in Non-coding RNAs with Possible Regulatory Function in Breast Cancer

Overview
Journal Commun Biol
Specialty Biology
Date 2022 Jun 7
PMID 35672401
Authors
Affiliations
Soon will be listed here.
Abstract

Non-coding RNAs (ncRNAs) form a large portion of the mammalian genome. However, their biological functions are poorly characterized in cancers. In this study, using a newly developed tool, SomaGene, we analyze de novo somatic point mutations from the International Cancer Genome Consortium (ICGC) whole-genome sequencing data of 1,855 breast cancer samples. We identify 1030 candidates of ncRNAs that are significantly and explicitly mutated in breast cancer samples. By integrating data from the ENCODE regulatory features and FANTOM5 expression atlas, we show that the candidate ncRNAs significantly enrich active chromatin histone marks (1.9 times), CTCF binding sites (2.45 times), DNase accessibility (1.76 times), HMM predicted enhancers (2.26 times) and eQTL polymorphisms (1.77 times). Importantly, we show that the 1030 ncRNAs contain a much higher level (3.64 times) of breast cancer-associated genome-wide association (GWAS) single nucleotide polymorphisms (SNPs) than genome-wide expectation. Such enrichment has not been seen with GWAS SNPs from other cancers. Using breast cell line related Hi-C data, we then show that 82% of our candidate ncRNAs (1.9 times) significantly interact with the promoter of protein-coding genes, including previously known cancer-associated genes, suggesting the critical role of candidate ncRNA genes in the activation of essential regulators of development and differentiation in breast cancer. We provide an extensive web-based resource ( https://www.ihealthe.unsw.edu.au/research ) to communicate our results with the research community. Our list of breast cancer-specific ncRNA genes has the potential to provide a better understanding of the underlying genetic causes of breast cancer. Lastly, the tool developed in this study can be used to analyze somatic mutations in all cancers.

Citing Articles

Multi-Omics Mining of lncRNAs with Biological and Clinical Relevance in Cancer.

Salido-Guadarrama I, Romero-Cordoba S, Rueda-Zarazua B Int J Mol Sci. 2023; 24(23).

PMID: 38068923 PMC: 10706612. DOI: 10.3390/ijms242316600.


Extensive review on breast cancer its etiology, progression, prognostic markers, and treatment.

Swaminathan H, Saravanamurali K, Yadav S Med Oncol. 2023; 40(8):238.

PMID: 37442848 DOI: 10.1007/s12032-023-02111-9.


Harnessing deep learning into hidden mutations of neurological disorders for therapeutic challenges.

Yang S, Kim S, Kang M, Joo J Arch Pharm Res. 2023; 46(6):535-549.

PMID: 37261600 DOI: 10.1007/s12272-023-01450-5.

References
1.
Alinejad-Rokny H, Anwar F, Waters S, Davenport M, Ebrahimi D . Source of CpG Depletion in the HIV-1 Genome. Mol Biol Evol. 2016; 33(12):3205-3212. DOI: 10.1093/molbev/msw205. View

2.
Gupta R, Shah N, Wang K, Kim J, Horlings H, Wong D . Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis. Nature. 2010; 464(7291):1071-6. PMC: 3049919. DOI: 10.1038/nature08975. View

3.
. An integrated encyclopedia of DNA elements in the human genome. Nature. 2012; 489(7414):57-74. PMC: 3439153. DOI: 10.1038/nature11247. View

4.
Dong Y, Zhang T, Li X, Yu F, Guo Y . Comprehensive analysis of coexpressed long noncoding RNAs and genes in breast cancer. J Obstet Gynaecol Res. 2018; 45(2):428-437. DOI: 10.1111/jog.13840. View

5.
Afrasiabi A, Keane J, Heng J, Palmer E, Lovell N, Alinejad-Rokny H . Quantitative neurogenetics: applications in understanding disease. Biochem Soc Trans. 2021; 49(4):1621-1631. DOI: 10.1042/BST20200732. View