
Identification of Novel Diagnostic and Prognostic Gene Signature Biomarkers for Breast Cancer Using Artificial Intelligence and Machine Learning Assisted Transcriptomics Analysis

Overview
Journal Cancers (Basel)
Publisher MDPI
Specialty Oncology
Date 2023 Jun 28
PMID 37370847
Abstract

Background: Breast cancer (BC) is one of the most common cancers in women. Clinical and histopathological information are used together for diagnosis but are often imprecise. We applied machine learning (ML) methods to identify a gene signature model, based on differentially expressed genes (DEGs), for BC diagnosis and prognosis.

Methods: A cohort of 701 samples from 11 GEO BC microarray datasets was used to identify significant DEGs. Seven ML methods, including RFECV-LR, RFECV-SVM, LR-L1, SVC-L1, RF, and Extra-Trees, were applied for gene reduction and for constructing a diagnostic model for cancer classification. Kaplan-Meier survival analysis was performed to construct the prognostic signature. The potential biomarkers were confirmed via qRT-PCR and validated by a second set of ML methods, including GBDT, XGBoost, AdaBoost, KNN, and MLP.
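
The Kaplan-Meier step named in the Methods can be illustrated with a minimal sketch. It is not the authors' pipeline: the function name `kaplan_meier`, the grouping by high versus low expression, and the toy follow-up times are all hypothetical and chosen only to show how the product-limit estimate is computed from follow-up times and event flags.

```python
from collections import Counter


def kaplan_meier(durations, events):
    """Return [(time, survival_probability)] at each distinct event time.

    durations: follow-up time for each patient
    events:    1 if the event (e.g. relapse or death) was observed, 0 if censored
    """
    if len(durations) != len(events):
        raise ValueError("durations and events must have the same length")

    # Number of observed events at each time point
    observed = Counter(t for t, e in zip(durations, events) if e)
    # Number of patients leaving the risk set at each time (event or censoring)
    leaving = Counter(durations)

    at_risk = len(durations)
    survival = 1.0
    curve = []
    for t in sorted(leaving):
        d = observed.get(t, 0)
        if d:
            # Product-limit update: S(t) = S(t-) * (1 - d / n_at_risk)
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        # Patients censored at t still count as at risk at t, so remove them afterwards
        at_risk -= leaving[t]
    return curve


# Hypothetical toy cohorts (assumed data, not from the study),
# split by expression level of a single candidate gene
high = kaplan_meier([2, 3, 3, 5, 8], [1, 1, 0, 1, 0])
low = kaplan_meier([4, 6, 9, 10, 12], [0, 1, 0, 1, 0])

for label, curve in (("high", high), ("low", low)):
    print(label, [(t, round(s, 3)) for t, s in curve])
```

In practice the two curves would be compared with a log-rank test, and a gene would be retained as a prognostic candidate only if the groups separate significantly; that comparison is omitted here to keep the sketch short.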

Results: We identified 355 DEGs and predicted BC-associated pathways, including kinetochore metaphase signaling, PTEN, senescence, and phagosome-formation pathways. A hub of 28 DEGs and a novel diagnostic nine-gene signature were identified using stringent filter conditions. Similarly, a novel prognostic eight-gene signature was identified using disease-free survival and overall survival analysis. Both gene signatures were validated by a second set of ML methods. Finally, qRT-PCR results confirmed the expression of the identified gene signatures in BC.

Conclusion: The ML approach helped construct novel diagnostic and prognostic models based on BC expression profiling. The identified nine-gene and eight-gene signatures showed excellent potential for BC diagnosis and prognosis, respectively.

Citing Articles

Artificial Intelligence and Breast Cancer Management: From Data to the Clinic.

Feng K, Yi Z, Xu B. Cancer Innov. 2025; 4(2):e159.

PMID: 39981497 PMC: 11840326. DOI: 10.1002/cai2.159.


Identification and validation of CCN family genes to predict the prognosis in gastric cancer.

Chen H, Zhang X, Zhang Z, Li G, Li X, Yang S. Discov Oncol. 2024; 15(1):610.

PMID: 39485579 PMC: 11530581. DOI: 10.1007/s12672-024-01459-2.


Contribution of AurkA/TPX2 Overexpression to Chromosomal Imbalances and Cancer.

Polverino F, Mastrangelo A, Guarguaglini G. Cells. 2024; 13(16).

PMID: 39195284 PMC: 11353082. DOI: 10.3390/cells13161397.


An overview of CCN4 (WISP1) role in human diseases.

Singh K, Oladipupo S. J Transl Med. 2024; 22(1):601.

PMID: 38937782 PMC: 11212430. DOI: 10.1186/s12967-024-05364-8.


AITeQ: a machine learning framework for Alzheimer's prediction using a distinctive five-gene signature.

Ahammad I, Lamisa A, Bhattacharjee A, Jamal T, Arefin M, Chowdhury Z. Brief Bioinform. 2024; 25(4).

PMID: 38877887 PMC: 11179120. DOI: 10.1093/bib/bbae291.

