
Recursive SVM Biomarker Selection for Early Detection of Breast Cancer in Peripheral Blood

Overview
Publisher Biomed Central
Specialty Genetics
Date 2013 Feb 2
PMID 23369435
Citations 20
Abstract

Background: Breast cancer is the second most common type of cancer worldwide, after lung cancer. Traditional mammography and tissue microarrays have been studied for early cancer detection and cancer prediction. However, more reliable diagnostic tools are needed for early detection of breast cancer, and developing them is challenging for several reasons. First, obtaining tissue biopsies can be difficult. Second, mammography may not detect small tumors and is often unsatisfactory for younger women, who typically have dense breast tissue. Lastly, breast cancer is not a single homogeneous disease but consists of multiple disease states, each arising from a distinct molecular mechanism and following a distinct clinical progression path, which makes the disease difficult to detect and predict in its early stages.

Results: We present a Support Vector Machine algorithm based on Recursive Feature Elimination and Cross Validation (SVM-RFE-CV) for early detection of breast cancer in peripheral blood, and show how to use it to model the associated classification and prediction problem. A training set of 32 healthy and 33 cancer samples and a testing set of 31 healthy and 34 cancer samples were randomly drawn from a peripheral blood breast cancer dataset downloaded from the Gene Expression Omnibus. First, we identified 42 biomarkers differentially expressed between "normal" and "cancer" samples. Then, using SVM-RFE-CV, we extracted 15 biomarkers that yield a zero cross-validation score. Lastly, we compared the classification and prediction performance of SVM-RFE-CV with that of SVM and SVM Recursive Feature Elimination (SVM-RFE).
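The general idea of combining recursive feature elimination with cross validation can be sketched as follows. This is only an illustration, assuming scikit-learn's RFECV wrapped around a linear-kernel SVC as a stand-in; the paper's own SVM-RFE-CV implementation may differ in its elimination rule and scoring. The data is synthetic (make_classification), sized to mirror the 42 candidate features and the 65/65 train/test split described above, and the variable names and parameter values (C, number of folds, random seeds) are assumptions, not taken from the paper.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.feature_selection import RFECV
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import StratifiedKFold, train_test_split
from sklearn.svm import SVC

# Synthetic stand-in data: 130 samples, 42 candidate features (NOT the GEO dataset).
X, y = make_classification(
    n_samples=130, n_features=42, n_informative=15, n_redundant=0, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.5, stratify=y, random_state=0
)

# Recursive feature elimination: repeatedly fit a linear SVM and drop the
# feature with the smallest weight; cross validation picks the subset size.
selector = RFECV(
    estimator=SVC(kernel="linear", C=1.0),
    step=1,
    cv=StratifiedKFold(n_splits=5),
    scoring="accuracy",
)
selector.fit(X_train, y_train)
selected = np.flatnonzero(selector.support_)

# Baseline: a linear SVM trained on all 42 features, for comparison.
baseline = SVC(kernel="linear", C=1.0).fit(X_train, y_train)

auc_selected = roc_auc_score(y_test, selector.decision_function(X_test))
auc_all = roc_auc_score(y_test, baseline.decision_function(X_test))

print("features kept:", selector.n_features_, "indices:", selected.tolist())
print("test AUC, selected features:", round(auc_selected, 4))
print("test AUC, all features:", round(auc_all, 4))
```

The numbers printed here depend entirely on the synthetic data and say nothing about the study's results; the point is only the workflow of selecting features on the training set and scoring on a held-out testing set.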

Conclusions: We found that 1) SVM-RFE-CV is suitable for analyzing noisy high-throughput microarray data, 2) it outperforms SVM-RFE in robustness to noise and in the ability to recover informative features, and 3) it can improve prediction performance (Area Under Curve) on the testing data set from 0.5826 to 0.7879. Further pathway analysis showed that the biomarkers are associated with Signaling, Hemostasis, Hormones, and Immune System, which is consistent with previous findings. Our prediction model can serve as a general model for biomarker discovery in early detection of other cancers. In the future, Polymerase Chain Reaction (PCR) validation is planned to confirm the ability of these potential biomarkers to detect breast cancer early.
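For readers unfamiliar with the Area Under Curve metric quoted above, it can be read as the probability that a randomly chosen cancer sample receives a higher classifier score than a randomly chosen healthy sample, with ties counted as one half. The sketch below computes it directly from that definition; the function name roc_auc and the toy labels and scores are illustrative assumptions, not data from the paper.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    labels: 0 (e.g. healthy) or 1 (e.g. cancer); scores: classifier outputs.
    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy example (hypothetical scores): 3 of the 4 positive/negative pairs
# are ordered correctly, so the AUC is 0.75.
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(roc_auc(labels, scores))  # 0.75
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which gives a scale for reading the reported change from 0.5826 to 0.7879.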

Citing Articles

Optimizing hybrid ensemble feature selection strategies for transcriptomic biomarker discovery in complex diseases.

Claude E, Leclercq M, Thebault P, Droit A, Uricaru R NAR Genom Bioinform. 2024; 6(3):lqae079.

PMID: 38993634 PMC: 11237901. DOI: 10.1093/nargab/lqae079.


A Two-Phase Feature Selection Method for Identifying Influential Spreaders of Disease Epidemics in Complex Networks.

Wang X, Han Y, Wang B Entropy (Basel). 2023; 25(7).

PMID: 37510015 PMC: 10378310. DOI: 10.3390/e25071068.


A Boolean-based machine learning framework identifies predictive biomarkers of HSP90-targeted therapy response in prostate cancer.

Shin S, Centenera M, Hodgson J, Nguyen E, Butler L, Daly R Front Mol Biosci. 2023; 10:1094321.

PMID: 36743211 PMC: 9892654. DOI: 10.3389/fmolb.2023.1094321.


Gene expression analysis in endometriosis: Immunopathology insights, transcription factors and therapeutic targets.

Geng R, Huang X, Li L, Guo X, Wang Q, Zheng Y Front Immunol. 2022; 13:1037504.

PMID: 36532015 PMC: 9748153. DOI: 10.3389/fimmu.2022.1037504.


Combination of Serum and Plasma Biomarkers Could Improve Prediction Performance for Alzheimer's Disease.

Zhang F, Petersen M, Johnson L, Hall J, OBryant S Genes (Basel). 2022; 13(10).

PMID: 36292623 PMC: 9601501. DOI: 10.3390/genes13101738.

