» Articles » PMID: 26921029

Regularized Logistic Regression with Network-based Pairwise Interaction for Biomarker Identification in Breast Cancer

Overview
Publisher Biomed Central
Specialty Biology
Date 2016 Feb 28
PMID 26921029
Citations 7
Authors
Affiliations
Soon will be listed here.
Abstract

Background: To facilitate advances in personalized medicine, it is important to detect predictive, stable and interpretable biomarkers related with different clinical characteristics. These clinical characteristics may be heterogeneous with respect to underlying interactions between genes. Usually, traditional methods just focus on detection of differentially expressed genes without taking the interactions between genes into account. Moreover, due to the typical low reproducibility of the selected biomarkers, it is difficult to give a clear biological interpretation for a specific disease. Therefore, it is necessary to design a robust biomarker identification method that can predict disease-associated interactions with high reproducibility.

Results: In this article, we propose a regularized logistic regression model. Different from previous methods which focus on individual genes or modules, our model takes gene pairs, which are connected in a protein-protein interaction network, into account. A line graph is constructed to represent the adjacencies between pairwise interactions. Based on this line graph, we incorporate the degree information in the model via an adaptive elastic net, which makes our model less dependent on the expression data. Experimental results on six publicly available breast cancer datasets show that our method can not only achieve competitive performance in classification, but also retain great stability in variable selection. Therefore, our model is able to identify the diagnostic and prognostic biomarkers in a more robust way. Moreover, most of the biomarkers discovered by our model have been verified in biochemical or biomedical researches.

Conclusions: The proposed method shows promise in the diagnosis of disease pathogenesis with different clinical characteristics. These advances lead to more accurate and stable biomarker discovery, which can monitor the functional changes that are perturbed by diseases. Based on these predictions, researchers may be able to provide suggestions for new therapeutic approaches.

Citing Articles

Artificial intelligence in cancer target identification and drug discovery.

You Y, Lai X, Pan Y, Zheng H, Vera J, Liu S Signal Transduct Target Ther. 2022; 7(1):156.

PMID: 35538061 PMC: 9090746. DOI: 10.1038/s41392-022-00994-0.


Bayesian Gene Selection Based on Pathway Information and Network-Constrained Regularization.

Cao M, Fan Y, Peng Q Comput Math Methods Med. 2021; 2021:7471516.

PMID: 34394707 PMC: 8360753. DOI: 10.1155/2021/7471516.


Age-dependent co-dependency structure of biomarkers in the general population of the United States.

Le Goallec A, Patel C Aging (Albany NY). 2019; 11(5):1404-1426.

PMID: 30822279 PMC: 6428110. DOI: 10.18632/aging.101842.


Elastic net-based prediction of IFN-β treatment response of patients with multiple sclerosis using time series microarray gene expression profiles.

Fukushima A, Sugimoto M, Hiwa S, Hiroyasu T Sci Rep. 2019; 9(1):1822.

PMID: 30755676 PMC: 6372673. DOI: 10.1038/s41598-018-38441-2.


Network-based logistic regression integration method for biomarker identification.

Zhang K, Geng W, Zhang S BMC Syst Biol. 2019; 12(Suppl 9):135.

PMID: 30598085 PMC: 6311907. DOI: 10.1186/s12918-018-0657-8.


References
1.
Pawitan Y, Bjohle J, Amler L, Borg A, Egyhazi S, Hall P . Gene expression profiling spares early breast cancer patients from adjuvant therapy: derived and validated in two population-based cohorts. Breast Cancer Res. 2005; 7(6):R953-64. PMC: 1410752. DOI: 10.1186/bcr1325. View

2.
Sun H, Wang S . Penalized logistic regression for high-dimensional DNA methylation data with case-control studies. Bioinformatics. 2012; 28(10):1368-75. PMC: 3348559. DOI: 10.1093/bioinformatics/bts145. View

3.
Xu J, Li Y . Discovering disease-genes by topological features in human protein-protein interaction network. Bioinformatics. 2006; 22(22):2800-5. DOI: 10.1093/bioinformatics/btl467. View

4.
Hamp T, Rost B . More challenges for machine-learning protein interactions. Bioinformatics. 2015; 31(10):1521-5. DOI: 10.1093/bioinformatics/btu857. View

5.
Kim S, Pan W, Shen X . Network-based penalized regression with application to genomic data. Biometrics. 2013; 69(3):582-93. PMC: 4007772. DOI: 10.1111/biom.12035. View