» Articles » PMID: 25475756

Pathway Activity Inference for Multiclass Disease Classification Through a Mathematical Programming Optimisation Framework

Overview
Publisher Biomed Central
Specialty Biology
Date 2014 Dec 6
PMID 25475756
Citations 5
Authors
Affiliations
Soon will be listed here.
Abstract

Background: Applying machine learning methods on microarray gene expression profiles for disease classification problems is a popular method to derive biomarkers, i.e. sets of genes that can predict disease state or outcome. Traditional approaches where expression of genes were treated independently suffer from low prediction accuracy and difficulty of biological interpretation. Current research efforts focus on integrating information on protein interactions through biochemical pathway datasets with expression profiles to propose pathway-based classifiers that can enhance disease diagnosis and prognosis. As most of the pathway activity inference methods in literature are either unsupervised or applied on two-class datasets, there is good scope to address such limitations by proposing novel methodologies.

Results: A supervised multiclass pathway activity inference method using optimisation techniques is reported. For each pathway expression dataset, patterns of its constituent genes are summarised into one composite feature, termed pathway activity, and a novel mathematical programming model is proposed to infer this feature as a weighted linear summation of expression of its constituent genes. Gene weights are determined by the optimisation model, in a way that the resulting pathway activity has the optimal discriminative power with regards to disease phenotypes. Classification is then performed on the resulting low-dimensional pathway activity profile.

Conclusions: The model was evaluated through a variety of published gene expression profiles that cover different types of disease. We show that not only does it improve classification accuracy, but it can also perform well in multiclass disease datasets, a limitation of other approaches from the literature. Desirable features of the model include the ability to control the maximum number of genes that may participate in determining pathway activity, which may be pre-specified by the user. Overall, this work highlights the potential of building pathway-based multi-phenotype classifiers for accurate disease diagnosis and prognosis problems.

Citing Articles

Interpretable optimisation-based approach for hyper-box classification.

Liapis G, Tsoka S, Papageorgiou L Mach Learn. 2025; 114(3):51.

PMID: 40017483 PMC: 11861270. DOI: 10.1007/s10994-024-06643-7.


Prediction of anticancer drug sensitivity using an interpretable model guided by deep learning.

Pang W, Chen M, Qin Y BMC Bioinformatics. 2024; 25(1):182.

PMID: 38724920 PMC: 11080240. DOI: 10.1186/s12859-024-05669-x.


Optimisation Models for Pathway Activity Inference in Cancer.

Chen Y, Liu S, Papageorgiou L, Theofilatos K, Tsoka S Cancers (Basel). 2023; 15(6).

PMID: 36980673 PMC: 10046797. DOI: 10.3390/cancers15061787.


Enhanced Directed Random Walk for the Identification of Breast Cancer Prognostic Markers from Multiclass Expression Data.

Nies H, Mohamad M, Zakaria Z, Chan W, Remli M, Nies Y Entropy (Basel). 2021; 23(9).

PMID: 34573857 PMC: 8472068. DOI: 10.3390/e23091232.


Predicting Drug Response and Synergy Using a Deep Learning Model of Human Cancer Cells.

Kuenzi B, Park J, Fong S, Sanchez K, Lee J, Kreisberg J Cancer Cell. 2020; 38(5):672-684.e6.

PMID: 33096023 PMC: 7737474. DOI: 10.1016/j.ccell.2020.09.014.


References
1.
Ein-Dor L, Kela I, Getz G, Givol D, Domany E . Outcome signature genes in breast cancer: is there a unique set?. Bioinformatics. 2004; 21(2):171-8. DOI: 10.1093/bioinformatics/bth469. View

2.
Liberzon A, Subramanian A, Pinchback R, Thorvaldsdottir H, Tamayo P, Mesirov J . Molecular signatures database (MSigDB) 3.0. Bioinformatics. 2011; 27(12):1739-40. PMC: 3106198. DOI: 10.1093/bioinformatics/btr260. View

3.
Khan J, Wei J, Ringner M, Saal L, Ladanyi M, Westermann F . Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks. Nat Med. 2001; 7(6):673-9. PMC: 1282521. DOI: 10.1038/89044. View

4.
Lamb R, Ablett M, Spence K, Landberg G, Sims A, Clarke R . Wnt pathway activity in breast cancer sub-types and stem-like cells. PLoS One. 2013; 8(7):e67811. PMC: 3701602. DOI: 10.1371/journal.pone.0067811. View

5.
Sorlie T, Perou C, Tibshirani R, Aas T, Geisler S, Johnsen H . Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications. Proc Natl Acad Sci U S A. 2001; 98(19):10869-74. PMC: 58566. DOI: 10.1073/pnas.191367098. View